Retrieval-Augmented Generation

Good morning! On this pleasant morning I’d like to talk to you about hallucinations.

Not the 1960s kind, though, but a different, modern kind of hallucination related to the field of Artificial Intelligence. Unless you have been living under a rock this past year, you know that modern Large Language Models (LLMs, a subset of Generative AI) are the talk of the town. And one of the most discussed topics about LLMs is their hallucinations.

LLM hallucination is a phenomenon that occurs when a large language model generates text that is factually incorrect or nonsensical. This can happen for a variety of reasons, including:

  • The LLM may be trained on a dataset that contains inaccurate or incomplete information.
  • The LLM may be asked to generate text about a topic that it does not have enough information about (not enough domain knowledge).
  • The LLM may be asked to generate text that is too complex or challenging for it to handle.

You may say, well, that’s very human, right? Right!

When it comes to humans, we call similar behavior "intellectual dishonesty." Intellectual dishonesty refers to the act of deliberately presenting false or misleading information or arguments with the intention of appearing knowledgeable or avoiding being seen as ignorant or unintelligent.

It involves intentionally distorting facts or fabricating information to support one's position or to gain an advantage in a discussion or debate. Intellectual dishonesty can undermine intellectual integrity and hinder genuine learning and understanding. The difference is that LLMs do not do this intentionally: they do it because they are trained in a way that pushes them to produce an answer no matter what, even if they have to fabricate it.

Now, LLM hallucination can be a problem because it can lead to the spread of misinformation and the creation of unrealistic expectations about what LLMs can do (and we can see this happening a lot over the past year, since OpenAI made GPT-3 public). It is important to be aware of the potential for LLM hallucination and to take steps to mitigate it (as a user and as a developer).


Recently I was experimenting with a method called Retrieval-Augmented Generation (RAG) and its application to Enterprise Search, as a way to mitigate the effects of hallucination (which is extremely important in search interpretation). In my mind, this method is one of the key approaches to better LLMs, better domain-specific LLMs, and fine-tuned LLMs.

So what is this method about?

RAG is a technique for improving the quality of text generation by retrieving relevant documents (or facts) from a verified and/or curated knowledge base (KB) and then using those documents to augment the generation process. RAG works like this: first, a prompt is prepared for the text generation model; the prompt is then used to retrieve relevant documents from the KB; the retrieved documents/facts are then used to augment the generation process by providing additional information and context.
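
To make this a bit more concrete, below is a minimal sketch of the retrieval phase in Python. Everything in it is illustrative: the tiny in-memory knowledge base, the bag-of-words similarity, and the "retrieve" helper are stand-ins for the embedding model and vector store a real system would use, but the shape of the step is the same.

from collections import Counter
import math
import re

# Illustrative mini knowledge base; in practice this is a curated document store.
KNOWLEDGE_BASE = [
    "War and Peace is a novel by Leo Tolstoy about Russian society during the Napoleonic era.",
    "The novel follows several aristocratic families through the Napoleonic Wars.",
    "The White Guard is a novel by Mikhail Bulgakov set in Kiev during the Russian Civil War.",
]

def bag_of_words(text: str) -> Counter:
    # Lowercase the text and count alphanumeric tokens.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine_similarity(a: Counter, b: Counter) -> float:
    # Cosine similarity between two bag-of-words vectors.
    dot = sum(a[word] * b[word] for word in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(prompt: str, kb: list[str], top_k: int = 2) -> list[str]:
    # Rank KB entries by similarity to the prompt and return the best ones.
    query = bag_of_words(prompt)
    ranked = sorted(kb, key=lambda doc: cosine_similarity(query, bag_of_words(doc)), reverse=True)
    return ranked[:top_k]

print(retrieve("Write a summary of the book War and Peace.", KNOWLEDGE_BASE))

In a real deployment you would swap the bag-of-words scoring for dense embeddings and an approximate nearest-neighbour index, but the retrieval contract stays the same: prompt in, a handful of relevant facts out.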

RAG has been shown to improve the quality of text generation in a variety of tasks, including summarisation, question answering, and creative writing; basically, most (if not all) of the LLM use cases.

Here is an example of how RAG can be used to improve the quality of a summary:

Prompt phase: Write a summary of the book "War and Peace".
Retrieval phase: The following documents are retrieved from the knowledge base:
* "War and Peace" Wikipedia article
* "War and Peace" Goodreads page
* "War and Peace" Amazon page
* "War and Peace" Britannica page
Augmentation phase: The documents are used to augment the generation process by providing additional information and context. For example, the Wikipedia and Britannica articles provide information about the plot, characters, and setting of the book, plus some additional historical context. The Goodreads page provides information about reviews of the book. The Amazon page provides information about the price of the book and also some additional review information, which could be beneficial (or not :) ).
Generation phase: The text generation model is then used to generate a summary of the book, using the information from the documents as a guide. The generated summary is more accurate and informative than one generated without RAG, because the model is mostly asked to summarise factual data rather than produce a potential hallucination in which it mixes together "War and Peace" and "The White Guard", for example. A minimal sketch of the augmentation and generation phases follows below.
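
As with the retrieval sketch, the names here are illustrative: the "facts" list stands in for whatever the retrieval phase returned, and "call_llm" is a placeholder for whichever model API you actually use, not a real library call.

def build_augmented_prompt(task: str, facts: list[str]) -> str:
    # Prepend the retrieved facts as grounding context, then state the task.
    context = "\n".join(f"- {fact}" for fact in facts)
    return (
        "Use only the facts below. If they are not enough, say so instead of guessing.\n"
        f"Facts:\n{context}\n\n"
        f"Task: {task}"
    )

def call_llm(prompt: str) -> str:
    # Placeholder: swap in a real call to the LLM you use.
    return f"[model output grounded in {prompt.count('- ')} retrieved facts]"

# Facts the retrieval phase would have pulled from Wikipedia, Britannica, etc.
facts = [
    "War and Peace is a novel by Leo Tolstoy that follows several aristocratic families through the Napoleonic Wars.",
    "The novel combines a fictional narrative with Tolstoy's reflections on history and free will.",
]
task = 'Write a summary of the book "War and Peace".'
print(call_llm(build_augmented_prompt(task, facts)))

The instruction at the top of the augmented prompt is the important part: it tells the model to stay within the retrieved facts and to admit when they are insufficient, which is exactly the behaviour that keeps it from mixing "War and Peace" with "The White Guard".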

In a nutshell - RAG is a powerful technique that can be used to improve the quality of text generation. It is a promising new[ish] technology that has the potential to revolutionise the way we generate text. Check out the research paper to get more details on how to use RAG, if this short post sparked your interest.
