What is Retrieval Augmented Generation?
A symbolic representation of an LLM

What is Retrieval Augmented Generation?

Retrieval Augmented Generation — this AI technique uses an external authoritative data source in order to enhance a an LLM's response with accuracy and relevance to the particular task at hand.

RAG will extend the powers of an LLM by giving it specific data to generate a response. Using the RAG technique will ensure that the LLM is informed by the knowledge source of your choice.


How it functions sequentially:

The owner of the data converts the specific data source you want to use into a vector database.

The user creates a prompt.

The RAG takes the user prompt and uses it to search the specific data source you have provided it with.

The RAG performs a relevancy search - the prompt is converted to a vector representation and mapped to the source’s vector database.

Then the prompt is augmented - RAG will use the data it found to create a new augmented version of the prompt.

The augmented prompt is then fed into the LLM to generate a more accurate and contextualized answer…


Properties of RAG:

Citations - RAG provides reference sources that the user can look up.

Relevancy- the RAG data source can be: live social media feeds, news sites, or other frequently updated info sources.

Privacy - Because RAG only uses the data sources that you select, it ensures that sensitive information within the org is not accessed and used to generate responses.

Efficiency - since the initial retrieval phase narrows down the context and reduces the amount of data that needs to be searched, RAG models are more efficient.


Use Cases:

Company Knowledge Base - The most common use case here is for a company to use a knowledge base in order to generate answers for customer service. This is an area for a lot of growth: as company’s leverage AI with customer service, they can use RAG to give agents very specific context.

RAG-Powered Textbooks - Another cool use case is studying with this. I would love a textbook that has an LLM with RAG connected to it and allows me to ask it questions. Of course, there are risks here, and I would have to go back and look through the text to ensure 100% accuracy of any answers generated… but many RAG’s today provide the source (or page number) that it retrieved its answers from.


Bio

Mattias is a systems thinker who drives innovation and builds character. He is also well-versed in Cybersecurity and can help you protect your organization's AI models. Contact him today to chat!


Sources:

https://www.markovml.com/blog/retrieval-augmented-generation

https://www.ibm.com/docs/en/watsonx/saas?topic=solutions-retrieval-augmented-generation


Eduardo A. Franceschi

Aerospace Engineering Student at University of Illinois at Urbana-Champaign

9 个月

Informative article!

要查看或添加评论,请登录

Mattias Acosta的更多文章

  • Learn How AI Agents Use Knowledge to Make Smarter, Faster Decisions (Without Jargon)

    Learn How AI Agents Use Knowledge to Make Smarter, Faster Decisions (Without Jargon)

    Knowledgeable people run the world. But what if AI had the capacity to hold and utilize knowledge? Spoiler: It already…

  • Medicine in the Age of AI

    Medicine in the Age of AI

    Medicine is evolving. Every day, we understand more about our bodies at the cellular, physiological, and psychological…

  • ML vs Deep Learning vs Generative AI

    ML vs Deep Learning vs Generative AI

    Machine Learning, Deep Learning, and Generative AI are accelerating business. It is important to understand them so…

    15 条评论
  • Learn how to create a strategy and organize your life

    Learn how to create a strategy and organize your life

    Cal Newport is a successful distributed computing theoretician and productivity writer. Recently, I have been reading…

  • How to Secure your Data

    How to Secure your Data

    A world that leverages data to do just about everything needs infrastructure. The most competitive companies of our…

    8 条评论
  • An Adventure to the Tallest Waterfall on the Planet

    An Adventure to the Tallest Waterfall on the Planet

    This is not one of my traditional posts. This is an adventure.

    1 条评论
  • Get an Automated Sensitive Data Inventory for your Cloud

    Get an Automated Sensitive Data Inventory for your Cloud

    As you and your development team continue building out your cloud infrastructure to manage your company’s compute, you…

    1 条评论
  • What is an LLM?

    What is an LLM?

    LLM’s learn patterns from giant datasets of media and produce multi-modal content. An LLM is a kind of neural network…

    1 条评论
  • Why is Reading Valuable?

    Why is Reading Valuable?

    Reading gives us the perspective of others, bringing us thoughts, experiences, and knowledge. It is a medium through…

    1 条评论
  • What is a computer network?

    What is a computer network?

    Without data pathways connecting networks of computers, computing would be a very solitary enterprise. A network is a…

社区洞察

其他会员也浏览了