Basic RAG (Retrieval-Augmented Generation) Model
Image Credit: https://d3ddy8balm3goa.cloudfront.net/llamaindex/rag-cheat-sheet-final.svg


Problems with Pre-trained LLMs:

  • Hallucinations: LLMs can generate plausible-sounding but incorrect or nonsensical answers. This happens because these models generate text from patterns learned during training rather than by verifying facts.
  • Limited Scope of Training Corpus: LLMs might not have encountered certain information during training, especially if the information is domain-specific or has been updated after the model's training period.
  • Lack of Access to Latest Information: Since LLMs are static once trained, they do not have real-time access to new information or updates. This can lead to answers that are outdated or irrelevant in the current context.

Basic RAG Structure:

Components:

  1. Question: The input question posed by the user.
  2. Retriever: A component that searches for and retrieves the most relevant external documents that may contain information necessary to answer the user's question.
  3. External Knowledge: The set of documents retrieved by the retriever, which are considered relevant to the question.
  4. Generator (LLM): The language model that uses the information from the retrieved documents to generate an answer.
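The four components above can be sketched as a minimal pipeline. This is an illustrative toy, not a production implementation: the retriever and generator here are placeholder lambdas standing in for a real vector store and a real LLM call.

```python
# Minimal sketch of the four RAG components wired together.
# The retriever and generator are toy stand-ins (assumptions),
# not real search or LLM components.
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class RAGPipeline:
    # question -> retrieved documents (external knowledge)
    retriever: Callable[[str], List[str]]
    # (question, documents) -> generated answer
    generator: Callable[[str, List[str]], str]

    def answer(self, question: str) -> str:
        external_knowledge = self.retriever(question)          # retrieve
        return self.generator(question, external_knowledge)    # generate

# Toy corpus and placeholder components:
docs = ["RAG grounds LLM answers in retrieved documents."]
pipeline = RAGPipeline(
    retriever=lambda q: [d for d in docs if "RAG" in q or "RAG" in d],
    generator=lambda q, ctx: f"Based on {len(ctx)} document(s): {ctx[0]}",
)
print(pipeline.answer("What is RAG?"))
```

In a real system the retriever would query an embedding index and the generator would be a prompted LLM; the shape of the data flow (question in, documents retrieved, grounded answer out) is the same.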

Process:

  1. User's Question: The process begins with a question from the user.
  2. Retrieving Relevant Documents: The retriever component analyzes the user's question and searches a database or corpus for documents that are likely to contain relevant information. This is typically done using techniques like:
     • Vector Search: representing documents and queries as vectors and finding the closest matches.
     • Keyword Matching: using keywords from the question to find matching documents.

  3. Compiling External Knowledge: The retriever compiles a set of top-k documents that are deemed most relevant. These documents form the external knowledge base.
  4. Generating the Answer: The generator (LLM) takes these retrieved documents as context and generates an answer. This allows the LLM to produce answers that are grounded in up-to-date and specific information from the external documents.
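The retrieval step can be made concrete with a toy vector search. The sketch below uses bag-of-words counts and cosine similarity purely for illustration; a real retriever would use dense embeddings from an embedding model and an approximate nearest-neighbor index.

```python
# Toy top-k retrieval via bag-of-words cosine similarity.
# Illustrative only: real systems use learned embeddings and ANN indexes.
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    """Very crude 'embedding': word-count vector."""
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve_top_k(question: str, corpus: list, k: int = 2) -> list:
    """Return the k documents most similar to the question."""
    q_vec = embed(question)
    ranked = sorted(corpus, key=lambda d: cosine(q_vec, embed(d)), reverse=True)
    return ranked[:k]

corpus = [
    "The retriever finds relevant documents.",
    "The generator produces the final answer.",
    "Bananas are rich in potassium.",
]
print(retrieve_top_k("Which component finds relevant documents?", corpus, k=1))
```

The retrieved top-k documents are then placed into the LLM's prompt as context for the generation step.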

Key Abilities:

  1. Noise Robustness: The model's ability to handle and filter out irrelevant or noisy information within the retrieved documents. It ensures that the generator uses only the most pertinent information.
  2. Negative Rejection: The ability of the model to recognize when it does not have sufficient information to answer a question accurately and therefore refrain from providing a misleading or incorrect answer.
  3. Information Integration: The capacity to synthesize information from multiple sources and create a coherent and comprehensive answer, particularly useful for complex questions that require diverse pieces of information.
  4. Counterfactual Robustness: The ability to detect and handle known errors or contradictions within the retrieved documents, ensuring that such misinformation does not influence the generated answer.
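Of the abilities above, negative rejection is the easiest to sketch in code: if no retrieved document clears a relevance threshold, the system declines to answer instead of risking a hallucination. The threshold value and the refusal wording below are assumptions; in practice both would be tuned empirically.

```python
# Sketch of negative rejection: refuse to answer when retrieval
# confidence is low. Threshold and messages are illustrative assumptions.
def answer_or_reject(question, scored_docs, threshold=0.5):
    """scored_docs: list of (similarity_score, document) pairs."""
    best_score, best_doc = max(scored_docs, key=lambda p: p[0],
                               default=(0.0, None))
    if best_score < threshold:
        return "I don't have enough information to answer that."
    return f"According to the retrieved context: {best_doc}"

# Hypothetical retriever outputs:
print(answer_or_reject("Who won in 2090?", [(0.12, "Unrelated text.")]))
print(answer_or_reject("What is RAG?",
                       [(0.91, "RAG grounds answers in documents.")]))
```

The same gate generalizes to the other abilities: noise robustness filters low-scoring documents out of the context, and counterfactual robustness would additionally check retrieved passages against each other for contradictions.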

Quality Scores:

  1. Context Relevance: The retrieved context must be directly relevant to the user's question. This ensures that the documents used to generate the answer are applicable to the query.
  2. Answer Relevance: The generated answer must address the user's question directly and appropriately. It should not deviate from the topic or provide extraneous information.
  3. Faithfulness: The generated answer must remain faithful to the information contained in the retrieved documents. It should accurately reflect the content without introducing distortions or inaccuracies.
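The three quality scores can be approximated crudely with token overlap, as sketched below. This is only a toy proxy: real RAG evaluators (e.g. RAGAS-style metrics) use LLM judges or embedding similarity rather than word overlap.

```python
# Toy proxies for the three RAG quality scores using Jaccard token
# overlap. Illustrative only; production evaluation uses LLM judges
# or embedding-based similarity.
import re

def tokens(text: str) -> set:
    return set(re.findall(r"\w+", text.lower()))

def overlap(a: str, b: str) -> float:
    ta, tb = tokens(a), tokens(b)
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0

question = "What does the retriever do?"
context  = "The retriever searches for relevant documents."
answer   = "The retriever searches documents for relevant information."

scores = {
    "context_relevance": overlap(question, context),  # context vs. question
    "answer_relevance":  overlap(question, answer),   # answer vs. question
    "faithfulness":      overlap(answer, context),    # answer vs. context
}
print(scores)
```

Here faithfulness scores highest because the answer reuses the context's wording, which matches the intuition: a faithful answer stays close to the retrieved evidence.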

High-Level Requirements for Success:

  • Effective Retrieval: The retriever must be proficient at finding the most relevant documents that contain the necessary information to answer the question.
  • Generation Utilization: The generator must be capable of effectively using the retrieved documents to produce a coherent, accurate, and relevant answer.

Conclusion:

The Basic RAG model enhances the capabilities of LLMs by addressing their limitations with hallucinations, outdated information, and lack of real-time data access. By integrating a retrieval step, the model ensures that answers are grounded in the most relevant and current information available. This approach significantly improves the accuracy, relevance, and faithfulness of the generated responses, making it a powerful tool for various question-answering tasks.
