How We Increased Search Accuracy for a RAG-Based GPT from 65% to 90%
Retrieval Augmented Generation (RAG) represents a groundbreaking approach to information retrieval, where the accuracy of search results directly influences the quality of generated answers. In essence, RAG combines traditional search mechanisms with a Large Language Model's ability to understand and generate answers. Search accuracy is particularly significant because the answers generated by RAG are only as accurate as the documents it retrieves.
In this article, we will explore how we improved search accuracy for a RAG application from 65% with basic text search to over 90%.
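To make the pattern concrete, here is a minimal sketch of the RAG flow in Python, assuming the OpenAI chat completions client; the search_documents helper and the model name are illustrative placeholders, and the rest of this article is about making that retrieval step as accurate as possible.

```python
# Minimal sketch of the RAG pattern: retrieve supporting documents first,
# then ask the LLM to answer using only that retrieved context.
from openai import OpenAI

client = OpenAI()

def search_documents(question: str, top: int = 5) -> list[str]:
    # Hypothetical retrieval step; replace with a real search backend.
    return ["<retrieved document text>"] * top

def answer_with_rag(question: str) -> str:
    context = "\n\n".join(search_documents(question))
    response = client.chat.completions.create(
        model="gpt-4",  # any chat-completion model works here
        messages=[
            {"role": "system",
             "content": "Answer the question using only this context:\n" + context},
            {"role": "user", "content": question},
        ],
    )
    return response.choices[0].message.content
```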
The Initial Framework: Setting the Stage for Advanced Search
Our initial setup for testing and improving accuracy involved several key components:
This foundational structure was essential for our subsequent enhancements.
Basic Text Search: The Starting Point
The initial approach was basic text search, a straightforward method:
How Basic Text Search Operates
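As a rough illustration, a plain keyword query against Azure Cognitive Search might look like the sketch below, using the azure-search-documents Python SDK; the index name, field name, and credentials are placeholders rather than our actual configuration.

```python
# Sketch of plain full-text (BM25) search against Azure Cognitive Search.
from azure.core.credentials import AzureKeyCredential
from azure.search.documents import SearchClient

search_client = SearchClient(
    endpoint="https://<your-service>.search.windows.net",
    index_name="docs-index",                 # illustrative index name
    credential=AzureKeyCredential("<api-key>"),
)

def basic_text_search(question: str, top: int = 5) -> list[str]:
    # The raw question is matched against indexed terms and scored with BM25.
    results = search_client.search(search_text=question, top=top)
    return [doc["content"] for doc in results]
```

With this baseline, relevance depends entirely on the user's wording overlapping with the document wording, which is where the later refinements come in.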
Initial Results
Though basic text search was a good starting point, it was clear that more sophisticated methods were needed.
Implementing Search Term Expansion: Enhancing Queries
To improve upon basic text search, we introduced Search Term Expansion with the following approach:
Integrating Search Term Expansion was a key move in bridging the gap between simple queries and the complex content within our documents.
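As an illustration, one common way to implement this kind of expansion is to ask an LLM for related terms and append them to the keyword query; the prompt and model below are assumptions for the sketch, not necessarily what we ran in production.

```python
# Sketch of search-term expansion: generate related keywords for the user's
# question and fold them into the text query so the keyword search can match
# more of the document wording.
from openai import OpenAI

client = OpenAI()

def expand_search_terms(question: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{
            "role": "user",
            "content": ("List up to 5 search keywords or synonyms, comma separated, "
                        f"that would help find documents answering: {question}"),
        }],
    )
    expanded = response.choices[0].message.content
    # Run the text search with the original question plus the expanded terms.
    return f"{question} {expanded.replace(',', ' ')}"
```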
Semantic Reranking: The Leap to Contextual Understanding
While text search is great for finding an initial set of documents, it often lacks a contextual understanding of the question. As a result, the relevance scores of the returned documents, typically computed with BM25 or RRF for text-based searches, are often not accurate. To solve this, we enabled the Semantic Ranking feature in Cognitive Search, which analyzes the initial set of documents returned by the search and re-ranks them using its natural language understanding of the query and the document content.
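In query terms, enabling the semantic ranker is mostly a matter of switching the query type and pointing at a semantic configuration defined on the index. A hedged sketch, reusing the search_client from the basic text search example and assuming a recent azure-search-documents SDK with a semantic configuration named "default":

```python
# Sketch of the same query with Azure Cognitive Search semantic ranking enabled.
# `search_client` is the SearchClient built in the basic text search sketch;
# "default" must match a semantic configuration defined on the index.
def semantic_search(question: str, top: int = 5) -> list[str]:
    results = search_client.search(
        search_text=question,
        query_type="semantic",
        semantic_configuration_name="default",
        top=top,
    )
    # Results come back ordered by the semantic reranker score
    # (@search.rerankerScore) rather than by the raw BM25 score alone.
    return [doc["content"] for doc in results]
```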
Impact on Accuracy
Final Refinement: Incorporating Sample Questions
The last refinement involved adding sample questions to documents:
Enhancing Document Relevance
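One way to picture this refinement: each document is enriched offline with a few questions it can answer, stored in a searchable field so that question-style user queries have question-style text to match against. The sketch below reuses the client and search_client from the earlier examples; the sample_questions field name, the prompt, and the example document are illustrative and assume the index schema includes such a field.

```python
# Sketch of enriching documents with sample questions before (re)indexing.
def generate_sample_questions(content: str, n: int = 3) -> list[str]:
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{
            "role": "user",
            "content": f"Write {n} short questions that this document answers:\n\n{content}",
        }],
    )
    lines = response.choices[0].message.content.splitlines()
    return [line.strip() for line in lines if line.strip()]

documents = [
    {"id": "1", "content": "Our refund policy allows returns within 30 days of purchase..."},
]
for doc in documents:
    # Store the generated questions in a searchable field alongside the content.
    doc["sample_questions"] = " ".join(generate_sample_questions(doc["content"]))

# Re-upload the enriched documents so the sample questions participate in
# both keyword matching and semantic reranking.
search_client.upload_documents(documents)
```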
Accuracy Improvement
What didn't make the cut
In addition to the features that enhanced our search accuracy, we evaluated other functionalities but ultimately chose not to implement them. These included:
Final Thoughts: Striking the Right Balance
Our efforts to boost RAG's search result accuracy from 65% to 90% were marked by innovation and learning.
Through exploring various methods and understanding their trade-offs, we achieved a balance that significantly enhances the precision and efficiency of information retrieval. This journey highlights a significant step forward in search technology and its application in the business world.