Advanced RAG: Exploring Query Rewriting

Kulbir Minhas

My next adventure - Entrepreneur and Skilled AI/ML professional with 20+ years of experience in design, development, and project management.

Published: March 13, 2024

Sharing a summary of an article I read about RAG, a wonderful piece by Florian. Some of these principles seem very natural and are already part of normal conversation; the techniques below perhaps expand them into distinct methodologies for LLMs.

Direct Link (Members Only) - https://medium.com/@florian_algo/advanced-rag-06-exploring-query-rewriting-23997297f2d1

Advanced RAG: Exploring Query Rewriting

This document focuses on query rewriting techniques within Retrieval-Augmented Generation (RAG) systems. RAG aims to improve the accuracy of responses generated by large language models (LLMs) by incorporating retrieval of relevant documents before generation. However, a key challenge is ensuring the semantics of the user's query align with the retrieved documents. Query rewriting techniques address this challenge by reformulating the original query.

Here's a breakdown of the key areas and findings explored in the document:

1. HyDE (Hypothetical Document Embeddings)

Aims to bridge the semantic gap between query and documents by generating hypothetical documents that represent the user's intent.
The LLM is used to create these hypothetical documents, which are then embedded alongside the original query for retrieval.
While HyDE can improve retrieval quality, it may not always generate accurate hypotheticals if the LLM lacks knowledge on the topic.
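To make the HyDE flow above concrete, here is a minimal sketch. It assumes a toy bag-of-words embedding and a stubbed LLM; the names hyde_retrieve, embed, cosine and fake_llm are hypothetical and do not come from LangChain or LlamaIndex. A real system would swap in a dense encoder and an actual model call.

```python
import math
import re
from collections import Counter
from typing import Callable, List


def embed(text: str) -> Counter:
    """Toy bag-of-words embedding (assumption: stands in for a dense encoder)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def hyde_retrieve(
    query: str,
    corpus: List[str],
    generate: Callable[[str], str],
    n_docs: int = 2,
    top_k: int = 1,
) -> List[str]:
    # 1. Ask the LLM for hypothetical documents that would answer the query.
    hypotheticals = [generate(query) for _ in range(n_docs)]
    # 2. Embed them alongside the original query and pool the vectors.
    #    Summing is equivalent to averaging here, since cosine ignores scale.
    pooled: Counter = Counter()
    for text in [query, *hypotheticals]:
        pooled.update(embed(text))
    # 3. Rank real documents by similarity to the pooled vector.
    ranked = sorted(corpus, key=lambda d: cosine(pooled, embed(d)), reverse=True)
    return ranked[:top_k]


def fake_llm(query: str) -> str:
    """Stub standing in for a real LLM call."""
    return "Bees make honey from flower nectar stored in the hive."


corpus = [
    "Honey is produced by bees from the nectar of flowers.",
    "Sweet tooth? Here are ten dessert recipes.",
]
result = hyde_retrieve(
    "Where does the sweet stuff in a hive come from?", corpus, fake_llm
)
print(result)
```

The hypothetical passage contributes terms such as "bees", "honey" and "nectar" that the query itself lacks, which is the semantic gap HyDE tries to close. If the stub returned a wrong or off-topic passage, the pooled vector would drift accordingly, which mirrors the limitation noted above.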

2. Rewrite-Retrieve-Read

This approach emphasizes rewriting the original query before retrieval.
The LLM is used to reformulate the query to better suit information retrieval.
This method is effective in improving the retrieved documents' relevance to the user's intent.
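The three stages named in Rewrite-Retrieve-Read can be sketched as a small pipeline. Everything below is illustrative: rewrite_retrieve_read and the three stub functions are hypothetical names, the rewriter is a rule-based stand-in for an LLM prompt, and the retriever is a naive keyword match rather than a search engine or vector store.

```python
import re
from typing import Callable, Dict, List


def rewrite_retrieve_read(
    question: str,
    rewrite: Callable[[str], str],
    retrieve: Callable[[str], List[str]],
    read: Callable[[str, List[str]], str],
) -> Dict[str, object]:
    """Run the three stages in order and keep a trace of each step."""
    search_query = rewrite(question)      # Rewrite: reformulate for retrieval
    documents = retrieve(search_query)    # Retrieve: search with the rewritten query
    answer = read(question, documents)    # Read: answer the ORIGINAL question
    return {"search_query": search_query, "documents": documents, "answer": answer}


# Stubs below are assumptions for the demo; a real rewriter and reader would be LLM calls.
FILLER = {"hey", "so", "i", "was", "wondering", "what", "is", "the", "of"}
CORPUS = ["Paris is the capital of France.", "France borders Spain."]


def stub_rewrite(question: str) -> str:
    words = re.findall(r"[a-z]+", question.lower())
    return " ".join(w for w in words if w not in FILLER)


def stub_retrieve(query: str) -> List[str]:
    terms = query.split()
    return [d for d in CORPUS if all(t in d.lower() for t in terms)]


def stub_read(question: str, documents: List[str]) -> str:
    return documents[0] if documents else "No supporting document found."


trace = rewrite_retrieve_read(
    "Hey, so I was wondering, what is the capital of France?",
    stub_rewrite,
    stub_retrieve,
    stub_read,
)
print(trace)
```

Note that the reader receives the original question rather than the rewritten query: the rewrite only exists to serve retrieval, so the user's wording is preserved for the final answer.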

3. Step-Back Prompting

This technique aims to improve the LLM's reasoning abilities for complex queries.
It involves reformulating the query into a more abstract or broader question that is easier for the LLM to answer.
The LLM then uses this broader understanding to answer the original, more specific query.
This method is helpful for queries with intricate details that might hinder the LLM's reasoning process.
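A rough sketch of the step-back flow described above follows. The prompt wording, the function name step_back_answer, and the choice to retrieve for both the broad and the specific question are assumptions made for illustration; the LLM and retriever are scripted stubs so the control flow can be inspected.

```python
from typing import Callable, List

# Prompt texts are illustrative assumptions, not taken from any library.
STEP_BACK_PROMPT = (
    "Rewrite the question as a more general one about the underlying concept.\n"
    "Question: {question}\nGeneral question:"
)
ANSWER_PROMPT = (
    "Background:\n{background}\n\n"
    "Using the background, answer the question.\n"
    "Question: {question}\nAnswer:"
)


def step_back_answer(
    question: str,
    llm: Callable[[str], str],
    retrieve: Callable[[str], List[str]],
) -> str:
    # 1. Abstract the specific question into a broader one.
    broad_question = llm(STEP_BACK_PROMPT.format(question=question)).strip()
    # 2. Gather background for both questions, dropping duplicates in order.
    passages = list(dict.fromkeys(retrieve(broad_question) + retrieve(question)))
    # 3. Answer the original, specific question with the broader background.
    return llm(ANSWER_PROMPT.format(background="\n".join(passages), question=question))


calls: List[str] = []


def scripted_llm(prompt: str) -> str:
    """Stub LLM: records each prompt and returns canned text."""
    calls.append(prompt)
    if prompt.startswith("Rewrite the question"):
        return "What is the ideal gas law?"
    return "(answer text from the model)"


def toy_retrieve(query: str) -> List[str]:
    """Stub retriever: returns one placeholder passage per query."""
    return [f"passage about: {query}"]


answer = step_back_answer(
    "What happens to pressure if temperature doubles and volume is fixed?",
    scripted_llm,
    toy_retrieve,
)
print(calls[1])
```

Printing the second prompt shows that the final call sees background gathered for the broader question as well as the specific one.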

4. Query2Doc

This approach focuses on query expansion using the LLM to generate pseudo-documents relevant to the query.
These pseudo-documents are then combined with the original query to create a new, more informative query for retrieval.
While not implemented in LangChain or LlamaIndex at the time of writing, it offers an alternative to HyDE's method.
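Since no library implementation is referenced, the expansion step can be sketched by hand. This is an assumption-laden illustration: query2doc_expand is a hypothetical name, the prompt text is made up, and both the repeat factor (which keeps the short query from being drowned out by the longer pseudo-document in term-based retrieval) and the "[SEP]" joiner for dense retrieval are assumed defaults rather than anything the summary specifies.

```python
from typing import Callable


def query2doc_expand(
    query: str,
    generate: Callable[[str], str],
    query_repeats: int = 5,
    dense: bool = False,
) -> str:
    """Combine the original query with an LLM-written pseudo-document."""
    prompt = f"Write a passage that answers the given query.\nQuery: {query}\nPassage:"
    pseudo_doc = generate(prompt).strip()
    if dense:
        # Dense retrievers: one string, query and pseudo-document separated.
        return f"{query} [SEP] {pseudo_doc}"
    # Sparse retrievers: repeat the query to up-weight its terms.
    return " ".join([query] * query_repeats + [pseudo_doc])


def stub_generate(prompt: str) -> str:
    """Stub standing in for a real LLM call."""
    return "Zebras are striped."


sparse_query = query2doc_expand("zebra stripes", stub_generate, query_repeats=2)
dense_query = query2doc_expand("zebra stripes", stub_generate, dense=True)
print(sparse_query)
print(dense_query)
```

The contrast with HyDE is that the pseudo-document here expands the query text itself before retrieval, rather than being embedded as a stand-in document.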

5. ITER-RETGEN

This method iteratively refines retrieval and generation within a Retrieve-Read-Retrieve-Read loop.
It leverages the generated content from previous iterations to guide retrieval in subsequent iterations, aiming to improve the relevance of retrieved documents and the quality of the final response.
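The loop can be sketched as follows, assuming the retrieval query for each round is simply the previous draft concatenated with the question. The names iter_retgen, KB, toy_retrieve and echo_generate are hypothetical, the knowledge base is invented toy data, and the generator just echoes its evidence instead of calling a model.

```python
from typing import Callable, Dict, List


def iter_retgen(
    question: str,
    retrieve: Callable[[str], List[str]],
    generate: Callable[[str, List[str]], str],
    iterations: int = 2,
) -> str:
    """Alternate retrieval and generation, feeding each draft into the next retrieval."""
    draft = ""
    for _ in range(iterations):
        # The previous draft (empty on the first round) guides retrieval.
        retrieval_query = f"{draft} {question}".strip()
        passages = retrieve(retrieval_query)
        draft = generate(question, passages)
    return draft


# Toy knowledge base keyed by a trigger term (made-up data for the demo).
KB: Dict[str, str] = {
    "alice": "Alice wrote the novel Bluebird.",
    "bluebird": "Bluebird was published in Oslo.",
}


def toy_retrieve(query: str) -> List[str]:
    """Return every passage whose trigger term appears in the query."""
    return [text for key, text in KB.items() if key in query.lower()]


def echo_generate(question: str, passages: List[str]) -> str:
    """Stub generator: a real LLM would write an answer from the passages."""
    return " ".join(passages)


one_round = iter_retgen("Where was the novel by Alice published?", toy_retrieve, echo_generate, iterations=1)
two_rounds = iter_retgen("Where was the novel by Alice published?", toy_retrieve, echo_generate, iterations=2)
print(one_round)
print(two_rounds)
```

In this toy setup the first round can only find the passage about Alice; the title it surfaces then lets the second round retrieve the publication passage, which is the kind of gain the iteration is meant to provide.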

Overall, the document explores various query rewriting techniques that enhance the effectiveness of RAG systems. By reformulating user queries to better align with the document space, these techniques can significantly improve the accuracy and relevance of the information retrieved and ultimately the quality of the responses generated by LLMs.
