RAG Document Search Done Right

Julian Seidenberg (PhD)

Head of Artificial Intelligence at Datch Ltd and Narrative Ltd

Published: July 31, 2024

Sarah is a senior enterprise applications manager at a large multinational corporation, tasked with leveraging the latest technologies on the market to improve access to information for business users. Company knowledge is buried deep in many thousands of documents in the company intranet. Finding anything is extremely difficult unless you know the correct magic phrase to enter into the traditional search engine. Sarah hears about Retrieval-Augmented Generation (RAG) and Large Language Models (LLM) as advanced new technologies that might help solve the problem. She starts a project to build a prototype.

After months of work, the prototype is ready. It’s initially impressive, but workers quickly start complaining. The tool is too slow, inconsistent, buggy, and hard to update. It misses key documents, and no one trusts anything it says. Disillusioned, Sarah cancels the project. Is Generative AI overhyped?

Understanding RAG

When a user asks a question, RAG search uses semantic search to find relevant information in a large corpus of unstructured data. It then augments a Large Language Model prompt with that information to provide a natural language answer. It differs from traditional keyword search because it can find information related to a user’s question, even if none of the words in the question appear in the indexed documents.
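The retrieve-then-augment flow described above can be sketched in a few lines. This is a toy illustration, not Datch's implementation: the `CONCEPTS` table is a hypothetical stand-in for a learned embedding model, and `embed`, `retrieve`, and `build_prompt` are names made up for this example. It shows how a question can match a document with which it shares no words.

```python
import math

# Hypothetical concept table standing in for a real embedding model.
CONCEPTS = {
    "vacation": "leave", "holiday": "leave", "pto": "leave",
    "laptop": "hardware", "computer": "hardware",
    "expenses": "expenses", "reimbursed": "expenses",
}
DIMENSIONS = sorted(set(CONCEPTS.values()))


def embed(text):
    """Map text to a unit-length vector of concept counts."""
    counts = [0.0] * len(DIMENSIONS)
    for word in text.lower().split():
        concept = CONCEPTS.get(word.strip(".,?!"))
        if concept:
            counts[DIMENSIONS.index(concept)] += 1.0
    norm = math.sqrt(sum(c * c for c in counts))
    return [c / norm for c in counts] if norm else counts


def retrieve(question, documents, top_k=1):
    """Return up to top_k documents ranked by cosine similarity to the question."""
    q = embed(question)
    scored = [(sum(a * b for a, b in zip(q, embed(d))), d) for d in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for score, d in scored[:top_k] if score > 0]


def build_prompt(question, passages):
    """Augment the LLM prompt with the retrieved passages."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )


DOCS = [
    "Employees accrue 20 days of PTO per year.",
    "Travel expenses are reimbursed within 30 days.",
]
question = "How much vacation do I get?"
passages = retrieve(question, DOCS)
prompt = build_prompt(question, passages)
```

The question and the matching document share no words; they meet only in the concept space, which is the property a real embedding model provides at much larger scale.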

Why is it not straightforward?

Datch offers a mature RAG-powered search, enhanced by many improvement iterations. Sarah’s prototype was missing many key refinements:

Understanding Images and Parsing Tables: Beyond text, many documents contain valuable information in images and tables. A robust RAG system must be capable of interpreting and extracting data from these non-text elements to provide comprehensive search results.

Expanding User Queries: Users are used to keyword search. Semantic search works best when queries are phrased as natural-language questions, which users would otherwise have to learn to write. A good RAG system should be able to expand and enrich a user’s keyword query, understanding the intent behind the keywords and rewriting the query to provide the most relevant results.
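As a rough sketch of query expansion, the example below enriches a keyword query with related terms before it is sent to the search backend. The `EXPANSIONS` table and both function names are hypothetical; a production system would more likely ask an LLM to rewrite the query, which this lookup table only imitates.

```python
# Hypothetical expansion table; a real system would likely use an LLM here.
EXPANSIONS = {
    "pump": ["pump", "impeller", "centrifugal pump"],
    "leak": ["leak", "leaking", "seal failure"],
}


def expand_query(keywords):
    """Return the original terms plus related terms, order preserved, no duplicates."""
    expanded = []
    for term in keywords.lower().split():
        for candidate in EXPANSIONS.get(term, [term]):
            if candidate not in expanded:
                expanded.append(candidate)
    return expanded


def rewrite_as_question(keywords):
    """Turn a keyword query into a natural-language question for semantic search."""
    return f"What do our documents say about: {', '.join(expand_query(keywords))}?"
```

Calling `rewrite_as_question("pump leak")` produces a single question that mentions the original keywords and their related terms, which gives semantic search more intent to work with than two bare words.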

Switching Between Full-Text and Semantic Search: Depending on the query, either traditional full-text search or semantic search may be more appropriate. An intelligent RAG system can dynamically switch between these methods based on what will return the best results.
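One simple way to switch between the two methods is a rule-based router. The heuristics below (quoted phrases and identifier-like tokens go to full-text search, everything else to semantic search) are assumptions chosen for illustration, not the rules any particular product uses.

```python
def looks_like_identifier(token):
    """True for tokens mixing letters and digits, e.g. a work-order or part number."""
    return any(c.isalpha() for c in token) and any(c.isdigit() for c in token)


def choose_search_mode(query):
    """Return "full_text" for exact-match style queries, otherwise "semantic"."""
    if '"' in query:
        # A quoted phrase signals that the user wants an exact match.
        return "full_text"
    if any(looks_like_identifier(token) for token in query.split()):
        # Identifiers carry no semantic meaning; exact matching finds them reliably.
        return "full_text"
    return "semantic"
```

A more sophisticated system could run both searches and merge the ranked results instead of picking one.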

Drag and Drop Document Upload: Ease of use is critical. A user-friendly interface with drag-and-drop document upload capabilities ensures that users can easily add new documents to the system without technical barriers.

Consistent Search Results: Consistency is key to user trust. A well-designed RAG system should deliver consistent answers when the user asks the same question.
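One way to make repeated questions return the same answer is to cache answers under a normalized form of the question. The sketch below assumes a hypothetical `answer_fn` that runs the full RAG pipeline; the class and its normalization rules are illustrative only.

```python
class AnswerCache:
    """Return the stored answer when the same question is asked again."""

    def __init__(self, answer_fn):
        self._answer_fn = answer_fn  # hypothetical function running the RAG pipeline
        self._cache = {}

    @staticmethod
    def normalize(question):
        """Ignore case, extra whitespace, and trailing punctuation."""
        return " ".join(question.lower().split()).rstrip("?!. ")

    def ask(self, question):
        key = self.normalize(question)
        if key not in self._cache:
            self._cache[key] = self._answer_fn(question)
        return self._cache[key]
```

A cache like this also skips the slow retrieval and generation steps on repeat questions, but it would need to be cleared whenever the indexed documents change.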

Removing Hallucinations: LLMs tend to make up convincing incorrect information if they don’t know the answer to a question. Users naturally lose trust in such systems if they cannot trust the answers they provide. A well-designed RAG system mitigates the hallucination problem by ensuring answers are always based on the information in the source documents.
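A common mitigation pattern, sketched here under stated assumptions, is to refuse to answer when retrieval finds nothing relevant and to instruct the model to answer only from the retrieved passages. `call_llm` is a hypothetical function that sends a prompt to a model, and the `min_score` threshold of 0.5 is an arbitrary value chosen for the example.

```python
NO_ANSWER = "I could not find this in the indexed documents."


def answer_or_refuse(question, scored_passages, call_llm, min_score=0.5):
    """scored_passages is a list of (relevance_score, text) pairs."""
    relevant = [text for score, text in scored_passages if score >= min_score]
    if not relevant:
        # Never call the model without evidence to ground its answer.
        return NO_ANSWER
    context = "\n".join(f"[{i}] {text}" for i, text in enumerate(relevant, start=1))
    prompt = (
        "Answer the question using only the numbered passages. "
        f"If they do not contain the answer, reply exactly: {NO_ANSWER}\n\n"
        f"{context}\n\nQuestion: {question}"
    )
    return call_llm(prompt)
```

Prompting alone reduces hallucinations rather than eliminating them, so the refusal path in code is the part that gives a hard guarantee.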

Handling Multiple Versions of Documents: A company often has multiple versions of documents. The system must be able to recognize and handle different versions, ensuring users always access the most relevant and up-to-date information.
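A minimal sketch of version handling, assuming each indexed document carries a stable identifier and an integer version number (both fields are hypothetical metadata, not something every corpus provides):

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DocumentVersion:
    doc_id: str   # stable identifier shared by all versions of a document
    version: int  # higher means newer
    text: str


def latest_versions(documents):
    """Keep only the newest version of each document."""
    latest = {}
    for doc in documents:
        current = latest.get(doc.doc_id)
        if current is None or doc.version > current.version:
            latest[doc.doc_id] = doc
    return list(latest.values())
```

Filtering at index time like this keeps stale versions out of the search results; older versions can still be stored separately for audit purposes.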

Handling Contradictory Information: Inconsistent or contradictory information across documents is a common challenge. A sophisticated RAG system can identify contradictory information and inform the user about it, allowing them to make informed decisions.

Provenance: To be trusted by users, a RAG search system must provide provenance for every answer it gives. It must reference the documents and pages that its answers are based on and make it easy for the user to verify the answers.
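Provenance can be enforced in the data model itself. In the sketch below, every answer carries the documents and pages it is based on, and the formatter refuses to display an answer that has none. The class and field names are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Citation:
    document: str
    page: int


@dataclass(frozen=True)
class Answer:
    text: str
    citations: tuple  # tuple of Citation objects


def format_answer(answer):
    """Render an answer with its sources; reject answers that cite nothing."""
    if not answer.citations:
        raise ValueError("refusing to show an answer without sources")
    sources = "; ".join(f"{c.document}, p. {c.page}" for c in answer.citations)
    return f"{answer.text}\nSources: {sources}"
```

In a user interface, each source would typically link to the cited page so the user can verify the answer with one click.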

Speed: Semantic search and LLMs can be slow to run. They are new technologies that are significantly more complex than traditional search indexes. A good RAG system is optimized for speed, so users don’t have to wait for the answers to their questions.

Continuously Optimizing Search Results: A RAG system cannot just be deployed and forgotten about. To achieve the best search performance, it is crucial to work with domain experts to fine-tune the search. Every time the search delivers a wrong answer, experts can find the correct answer and update the system to learn from its mistake.
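One way to make this feedback loop measurable, offered here as an assumption rather than a description of Datch's process, is to keep the experts' corrections as labeled question and document pairs and re-run them after every change. `search_fn` is a hypothetical function returning a ranked list of document identifiers.

```python
def hit_rate(search_fn, labeled_queries, top_k=3):
    """Fraction of questions whose expected document appears in the top_k results.

    labeled_queries is a list of (question, expected_doc_id) pairs from experts.
    """
    if not labeled_queries:
        return 0.0
    hits = sum(
        1
        for question, expected in labeled_queries
        if expected in search_fn(question)[:top_k]
    )
    return hits / len(labeled_queries)
```

Each wrong answer reported by a user becomes a new labeled pair, so the test set grows with the system and a drop in the hit rate flags a regression before users see it.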

Datch’s RAG Document Search

At Datch, we have a ready-made solution that can quickly deliver value to a business. We have taken all the above nuances into account and built an enterprise-ready solution. Datch will also work with a client to fine-tune the results of the search.

Interested in seeing a demo of how this works for yourself? DM me or let us know in the comments section.

Christian Staton

You've repaired this exact asset failure before, but it's nearly impossible to find the old work order. Don't reinvent the wheel. Datch finds all the relevant history in your unstructured data.

7 months ago

Julian Seidenberg (PhD) What is the structural difference between Sarah’s RAG project and Datch’s RAG-powered search? You explained the benefits of a mature RAG system, but what makes that difference?

Shamane Siri

This is the Chinese translation of my profile.

7 months ago

Nice one. But I have the following question on the topic : "Reduced hallucinations." How would you achieve that? Do you finetune your own generators?

1 reaction

Elliot Sawyer

Senior Silverstripe Developer at Catalyst IT

7 months ago

Very insightful article Julian Seidenberg (PhD). I've been doing some work with Typesense recently and could actually implement something like this given the right corpus of data to search and an appropriate training model. Would love to see a demo if you're offering! https://typesense.org/docs/26.0/api/conversational-search-rag.html

1 reaction

Damon Andrews

Customer Champion, Problem Solver

7 months ago

From a product/user point of view, there is an issue with the basic RAG available in lots of places now. Basic RAG:
- model can't disambiguate two similar concepts
- model can't contextualise very well
- model struggles to say 'no / I don't know' when it has no info
You can definitely tell when it's done right with advanced RAG. It's like the difference between talking to a generalist and talking to an expert!

6 reactions

Emily Schaefer

Revolutionizing Asset Management with Generative AI | Industry 4.0 | Let's innovate together.

7 months ago

Datch makes unstructured data just as valuable as structured data! Love this

3 reactions
