The Future of Retrieval Systems & LLMs

Here’s a reality check: nobody with a ton of data, spread across multiple silos, is copying all of it into some new silo.

So the future of LLMs is very simple: they require integrated, sophisticated search capabilities to survive... especially in the enterprise.

Search is the key to Retrieval-Augmented Generation (RAG) - which, despite the many efforts to re-label and re-brand it, remains among the best mechanisms for saving time and money using LLMs.

Putting the “R” in RAG

SWIRL is a metasearch engine designed to RAG across silos, or augment most any RAG with data from one or more silos. There's no need to move all the data into a new vector database – assuming it’s already searchable through some system, vector “aware” or not.

SWIRL adapts the user’s query as necessary – into SQL, for example – then sends it out to one or more endpoints: search engines, databases, enterprise applications and information services. Asynchronously, of course.
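
To make that fan-out concrete, here’s a minimal sketch of the pattern in Python – illustrative only, with hypothetical connectors standing in for SWIRL’s real connector API:

```python
import asyncio

# Hypothetical connectors -- illustrative only, not SWIRL's actual API.
async def search_engine(query: str) -> list[dict]:
    # A real connector would make an HTTP call to e.g. Solr or Elasticsearch.
    await asyncio.sleep(0.1)  # simulate network latency
    return [{"source": "search_engine", "title": f"Hit for: {query}"}]

async def sql_database(query: str) -> list[dict]:
    # Adapt the keyword query into the endpoint's native form, e.g. SQL.
    sql = f"SELECT title FROM docs WHERE body LIKE '%{query}%'"
    await asyncio.sleep(0.1)  # simulate the database round trip
    return [{"source": "sql_database", "title": f"Row via: {sql}"}]

async def federate(query: str) -> list[dict]:
    # Fan out to every endpoint concurrently; no silo blocks another.
    per_source = await asyncio.gather(search_engine(query), sql_database(query))
    return [hit for hits in per_source for hit in hits]

print(asyncio.run(federate("quarterly revenue")))
```

The asyncio.gather call is the point: the slowest silo never serializes the others.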

Then it re-ranks the results using a Reader LLM.

The Selection Problem

If you're trying to build a retriever system using something like LangChain, it’s very cool until you get to the part where you have to figure out which of your data – out of the large amount you might have – is actually relevant to your RAG.

This is the first problem that SWIRL solves.

[Diagram: SWIRL re-ranking]

SWIRL vectorizes the user’s query, as well as each result from each source, using any configured embeddings model. (SWIRL ships with spaCy’s large English model by default.)
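
As a rough illustration of that step – this is not SWIRL’s internal code, and it assumes spaCy’s large English model is installed:

```python
import numpy as np
import spacy

# Assumes the large English model is installed:
#   python -m spacy download en_core_web_lg
nlp = spacy.load("en_core_web_lg")

def embed(text: str) -> np.ndarray:
    # spaCy's doc.vector is the average of the token word vectors
    return nlp(text).vector

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

query_vec = embed("quarterly revenue forecast")
for snippet in ["Q3 revenue guidance was raised", "Photos from the office picnic"]:
    print(round(cosine(query_vec, embed(snippet)), 3), snippet)
```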

It then re-ranks using an algorithm that combines Search, NLP and vector techniques – including term frequency, term surprise factor, source rank, recency, proximity, “aboutness”, entity analysis and soft cosine similarity.
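
As a toy illustration of blending signals like these – the signal names and weights below are hypothetical, and SWIRL’s real algorithm is considerably richer:

```python
# Hypothetical signals and weights -- SWIRL's real algorithm blends many
# more factors (term surprise, proximity, "aboutness", entities, etc.).
WEIGHTS = {"similarity": 0.5, "term_overlap": 0.3, "recency": 0.1, "source_rank": 0.1}

def blended_score(signals: dict[str, float]) -> float:
    # Each signal is assumed to be pre-normalized to the range [0, 1].
    return sum(WEIGHTS[name] * value for name, value in signals.items())

results = [
    {"title": "Q3 revenue guidance",
     "signals": {"similarity": 0.91, "term_overlap": 0.8, "recency": 0.9, "source_rank": 0.7}},
    {"title": "2019 office picnic recap",
     "signals": {"similarity": 0.22, "term_overlap": 0.1, "recency": 0.2, "source_rank": 0.7}},
]
results.sort(key=lambda r: blended_score(r["signals"]), reverse=True)
print([r["title"] for r in results])
```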

Ultimately it seeks results that evidence the user’s query and intent. And it draws a line where the relevancy falls off. (In the Galaxy UI, these are shown as star ratings.)
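
One simple way to “draw the line” – purely illustrative, not SWIRL’s actual cutoff rule – is to cut the ranked list at the largest drop between consecutive relevancy scores:

```python
# Purely illustrative: cut the ranked list at the largest drop between
# consecutive relevancy scores (not SWIRL's actual cutoff rule).
def cutoff(scores: list[float]) -> int:
    drops = [scores[i] - scores[i + 1] for i in range(len(scores) - 1)]
    return drops.index(max(drops)) + 1  # keep everything above the biggest drop

scores = [0.92, 0.88, 0.85, 0.41, 0.38]  # sorted, highest first
print(scores[:cutoff(scores)])  # -> [0.92, 0.88, 0.85]
```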

The XetHub study (https://about.xethub.com/blog/you-dont-need-a-vector-database) shows very clearly that re-ranking can outperform a “full vector” approach. And it definitely costs a fraction of the time, effort and ongoing expense.

Passage Detection and Token Counts

But that’s the first part of RAG: retrieval. The second part, often overlooked, is the “augment” step.

In a nutshell, augment means “putting relevant data into the prompt”. Search result snippets are indicative of the information they link to, but to perform a proper augmentation, the full text of the document needs to be retrieved.

That’s actually the easy part.

Much more important than retrieving any single document is analyzing the set of most relevant documents to remove duplicates, select the most recent, and, perhaps most importantly, pare them down to the relevant portions.

Do you really want to pay to summarize 7,000 tokens of boilerplate PowerPoint when the answer you’re seeking is on slide 17?

That’s the second problem SWIRL solves for you.

SWIRL’s Reader LLM can de-dupe quite effectively using vectors; it can also crack open 1,500 file formats, find the most relevant portions and chunk them in less than 1 second (per X MB) before sending them out to your choice of GAI for summarization, question answering, comparison and/or translation – among others.
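
Here’s an illustrative sketch of the vector de-dupe and passage-selection idea – the embed callable could be any embeddings model (for instance, the spaCy sketch above); none of this is SWIRL’s actual implementation:

```python
import numpy as np

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

def dedupe(passages: list[str], embed, threshold: float = 0.95) -> list[str]:
    # Keep a passage only if it is not a near-duplicate of one already kept.
    kept, kept_vecs = [], []
    for passage in passages:
        vec = embed(passage)
        if any(cosine(vec, kv) > threshold for kv in kept_vecs):
            continue
        kept.append(passage)
        kept_vecs.append(vec)
    return kept

def select_relevant(passages: list[str], query: str, embed, top_k: int = 3) -> list[str]:
    # Rank chunks against the query so only the best few go into the prompt.
    query_vec = embed(query)
    return sorted(passages, key=lambda p: cosine(embed(p), query_vec), reverse=True)[:top_k]
```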

The Future of RAG is ... Search

Want to supercharge your existing RAG systems while also avoiding the overhead of copying and re-indexing? Put SWIRL into your stack!

Check out the video below showing how quickly you can install and configure the Community Edition of SWIRL!

[Video: RAG with SWIRL AI Search]


Sid Probstein

CEO at SWIRL | 10x CTO | AI & Search Pioneer | ex-Attivio

3 months

Predictions are hard. Especially about the future. But after a week at KMWorld I am more confident than ever that search is *the* key to making AI work in the enterprise.

Dave Voutila

Building better AI, ex-Attivio

6 months

It's pretty wild how the dream we tried multiple times to build at Attivio, with semantic understanding of an NLP query, is becoming more possible now with LLMs being generally available.

Robert Yelle

Global Client & Partner Enablement Director

6 months

Stealing the term “aboutness” … good stuff
