登录查看更多内容

Vector search, RAG, and large language models

Clara Shih

CEO of Salesforce AI | Founder & Board Chair of Hearsay Systems | TIME 100 AI | WEF YGL

发布日期: 2023年12月15日

Large language models, or LLMs, can't be relied on to recall specific data they were trained on, so the way to make them work in the enterprise, where accuracy is paramount, is to feed them with the right data. This is called grounding prompts using retrieval augmented generation, or RAG for short.

RAG relies on both keyword search (for structured data) and vector search (for unstructured data such as documents, call transcripts, videos, spreadsheets etc).
Unstructured data is also sometimes referred to as blobs, or Binary Large Objects (data in binary form that may or may not conform to a specific file format). 80% of enterprise data is unstructured.
Keyword search + vector search together is referred to as hybrid search. Hybrid search makes AI systems like Einstein Copilot very powerful in their ability to understand, generate outputs, and automate actions across a wide variety of use cases, contexts, and data/content types.

Why vectors? Unstructured data can't be stored in rows and columns in a relational database. It requires a different approach than SQL (or the Salesforce equivalents, SOQL and SOSL).

Data Science Dojo 1 年前

Data Analytics in the Age of AI, When to Use RAG…

Open Data Science Conference (ODSC) 6 个月前

RAG Unlocks Your Enterprise Data

VAST Data 4 周前

Vectors are an efficient way of representing unstructured data. This matters both for quickly indexing/ performing similarity search (also known as semantic search) against prompts and also to efficiently pass large amounts of data in to LLMs given their limited context windows.
Unstructured data requires much more storage and traditionally was difficult and slow to analyze or search. Enter LLMs - which are very good at understanding the most important, defining attributes of data blobs to pay attention to - these become the vector dimensions. All other dimensions are collapsed/ignored.
A smaller LLM dedicated to vectorizing unstructured data called an embeddings model is used to create the vectors. The embeddings model is different from the LLM that's used to generate outputs (into which the vectors are passed).
Vector embeddings aren't new. Google search has used embeddings for years. But LLMs make vector embeddings both possible and mission-critical for AI applications.

In the coming months and years, every organization and even individuals will need vector databases in order to overcome the limitations of LLMs -- including limited context windows, knowledge cutoff dates, and hallucinations -- and effectively utilize generative AI.

Mudit Agarwal

Head of IT ? Seasoned VP of Enterprise Business Technology ? Outcome Based Large Scale Business Transformation (CRM, ERP, Data, Security) ? KPI Driven Technology Roadmap

5 个月

Clara, Nice! Thanks for sharing!

Steve Hovland

AI for All

9 个月

Deals with a real problem.

Kalpesh Sharma

9 个月

???????????? ?????????? BELOW LINKEDIN POST LINK ???? ???????????????? ????????????????: ???????????????? ???? ?????? ?????????????? ???????????????????? ?????????????????? ???????????????????????? ?????? ?????????? ????????????????, ?????? ???????? ????????????????, ?????? ?????????? ?????? ?????? ???????????????? ???? ??????????: https://www.dhirubhai.net/posts/sharmakalpesh_todays-content-title-%3F%3F%3F%3F%3F%3F%3F-%3F%3F%3F%3F%3F%3F%3F%3F%3F%3F-activity-7150763464024539137-iPPK

1 次回应

Vernon Keenan

Transforming Business with Ethical AI ?? WorkDifferentWithAI.com/sign-up ?? Sr. Industry Analyst at SalesforceDevops.net

10 个月

Once again. Salesforce leads the enterprise AI race by integrating RAG into their prompt architecture. RAG is all the rage. Using RAG lets you do the thing many people are demanding, which is “how do I use ChatGPT with documents from my company?“ Both Microsoft and Amazon spent December explaining how they are integrating RAG into their enterprise cloud architectures. And the new GPT, available from open AI, also accomplish roughly the same thing. That’s why RAG appears to be the number one orchestration pattern in Enterprise AI today.

2 次回应

查看更多评论

要查看或添加评论，请登录

查看全部

Vector search, RAG, and large language models

Clara Shih

CEO of Salesforce AI | Founder & Board Chair of Hearsay Systems | TIME 100 AI | WEF YGL

领英推荐

更多精彩文章

社区洞察

其他会员也浏览了

Databricks AI/BI Series: A Technical Overview of AI/BI Genie

Synerise open-sourcing Cleora AI framework for ultra-fast embeddings in large graphs

Power of Vector Databases and its Evolution with AI & ML

Is Your Data Strategy Ready for Generative AI?

How Enterprise Data Observability will make the most of your Shiny New Vector Databases

Generative AI might revolutionize Data Science!

Vector Databases vs. Knowledge Graphs: Choosing the Right Foundation for Retrieval-Augmented Generation

10 (free) AI tools for data science

Overcoming Data Scarcity: Doing More With Less Using Data Centric AI

领英推荐

What No One Tells You About Being an Entrepreneur

2024年4月9日

The Battle To Win Employees And Conquer Customer Loyalty

2021年7月13日

What Being a Startup CEO Taught Me About the Strategic Importance of Customer Service

2021年6月2日

Thoughts from Day 1 as CEO of Service Cloud

2021年1月25日

Sabbatical Reflections: Our Role in Making a Better Society

2021年1月25日

Your Leadership Power #20WIP

2020年10月22日

The Last Mile of Insurance

2020年10月21日

The Power & Peril of Social Media

2020年10月14日

Sequoia Seven Questions: Hard-won advice for founders

2020年10月9日

Passing the Torch to Hearsay's New CEO

2020年9月2日