Chat with your private data using RAG & LLMs

Large Language Models (LLMs) like ChatGPT have been making a significant impact across various industries with capabilities such as summarization, question answering, and text generation. While LLMs excel at general knowledge, they stumble when it comes to personalized information, since they have access only to the data they were trained on.

In a way, their 'knowledge base' is stuck in the past. They lack the ability to access your domain-specific data, whether it's customer records, proprietary documents, product specifications, or specialized databases.

So, when you ask questions like:

How many customer accounts do we have?
ChatGPT can't answer questions about private data

ChatGPT can’t provide an accurate answer since it doesn't have access to that data point. In certain instances, LLMs like ChatGPT may even generate fabricated information, a phenomenon often referred to as "hallucination," leading to incorrect or inaccurate responses.

By connecting LLMs to private data, we can obtain accurate answers and derive numerous insights. One approach to achieving this is regularly retraining an LLM on domain-specific data, but this process can be resource-intensive.

Fortunately, we have a simple solution to this challenge.

It’s called Retrieval Augmented Generation (RAG).

RAG is a framework for improving the accuracy of LLM responses by connecting the models to external data sources. Using RAG, we can access relevant information from our private data and enhance LLMs with it, enabling them to deliver concise and factually accurate responses to our queries.

RAG even allows linking to the original data sources so that LLMs can provide evidence and citations to the end users.

At a high level, the architecture looks like this:

Step 1: Create embeddings from the private data and store them in a vector database:

Create embeddings for the data

We split the private data into multiple chunks and use an embedding model to create embeddings, which we store in a vector database. Embedding is a process in which we transform chunks of data into numerical vectors; these vectors are what the vector database stores and searches.

There are multiple options for embedding models and vector databases; you can use these two links to compare them:

  1. Embedding models leaderboard - Hugging Face
  2. Vector databases benchmark - Qdrant
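To make this step concrete, here is a minimal sketch in Python. It assumes the sentence-transformers library and uses a plain in-memory matrix as a stand-in for a real vector database; the file name, chunk size, and model name are illustrative choices, not recommendations:

```python
from sentence_transformers import SentenceTransformer
import numpy as np

def chunk_text(text: str, chunk_size: int = 500) -> list[str]:
    # Naive fixed-size chunking; real pipelines often split on
    # sentence or paragraph boundaries, with some overlap.
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

# Illustrative model choice; see the Hugging Face leaderboard above.
model = SentenceTransformer("all-MiniLM-L6-v2")

private_data = open("customer_records.txt").read()  # hypothetical data file
chunks = chunk_text(private_data)

# Each chunk becomes a numerical vector (an embedding).
embeddings = model.encode(chunks, normalize_embeddings=True)

# Stand-in for a real vector database (Qdrant, Pinecone, etc.):
# we simply keep the matrix of vectors alongside the source chunks.
vector_store = {"vectors": np.array(embeddings), "chunks": chunks}
```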


Step 2: Create embeddings from the user query and find similar vectors

For the user's query, we find relevant docs from the vector store

We use the same embedding model to create an embedding of the query and retrieve the top-n vectors most similar to it. Different measures and algorithms, such as cosine similarity and maximal marginal relevance (MMR), can be used to find similar vectors.

These vectors represent the chunks of data that are relevant to the user query.
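Continuing the sketch from Step 1, cosine-similarity retrieval can be as simple as a dot product, because the vectors were normalized to unit length at indexing time (top_n is an illustrative parameter):

```python
def retrieve(query: str, top_n: int = 3) -> list[str]:
    # Embed the query with the SAME model used for the documents.
    query_vec = model.encode([query], normalize_embeddings=True)[0]

    # Cosine similarity reduces to a dot product for unit vectors.
    scores = vector_store["vectors"] @ query_vec

    # Indices of the top-n most similar chunks, best match first.
    top_idx = np.argsort(scores)[::-1][:top_n]
    return [vector_store["chunks"][i] for i in top_idx]
```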

Step 3: Pass the relevant documents along with the query to the LLM:

We then pass the relevant pieces of information, along with the query, to the LLM, which acts as a reasoning agent and delivers a well-articulated response to the user.

As for the LLM, we can use proprietary models such as OpenAI's GPT models via API, or alternatively run open-source models like Llama 2 on our own machines.
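As a sketch of this final step, here is how the retrieved chunks and the query could be passed to a proprietary model, assuming the official OpenAI Python client; the model name and the prompt wording are illustrative assumptions:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def answer(query: str) -> str:
    # Stitch the retrieved chunks (from Step 2) into a context block.
    context = "\n\n".join(retrieve(query))
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=[
            {"role": "system",
             "content": "Answer using ONLY the provided context. "
                        "If the answer is not in the context, say so."},
            {"role": "user",
             "content": f"Context:\n{context}\n\nQuestion: {query}"},
        ],
    )
    return response.choices[0].message.content

print(answer("How many customer accounts do we have?"))
```

Grounding the model in the retrieved context this way is also what makes citations possible: since we know which chunks were passed in, we can link the answer back to its original sources.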

You can find more information about RAG on Meta's blog.

Hope you enjoyed this short article!


