CHATGPT HAS NO IDEA ABOUT MY SOURCE OF TRUTH
In today's tech-driven world, artificial intelligence is at the forefront of innovation. One of the latest trends in AI centers on generative models, and you simply can't escape their impact :). In this blog post, I'd like to introduce you to an exciting concept called RAG: Retrieval Augmented Generation. RAG is all about improving the quality of conversational agents (i.e., LLMs) by grounding them in a source of knowledge, which in turn enhances the LLM's ability to provide accurate and relevant information.
To understand RAG better, let's delve into some essential concepts: vectors and embeddings. A vector is essentially an array of numbers. An embedding is a vector that captures the meaning of a piece of text; an embedding model converts the text into that vector, and today this conversion is as easy as a single function call. But why is this important, you ask? Well, consider a query like: "As a dynamical meteorologist, derive the stability equation for the leapfrog scheme, otherwise called the Courant-Friedrichs-Lewy condition." If we submit this query without any grounding/source of truth, we might receive an inadequate response, because AI models are trained on vast general-purpose datasets that may not cover our specific domain. This is where RAG's logic comes into play. By taking our source of truth, which contains information relevant to our specific context, and converting it into vectors, we can constrain the response to give us adequate information.
The magic of retrieving the right response happens through a mathematical function called cosine similarity, which measures the similarity between two non-zero vectors in an inner product space. By comparing the query vector against our stored vectors and retrieving the most similar ones, we can provide a tailored response. Sounds a bit complicated? Delve into the bigger picture below :).
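To make this concrete, here is a minimal sketch of cosine similarity in TypeScript (the function name and sample vectors are just illustrative; in practice the vector database computes this for you):

// Cosine similarity: dot(a, b) / (|a| * |b|), which ranges from -1 to 1.
// Values closer to 1 mean the two vectors point in similar directions,
// i.e. the two pieces of text have similar meaning.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

console.log(cosineSimilarity([0, 4, 5], [0, 4, 5])); // 1 (identical direction)
console.log(cosineSimilarity([1, 0, 0], [0, 1, 0])); // 0 (unrelated)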
For example, each record we store may pair an embedding with associated metadata, like this:
- Embedding: [0, 4, 5] -> an array of numbers (the vector)
- Metadata: {textContent: 'Facts about Paris'} -> the original text and any extra context attached to the vector
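As a rough TypeScript sketch (the field names here are illustrative, not a fixed schema), such a record could look like:

// One stored record: the vector plus the text it was derived from.
interface VectorRecord {
  id: string;                        // unique identifier for the chunk
  values: number[];                  // the embedding vector
  metadata: { textContent: string }; // original text attached to the vector
}

const record: VectorRecord = {
  id: "paris-001",
  values: [0, 4, 5],
  metadata: { textContent: "Facts about Paris" },
};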
In practice, we can have a function that splits our text into documents, effectively dividing it into manageable chunks.
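With LangChain's JavaScript text splitter, that might look like the following sketch (the chunk sizes and the sourceOfTruthText variable are illustrative):

import { RecursiveCharacterTextSplitter } from "langchain/text_splitter";

// Split a long document into overlapping chunks so each chunk fits
// comfortably within the embedding model's input limits.
const splitter = new RecursiveCharacterTextSplitter({
  chunkSize: 1000,   // target characters per chunk
  chunkOverlap: 200, // overlap preserves context across chunk boundaries
});

const docs = await splitter.createDocuments([sourceOfTruthText]);
// docs is an array of Documents, each with pageContent and metadata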
So, how do we turn our text into vectors, and where do we store those vectors for retrieval and cosine similarity calculations? Enter Pinecone, a database designed for storing and searching vectors. Here's a high-level overview of how this works: when we receive a query, we convert it into a vector (i.e., embed it). Then we search the vectors in Pinecone for the most similar ones. Once we identify a similar vector, we retrieve it from Pinecone, complete with its metadata. With this information, we can build a prompt that feeds into ChatGPT to generate a relevant response.
Here are the ingestion steps in a nutshell (a sketch follows the list):
1. Obtain your source of truth.
2. Split your source of truth into manageable chunks.
3. Embed these chunks (convert each one into a vector).
4. Store the resulting vectors in Pinecone.
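Here is a minimal ingestion sketch, assuming the openai-edge and @pinecone-database/pinecone packages (the index name, embedding model, and ID scheme are illustrative):

import { Configuration, OpenAIApi } from "openai-edge";
import { Pinecone } from "@pinecone-database/pinecone";

const openai = new OpenAIApi(
  new Configuration({ apiKey: process.env.OPENAI_API_KEY })
);
const pinecone = new Pinecone({ apiKey: process.env.PINECONE_API_KEY! });
const index = pinecone.index("source-of-truth");

async function ingestChunks(chunks: string[]) {
  for (let i = 0; i < chunks.length; i++) {
    // 1. Embed the chunk.
    const res = await openai.createEmbedding({
      model: "text-embedding-ada-002",
      input: chunks[i],
    });
    const { data } = await res.json();

    // 2. Store the vector in Pinecone, keeping the original text as metadata.
    await index.upsert([
      {
        id: `chunk-${i}`,
        values: data[0].embedding,
        metadata: { textContent: chunks[i] },
      },
    ]);
  }
}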
Now, let's talk about retrieval and response generation. At this point, we're essentially searching for the right answer. Here's what we do (a sketch follows the list):
1. Embed the query (convert it into a vector).
2. Query Pinecone for similar vectors.
3. Extract the metadata from the most similar vectors.
4. Feed that metadata into the OpenAI prompt to generate a contextually appropriate response in line with our source of truth.
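Putting those four steps together, here is a retrieval sketch under the same assumptions as the ingestion sketch above (it reuses the openai and index clients; the model and topK value are illustrative):

async function answerQuery(query: string): Promise<string> {
  // 1. Embed the query.
  const embedRes = await openai.createEmbedding({
    model: "text-embedding-ada-002",
    input: query,
  });
  const { data } = await embedRes.json();

  // 2. Query Pinecone for the most similar vectors.
  const results = await index.query({
    vector: data[0].embedding,
    topK: 5,
    includeMetadata: true,
  });

  // 3. Extract the metadata (the original text) from the matches.
  const context = results.matches
    .map((m) => (m.metadata as { textContent: string }).textContent)
    .join("\n");

  // 4. Feed the context into the prompt so the answer stays grounded
  //    in our source of truth.
  const chatRes = await openai.createChatCompletion({
    model: "gpt-3.5-turbo",
    messages: [
      { role: "system", content: `Answer using only this context:\n${context}` },
      { role: "user", content: query },
    ],
  });
  const completion = await chatRes.json();
  return completion.choices[0].message.content;
}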
With an understanding of these concepts, let's explore some packages and tools that make this ambitious project possible.
LangChain: This tool aids in splitting our source of truth into manageable chunks, making the process more efficient.
Pinecone: This database is used for storing and organizing our vectors, making it easy to access and retrieve the information we need.
OpenAI-Edge: A fetch-based OpenAI API client suited to edge runtimes, which lets us create embeddings and chat completions, so we can seamlessly feed the retrieved metadata into our ChatGPT prompt.
Vercel AI SDK: For creating the ChatGPT-like conversational experience, Vercel AI SDK is a recent and powerful SDK that simplifies the process, abstracting away much of the complexity.
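On the server side, a typical Next.js route handler combining openai-edge with the Vercel AI SDK's streaming helpers looked something like this sketch (based on the SDK's API surface at the time of writing; in a full RAG app you would prepend the retrieved context to the messages):

import { Configuration, OpenAIApi } from "openai-edge";
import { OpenAIStream, StreamingTextResponse } from "ai";

const openai = new OpenAIApi(
  new Configuration({ apiKey: process.env.OPENAI_API_KEY })
);

// Streams the model's reply token by token, giving the client
// the familiar ChatGPT-style typing effect.
export async function POST(req: Request) {
  const { messages } = await req.json();
  const response = await openai.createChatCompletion({
    model: "gpt-3.5-turbo",
    stream: true,
    messages,
  });
  return new StreamingTextResponse(OpenAIStream(response));
}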
The power of AI, specifically generative models like ChatGPT, is undeniable. However, to make these models truly effective and aligned with our source of truth, techniques like RAG and tools like Pinecone, LangChain, and the Vercel AI SDK come into play. With these technologies at our disposal, we can bridge the gap between AI models and real-world information, creating more accurate and contextually relevant conversational agents.
Note: The era of AI-powered content interaction has arrived, and it's only going to get more fascinating from here.