Retrieval-Augmented Generation

We have started to use LLMs extensively in our daily lives: when in doubt, we go to ChatGPT and ask it a question. The other day, I was wondering which car is the most expensive in the world, so I asked ChatGPT, and this is what I got:

Response from ChatGPT

There are two problems with this answer:

  1. The information might be outdated.
  2. We do not know the source of the information.

Large Language Models are trained on a corpus of data from the internet and other sources, but as a model ages, its information also becomes outdated, which requires regular re-training of such models.

RAG (Retrieval-Augmented Generation) is a framework that combines the power of LLMs with knowledge banks for knowledge-intensive NLP tasks. It references an external knowledge base (which was not part of the model's training data) to retrieve facts before generating a response.

The architecture of RAG contains two major components:

1. Retriever

2. Generator

Retriever:

The retriever component of the RAG (Retrieval-Augmented Generation) model is responsible for retrieving relevant material from a large corpus or knowledge database, such as Wikipedia or an internal database, in response to an input query.

  • Input Query: The model accepts an input query (a question or prompt).
  • Encoding: A neural network-based encoder converts the input query into a dense vector representation. In the case of RAG, the encoder is built on the BERT (Bidirectional Encoder Representations from Transformers) architecture.
  • Document Index: The retriever has access to a document index, which is essentially a collection of pre-encoded documents. Each document in the index is encoded into a dense vector representation using a document encoder, which is often also based on BERT.
  • Retrieval: The encoded query is compared against the encoded representations of the documents in the index using a similarity metric, typically cosine similarity. The retriever selects the top-k documents in the index that are most similar to the query.
  • Output: The retriever returns the top-k retrieved documents, along with their similarity scores.
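The retrieval step above can be sketched in a few lines of Python. Here the "embeddings" are random vectors standing in for real BERT encodings, and the document ids are placeholders; this is a minimal illustration of cosine-similarity top-k retrieval, not the paper's actual retriever.

```python
import numpy as np

# Toy "document index": pre-encoded documents. In a real system these
# vectors would come from a BERT-style document encoder; random vectors
# are used here purely for illustration.
rng = np.random.default_rng(0)
doc_vectors = rng.normal(size=(5, 8))        # 5 documents, 8-dim embeddings
documents = [f"doc-{i}" for i in range(5)]   # placeholder document ids

def cosine_top_k(query_vec, doc_vecs, k=2):
    """Rank documents by cosine similarity to the query and return the top-k."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    scores = d @ q                        # cosine similarity per document
    top = np.argsort(scores)[::-1][:k]    # indices of the k best matches
    return [(documents[i], float(scores[i])) for i in top]

query_vec = rng.normal(size=8)            # stand-in for an encoded query
print(cosine_top_k(query_vec, doc_vectors, k=2))
```

Production systems replace the brute-force comparison with an approximate nearest-neighbor index (e.g. FAISS), but the scoring logic is the same.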

Reference: arXiv:2005.11401v4

Generator:

The generator produces the next token in the sequence using the query, the retrieved documents, and any previously generated tokens as inputs. It gains additional context by concatenating the input query with the retrieved documents. During training, the generator is fine-tuned to produce the desired sequence given the input query and the retrieved documents.

At test time, the generator creates the output sequence token by token, based on the input and retrieved documents.
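The concatenation step can be sketched as follows. The prompt layout and the example passages are illustrative assumptions, not the paper's exact input format; the point is simply that the generator conditions on both the query and the retrieved text.

```python
def build_prompt(query, retrieved_docs):
    """Concatenate retrieved passages with the query so the generator
    can condition on both. The exact layout is an illustrative choice."""
    context = "\n".join(f"[{i + 1}] {doc}" for i, doc in enumerate(retrieved_docs))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

# Hypothetical retrieved passages for the running example.
docs = [
    "The Rolls-Royce Boat Tail is among the most expensive cars ever sold.",
    "Prices for bespoke luxury cars can exceed $20 million.",
]
prompt = build_prompt("Which car is the most expensive in the world?", docs)
print(prompt)
```

This prompt would then be fed to the generator, which decodes the answer token by token.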


For more information, please refer to this research paper: https://proceedings.neurips.cc/paper/2020/hash/6b493230205f780e1bc26945df7481e5-Abstract.html
