Retrieval-Augmented Generation (RAG) Production System - Data Relevancy Improvement Technique
Vivek Kaushik
Senior Data Scientist @ Mott MacDonald | Generative AI | Deep Learning | LLM Application | Langchain | Azure | AWS Bedrock | LLMOps | Prompt Engineering | Product Strategy | Airflow | AI Governance | Computer Vision
Today, organizations everywhere are riding the wave of Generative AI and bringing Large Language Model (LLM) based applications into their business domains.
The most popular use case currently being implemented is based on Retrieval-Augmented Generation (RAG).
Why Did the RAG Use Case Arrive?
Generative AI uses large language models (LLMs) to create text responses. The generated text is often easy to read and provides detailed responses that are broadly applicable to the questions asked of the software. However, the information used to generate the response is limited to the information used to train the model, which may be weeks, months, or years out of date. This can lead to incorrect responses that erode confidence in the technology among customers and employees.
What Is Retrieval-Augmented Generation (RAG)?
Retrieval-Augmented Generation (RAG) is a technique that optimizes the output of a large language model (LLM) with targeted information, without modifying the underlying model itself. This targeted information can be more up to date than the LLM's training data and specific to a particular organization and industry. RAG can provide more contextually appropriate answers to prompts and base those answers on very current data. The concept of RAG was introduced in a 2020 paper by Patrick Lewis and a team at Facebook AI Research.
Key Concepts & Technologies Used To Design & Build a RAG System:
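The core pieces are document chunking, an embedding model, a vector store, a retrieval step, and an LLM for generation. Below is a minimal, self-contained sketch of that pipeline; the embedding model name, the in-memory cosine-similarity "vector store", and the answer_with_llm placeholder are illustrative assumptions, not a production design, where you would use a real vector database and LLM client.

```python
# Minimal RAG sketch: embed chunks, retrieve the most similar ones for a query,
# and build a grounded prompt for the LLM. Model name and helpers are illustrative.
import numpy as np
from sentence_transformers import SentenceTransformer

documents = [
    "RAG augments an LLM with up-to-date, organization-specific knowledge.",
    "A vector database stores embeddings of document chunks for retrieval.",
    "Cross-encoder re-ranking re-orders retrieved chunks by true relevancy.",
]

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
doc_vectors = embedder.encode(documents, normalize_embeddings=True)

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k chunks whose embeddings are most similar to the query."""
    q_vec = embedder.encode([query], normalize_embeddings=True)[0]
    scores = doc_vectors @ q_vec              # cosine similarity (vectors are normalized)
    top_idx = np.argsort(scores)[::-1][:k]
    return [documents[i] for i in top_idx]

def answer_with_llm(query: str, context: list[str]) -> str:
    # Placeholder for the generation step (e.g. an Azure OpenAI or Bedrock call).
    prompt = "Answer using only this context:\n" + "\n".join(context) + f"\n\nQuestion: {query}"
    return prompt  # in a real system, send this prompt to the LLM here

context = retrieve("How does a RAG system stay up to date?")
print(answer_with_llm("How does a RAG system stay up to date?", context))
```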
Why Do Retrieval-Augmented Generation (RAG) Systems Fail?
The single most important reason RAG systems fail is the battle between similarity and relevancy.
In any production-ready, large-scale Retrieval-Augmented Generation (RAG) system, the relevancy of the information extracted from the vector database plays a vital role: the nearest neighbours by embedding similarity are not always the chunks that actually answer the query.
How To Improve Data Relevancy in a Retrieval-Augmented Generation (RAG) System?
When extracting knowledge with a retrieval algorithm, it is good practice to check the relevancy of the retrieved data by projecting the embedding vectors into a 2D or 3D representation.
Uniform Manifold Approximation and Projection (UMAP) is a dimension-reduction technique that can project embeddings into 2D for visualisation, making it easy to see whether the retrieved chunks actually cluster around the query, as sketched below.
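A minimal sketch of that visual check, assuming corpus_embeddings is an (N, d) NumPy array of chunk embeddings and query_embedding is a (1, d) array produced by the same embedding model; it requires the umap-learn and matplotlib packages.

```python
# Project corpus and query embeddings into 2D with UMAP and plot them
# to eyeball whether the retrieved chunks sit near the query.
import matplotlib.pyplot as plt
import umap

reducer = umap.UMAP(n_components=2, random_state=42)
corpus_2d = reducer.fit_transform(corpus_embeddings)   # fit on the whole corpus
query_2d = reducer.transform(query_embedding)          # project the query into the same space

plt.scatter(corpus_2d[:, 0], corpus_2d[:, 1], s=5, alpha=0.5, label="document chunks")
plt.scatter(query_2d[:, 0], query_2d[:, 1], marker="x", s=120, c="red", label="query")
plt.legend()
plt.title("UMAP projection: do the retrieved chunks cluster around the query?")
plt.show()
```

If the top retrieved chunks scatter far from the query point, similarity search is returning text that looks alike but is not relevant, which is exactly the failure mode re-ranking addresses.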
The best way to improve relevancy is the cross-encoder re-ranking technique.
Cross-Encoder: A cross-encoder is a type of neural network architecture used in natural language processing tasks, particularly for sentence or text-pair classification. It evaluates a pair of input sentences jointly and produces a single score indicating the relationship or similarity between them. This differs from architectures such as Siamese networks (bi-encoders) or traditional encoder-decoder models, which encode each text independently.
Re-Ranking: After the initial retrieval, the candidate results are re-ordered so that the chunks most compatible and relevant to the query come first. A sketch with a pre-trained cross-encoder is shown below.
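A minimal sketch of cross-encoder re-ranking using the sentence-transformers library with a pre-trained MS MARCO cross-encoder; the model choice and the retrieved_chunks list (standing in for the top-k hits from the vector database) are illustrative assumptions.

```python
# Re-rank retrieved chunks by scoring each (query, chunk) pair with a cross-encoder.
from sentence_transformers import CrossEncoder

query = "How does a RAG system stay up to date?"
retrieved_chunks = [
    "RAG augments an LLM with up-to-date, organization-specific knowledge.",
    "Vector databases use approximate nearest-neighbour search over embeddings.",
    "Cross-encoder re-ranking re-orders retrieved chunks by true relevancy.",
]

# The cross-encoder reads query and chunk together; higher score = more relevant.
reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")
scores = reranker.predict([(query, chunk) for chunk in retrieved_chunks])

# Re-order the chunks by cross-encoder score, best first, before sending them to the LLM.
ranked = sorted(zip(scores, retrieved_chunks), reverse=True)
for score, chunk in ranked:
    print(f"{score:.3f}  {chunk}")
```

The bi-encoder retrieval stays fast over the whole corpus; the slower, more accurate cross-encoder is applied only to the small candidate set, which is why this two-stage design works well in production.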
Try building re-ranking with pre-trained or custom-trained cross-encoder models. Let's connect for consultancy to learn more about developing RAG systems.
#rag #genai #llm #chatgpt #llama2 #consultancy #machinelearning #ai #development #product #artificialintelligence #reranking #FANG #learning #article #facebookresearch #retrievalaugmentedgeneration #production #aws #gcp #cloud #azureopenai