Building a Ranking System to Enhance Prompt Results: The New PageRank for RAG/LLM

In this document, you will learn how to build a system that decides, among dozens of candidate paragraphs selected from the corpus to answer a prompt, which ones to show in the results, and in what order. The goal is to maximize relevance without overwhelming the user with a long, cluttered answer. Think of it as the new PageRank for RAG/LLM, although the algorithm is radically different and much simpler.

The approach is generic and works for all RAG/LLM systems, whether or not they are based on neural networks. It is implemented in xLLM. The main steps are:

Backend processing (linked to the corpus)

  1. Split your corpus into text entities such as webpages, paragraphs, sections, and so on. This step is similar to chunking. Attach an ID (called an index) to each text entity.
  2. Text entities have two types of fields: regular text, as in all LLMs, and knowledge graph elements such as categories, related items, URLs, tags, parent categories, titles, and so on. These knowledge graph elements are found while crawling and are part of the original corpus, or they can be added after the full crawl. For instance, in xLLM, agents are assigned to text entities post-crawling, using a clustering algorithm.
  3. You need two types of tokens: regular ones, and those found in the knowledge graph elements. The latter are called graph tokens. You then create a key-value table Hash_ID, where the key is a token and the value is the list of text entity IDs attached to that token, with a token count for each one. Graph tokens start with “__” to differentiate them from regular tokens.
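
The backend steps above can be sketched as follows. This is a minimal illustration, not the xLLM implementation: the entity schema, the field names ("index", "text", "tags", "categories"), and the whitespace tokenizer are all simplifying assumptions.

```python
from collections import defaultdict

def tokenize(text):
    # Simplistic tokenizer for illustration; a real system would
    # normalize punctuation, handle stemming, etc.
    return text.lower().split()

def build_hash_id(entities):
    # Hash_ID: token -> {text entity ID: token count}.
    # Graph tokens (from knowledge graph fields) are prefixed
    # with "__" to distinguish them from regular tokens.
    hash_id = defaultdict(lambda: defaultdict(int))
    for entity in entities:
        eid = entity["index"]
        for token in tokenize(entity["text"]):
            hash_id[token][eid] += 1
        # Hypothetical knowledge graph fields, stored as lists.
        for field in ("categories", "tags"):
            for token in tokenize(" ".join(entity.get(field, []))):
                hash_id["__" + token][eid] += 1
    return hash_id

entities = [
    {"index": 1, "text": "Ranking for RAG systems", "tags": ["ranking"]},
    {"index": 2, "text": "RAG retrieval and ranking", "tags": ["retrieval"]},
]
hash_id = build_hash_id(entities)
print(dict(hash_id["ranking"]))    # {1: 1, 2: 1}
print(dict(hash_id["__ranking"]))  # {1: 1}
```

Note how the same word can appear both as a regular token ("ranking") and as a graph token ("__ranking"); the prefix keeps the two populations separate in the single Hash_ID table.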

Frontend processing (linked to the prompt)

  1. You create a small, local key-value table ID_Hash, a transposed version of Hash_ID, where the key is a text entity ID and the value is the list of tokens t found in the prompt such that ID is in Hash_ID[t].
  2. For each ID, you compute (say) 4 relevancy scores: [..]
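
A sketch of the ID_Hash transposition described in step 1 follows. The sample Hash_ID contents are hypothetical, and the final match-count score is only an illustrative placeholder, not one of the article's four relevancy scores, which are detailed in the full article.

```python
from collections import defaultdict

def tokenize(text):
    # Same simplistic tokenizer as on the backend side.
    return text.lower().split()

def build_id_hash(prompt, hash_id):
    # ID_Hash: text entity ID -> list of prompt tokens t
    # such that the ID appears in Hash_ID[t]. It is a small,
    # local table restricted to the current prompt.
    id_hash = defaultdict(list)
    for token in tokenize(prompt):
        for eid in hash_id.get(token, {}):
            id_hash[eid].append(token)
    return id_hash

# Hypothetical Hash_ID built during backend processing.
hash_id = {
    "ranking": {1: 1, 2: 1},
    "rag": {1: 2},
}
id_hash = build_id_hash("ranking for RAG", hash_id)
print(dict(id_hash))  # {1: ['ranking', 'rag'], 2: ['ranking']}

# Placeholder score: number of matched prompt tokens per entity.
scores = {eid: len(tokens) for eid, tokens in id_hash.items()}
print(scores)  # {1: 2, 2: 1}
```

Because ID_Hash only contains entities that share at least one token with the prompt, scoring and ranking operate on a tiny table rather than the full corpus index.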

Follow this link to read the full article with all frontend steps and smart ranking, and to download the technical document with Python code (with links to GitHub) and a case study featuring the anonymized augmented corpus of a Fortune 100 company, as well as future LLM developments (auto-indexing and LLM for cataloging and glossary generation).
