To Embed or not to Embed ...
Arun Krishnan
Entrepreneur, technology and business leader, and author; experienced Data Science, AI, and GenAI professional, driving technology and business growth.
By now, everyone ought to be familiar with the Retrieval-Augmented Generation (RAG) approach: documents or text files are broken into chunks, the chunks are embedded into numerical vectors and stored in a vector database, and the chunks relevant to a query are retrieved using cosine similarity or some other similarity metric.
This has enabled the development of a wide range of applications without having to resort to the more expensive option of fine-tuning models. The current wave of Generative AI applications is driven, in large part, by the RAG approach.
This works well for content creation, translation or summarisation of text. However, when it comes to numerical and structured data, this approach might not work too well.
Why? What's the difference? Can't the LLM simply treat numerical data as just another form of text? Why not embed numerical tables and use them in the same way?
All very valid questions. However, the devil, as always, is in the details. Let's take a CSV file with rows and rows of comma-separated data. We know that the text will be broken into chunks of, say, 1024 tokens each. When this happens, our nice, structured table gets broken into several unrecognisable pieces, as the sketch below shows. When those chunks are queried, the results can be, to put it mildly, surprising.
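To make this concrete, here is a minimal sketch of naive fixed-size chunking applied to tabular data. The CSV content is made up, and the chunk size is set to 50 characters purely for demonstration; real pipelines chunk by tokens, but the failure mode is the same:

```python
# A minimal sketch of naive fixed-size chunking applied to CSV data.
# The data and the 50-character chunk size are illustrative only;
# real RAG pipelines typically chunk by tokens (e.g. 1024).

csv_text = (
    "product,quarter,sales\n"
    "Widget A,Q1,12000\n"
    "Widget A,Q2,15500\n"
    "Widget B,Q1,9800\n"
    "Widget B,Q2,11200\n"
)

chunk_size = 50  # characters, for demonstration only

# Slice the text into fixed-size pieces with no regard for row boundaries.
chunks = [csv_text[i:i + chunk_size] for i in range(0, len(csv_text), chunk_size)]

for n, chunk in enumerate(chunks):
    print(f"--- chunk {n} ---")
    print(chunk)

# With this chunk size, chunk 0 ends with "Widget A,Q" and chunk 1 begins
# with "2,15500", so the row linking Widget A, Q2 and 15500 exists in no
# single chunk. A query about Widget A's Q2 sales can only ever retrieve
# a fragment of the fact.
```

No single retrieved chunk contains the complete record, which is exactly the boundary problem described next.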
If the data our query demands does NOT sit on those chunk boundaries, we might, just might, get correct information back. If, however, the data does sit on a boundary, the responses might be no different from hallucinations.
A further disadvantage is that we can't ask complicated queries such as, say, "Show me the average sales by product by quarter, last year". The LLM will, in most cases, have no way to unpack the start and end dates of the data required to answer such a query.
A much better way is to use the LLM as a front end for analysing the question and translating it into a database query. You would, of course, have to give the prompt enough information for the LLM to generate the query with the right table and field names.
Why would this work?
Because the LLM, being a language model, is great at translation. And an SQL query is, after all, just another language. The LLM can easily translate between human language and a database query, provided all the relevant information is supplied, as the sketch below illustrates.
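Here is a minimal text-to-SQL sketch using the OpenAI Python SDK. The `sales` table, its columns, and the model name are assumptions made for this example; in practice you would supply your own schema and use whichever LLM your stack provides:

```python
# A minimal text-to-SQL sketch. The `sales` table, its columns, and the
# model name are hypothetical; adapt them to your own schema and LLM.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# The prompt carries the schema, so the LLM can generate the query
# with the right table and field names.
SCHEMA = """
Table: sales
Columns:
  product   TEXT    -- product name
  sale_date DATE    -- date of the sale
  amount    NUMERIC -- sale value
"""

def question_to_sql(question: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=[
            {
                "role": "system",
                "content": (
                    "Translate the user's question into a single SQL query "
                    "for the following schema. Return only the SQL.\n" + SCHEMA
                ),
            },
            {"role": "user", "content": question},
        ],
    )
    return response.choices[0].message.content.strip()

# For "Show me the average sales by product by quarter, last year",
# a well-prompted LLM can produce something along the lines of:
#
#   SELECT product,
#          EXTRACT(QUARTER FROM sale_date) AS quarter,
#          AVG(amount) AS avg_sales
#   FROM sales
#   WHERE EXTRACT(YEAR FROM sale_date) =
#         EXTRACT(YEAR FROM CURRENT_DATE) - 1
#   GROUP BY product, quarter
#   ORDER BY product, quarter;
```

The generated SQL can then be run directly against the database, with the result set returned to the user, and, if you like, summarised back into natural language by the same LLM.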
Want to learn more? Looking for something like this for your organisation? iLink can certainly help you with that. Let's connect!