GenAI Optimization Techniques - Part 1

Techniques used in the Store and Structure Phase

There is a view of LLMs as some sort of database containing tons of data: you can ask it anything and it will respond accurately, and if the response is not accurate, then we have to employ "optimization" to make it accurate.

That view operates on the assumption that LLMs are fact-based responders. In the field of Generative AI, that is largely a misguided view, especially considering that most LLMs are closed (we do not know what data they were trained on).


A better assumption is to imagine an LLM primarily as a vocabulary provider that offers relevance, not accuracy. In other words, you can ask an LLM to evaluate how relevant a response is.


To illustrate this, imagine going to a bookstore that stocks English books. Though all the books are written in the same language, the books in the Religion section will use a vocabulary quite different from the books in the Travel section.

Let's say the section tags were missing and all the books had blank covers. You, as the reader, would still be able to figure out whether you are in the Travel section or the Religion section. To determine this, you would employ your background knowledge (akin to an LLM) to evaluate what category the book you are reading belongs to. This is what I mean by relevance and context awareness.

If we view LLMs this way, it makes more sense to apply optimization at different stages within the GenAI process.

To briefly define optimization: it is a set of techniques applied at different stages within the GPT pipeline so that the response to a user's question will be accurate, relevant and safe.

  • Accurate - Fact-based, mathematical precision when precision is needed.
  • Relevant - Responds within context and style, and uses domain-specific language.
  • Safe - Ignores users' malicious questions, such as how to make an atom bomb, and does not reveal private or sensitive data.


I like to segment the GPT pipeline into two phases, though they may not execute sequentially in time.

Phase One Optimization

In this blog we will cover optimization techniques that are primarily used in Phase 1. In my next blog we will cover optimization techniques used in Phase 2, so stay tuned by clicking the subscribe button.

Let's quickly understand the Store and Structure phase. There are two steps that happen in this phase (refer to the diagram below).

  1. Ingestion
  2. Model embedding

Ingestion is the process of creating the necessary data pipelines to define and store the data for the model. The key components that create these data pipelines are:

  • Loaders - Components that import data from various sources.
  • Transformers - Components that chunk data into the relevant storage spaces and add contextual data (see the sketch after this list).
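
As a rough illustration, here is a minimal ingestion-pipeline sketch in Python. The Document class, load_text_files loader and transform function are hypothetical stand-ins to show the loader/transformer split, not any particular framework's API.

```python
# A minimal ingestion-pipeline sketch: a loader that imports data and a
# transformer that chunks it and attaches contextual metadata.
from dataclasses import dataclass, field
from typing import Iterable

@dataclass
class Document:
    text: str
    metadata: dict = field(default_factory=dict)

def load_text_files(paths: Iterable[str]) -> list[Document]:
    """Loader: import raw data from various sources (here, local text files)."""
    docs = []
    for path in paths:
        with open(path, encoding="utf-8") as f:
            docs.append(Document(text=f.read(), metadata={"source": path}))
    return docs

def transform(docs: list[Document], chunk_size: int = 500) -> list[Document]:
    """Transformer: chunk each document and carry its context along as metadata."""
    chunks = []
    for doc in docs:
        for i in range(0, len(doc.text), chunk_size):
            chunks.append(Document(
                text=doc.text[i:i + chunk_size],
                metadata={**doc.metadata, "chunk_start": i},
            ))
    return chunks

# Usage: documents = transform(load_text_files(["guide.txt"]))
```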

The diagram below sheds some light on the steps that happen during this phase.

Chunking algorithms are a way to apply optimization at the ingestion step; choosing the right chunking algorithm is an essential optimization technique. Here are a few chunking techniques.

  1. Chunking Techniques

  • Fixed Size Chunking - Split data into fixed-size chunks. The chunk size can be tuned to optimize for retrieval speed or for the size of the data being loaded.
  • Context Aware Chunking - Split data into chunks based on some criterion. For example, you can split text into chunks at sentence boundaries marked by periods (".") or at new lines. The sketch below shows both approaches.
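
Here is a small Python sketch of both techniques; the chunk size and sample text are illustrative only.

```python
import re

def fixed_size_chunks(text: str, chunk_size: int = 200) -> list[str]:
    """Fixed Size Chunking: split text into chunks of at most chunk_size characters."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

def context_aware_chunks(text: str) -> list[str]:
    """Context Aware Chunking: split at sentence boundaries (periods) or new lines."""
    parts = re.split(r"(?<=\.)\s+|\n+", text)
    return [p.strip() for p in parts if p.strip()]

sample = "LLMs provide relevance. Chunking prepares data for embedding.\nEach chunk is indexed."
print(fixed_size_chunks(sample, chunk_size=40))
print(context_aware_chunks(sample))
```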

Indexing, or model embedding, utilizes LLMs to add meaning to the chunks of data. This meaning is essentially a numerical representation of the chunked data's context and relative relevance. Refer to the earlier blog linked here to learn more about how this numerical representation is done. During this step, the numerical representations are stored as indices. These indices act more like the table of contents for a book. The sketch below shows what producing these numerical representations can look like.
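
A minimal sketch of turning chunks into numerical representations, assuming the sentence-transformers package and the all-MiniLM-L6-v2 model; any embedding model could be substituted here.

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")

chunks = [
    "Pilgrimage routes and sacred texts of the region.",
    "Budget hotels and train timetables for the region.",
]
embeddings = model.encode(chunks)   # one numerical vector per chunk
print(embeddings.shape)             # e.g. (2, 384) for this particular model
```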

  2. Embedding Model Choice

In this step, one may want to determine which embedding model is best suited to optimize for performance.

Here is a link to the Leaderboard to find embedding models that may be best suited for embedding/encoding. How you determine which model to choose is beyond the scope of this blog; the important point is that model choice is another optimization technique that can be applied in this phase. A rough comparison sketch follows.
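
One simple way to sanity-check candidates on your own data is to compare how strongly each model relates a typical question to a known relevant chunk. The model names, query and chunk below are illustrative assumptions, not recommendations.

```python
# Compare candidate embedding models on a small, domain-specific pair.
from sentence_transformers import SentenceTransformer, util

candidates = ["all-MiniLM-L6-v2", "all-mpnet-base-v2"]
query = "visa requirements for travel to Japan"
relevant_chunk = "Travelers need a valid passport and may require a visa for Japan."

for name in candidates:
    model = SentenceTransformer(name)
    q_vec, c_vec = model.encode([query, relevant_chunk])
    score = util.cos_sim(q_vec, c_vec).item()  # higher = more relevant
    print(f"{name}: cosine similarity = {score:.3f}")
```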

Based on that choice, the embedding data can be added to the vector database as indices. Again, the table of contents analogy helps here; below is a rough sketch of this step.
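
A minimal sketch of storing embeddings as an index, using FAISS as one example of a vector store; Chroma, pgvector and similar databases would work just as well.

```python
import faiss
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
chunks = ["Sacred texts of the region.", "Train timetables for the region."]

embeddings = model.encode(chunks).astype("float32")
index = faiss.IndexFlatL2(embeddings.shape[1])   # the "table of contents"
index.add(embeddings)

# Retrieval later: embed the question and look up the closest chunk.
query_vec = model.encode(["Where do I catch the train?"]).astype("float32")
distances, ids = index.search(query_vec, k=1)
print(chunks[ids[0][0]])
```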

In my next blog, we will cover optimization techniques used in Phase 2, so stay tuned by clicking the subscribe button.
