Mastering Chunking for RAG: Semantic vs Recursive vs Fixed Size
Zahiruddin Tavargere
Senior Principal Software Engineer @ Dell | Opinions are my own
Note: This article's read time was running past four minutes, so I am sharing a video instead.
This is Part 1 of the Advanced RAG Series.
When working with Retrieval Augmented Generation (RAG) models, selecting the right chunking method can make a huge difference in performance.
In my latest YouTube video, I dive deep into the three main chunking approaches—Semantic, Recursive, and Fixed Size—and evaluate their performance based on four critical metrics: context precision, faithfulness, answer relevancy, and context recall.
The chunking method you choose can impact how accurate and relevant the AI-generated answers are. So, which method strikes the perfect balance between retaining enough context and providing highly relevant, faithful responses?
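To make the differences concrete, here is a minimal, library-agnostic sketch of the three strategies in Python. This is an illustration, not the exact setup used in the video: the `embed` function in the semantic variant is a hypothetical placeholder for whatever embedding model you use, and the sizes, overlaps, and similarity threshold are illustrative defaults.

```python
# Minimal sketches of fixed-size, recursive, and semantic chunking.
# `embed` is a hypothetical callable (text -> vector); plug in any embedding model.
import re
from typing import Callable, List, Sequence


def fixed_size_chunks(text: str, size: int = 500, overlap: int = 50) -> List[str]:
    """Slide a fixed-width window over the text, ignoring document structure."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]


def recursive_chunks(text: str, size: int = 500,
                     separators: Sequence[str] = ("\n\n", "\n", ". ", " ")) -> List[str]:
    """Split on the coarsest separator first; recurse into pieces that are still too big.

    Simplified on purpose: production splitters also merge small pieces back up
    toward the target size.
    """
    if len(text) <= size or not separators:
        return [text]
    sep, rest = separators[0], separators[1:]
    chunks: List[str] = []
    for part in text.split(sep):
        if len(part) > size:
            chunks.extend(recursive_chunks(part, size, rest))
        elif part.strip():
            chunks.append(part)
    return chunks


def _cosine(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(y * y for y in b) ** 0.5
    return dot / (na * nb) if na and nb else 0.0


def semantic_chunks(text: str, embed: Callable[[str], Sequence[float]],
                    threshold: float = 0.75) -> List[str]:
    """Start a new chunk whenever two adjacent sentences drift apart semantically.

    Embeddings are recomputed per comparison here for brevity; cache them in practice.
    """
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text) if s]
    if not sentences:
        return []
    chunks, current = [], [sentences[0]]
    for prev, sent in zip(sentences, sentences[1:]):
        if _cosine(embed(prev), embed(sent)) >= threshold:
            current.append(sent)
        else:
            chunks.append(" ".join(current))
            current = [sent]
    chunks.append(" ".join(current))
    return chunks
```

The trade-off in a sentence: fixed-size is the cheapest but blind to structure, recursive respects natural document boundaries, and semantic keeps topically coherent sentences together at the cost of embedding calls.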
In the video, I break down each approach and show how it scores on those four metrics.
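The four metrics above match the ones exposed by the open-source Ragas library, so if you want to run this kind of comparison yourself, the harness looks roughly like the sketch below. This is an assumption about tooling rather than the video's exact setup, and the dataset column names and `evaluate` signature vary a little between Ragas versions.

```python
# Rough sketch of scoring one chunking strategy with Ragas (assumed tooling;
# column names and API details differ slightly across Ragas versions).
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import (
    context_precision,
    faithfulness,
    answer_relevancy,
    context_recall,
)

# One row per test question: the retrieved chunks, the generated answer,
# and a reference answer to compare against. Values here are placeholders.
eval_data = Dataset.from_dict({
    "question": ["What does the warranty cover?"],
    "contexts": [["Chunk 1 retrieved for this question...", "Chunk 2..."]],
    "answer": ["The warranty covers manufacturing defects for two years."],
    "ground_truth": ["Manufacturing defects are covered for 24 months."],
})

# Requires an LLM and embedding backend to be configured (e.g. an OpenAI API key).
scores = evaluate(
    eval_data,
    metrics=[context_precision, faithfulness, answer_relevancy, context_recall],
)
print(scores)
```

Run the same harness once per chunking strategy, swapping only the splitter that feeds your vector store, and compare the four scores side by side.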
If you're tuning a RAG pipeline or simply curious which chunking method works best, the video covers the insights you need to make the right choice.
Watch the full analysis and find out which chunking method is best for your use case: