Retrieval Augmented Generation (RAG)

Retrieval Augmented Generation (RAG) combines the strengths of Large Language Models (LLMs) with retrieval mechanisms. The term was first introduced by Meta AI researchers in a 2020 paper titled Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (https://arxiv.org/abs/2005.11401). However, it wasn't until early 2023 that RAG began to gain traction within enterprise organisations, when early adopters used it to provide the domain context needed by knowledge-based systems. Since then, the desire for greater reliability, efficiency, transparency, accuracy, flexibility and security, together with reduced latency, has driven the development of new RAG architecture patterns, as highlighted in the table below.
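
To make the core retrieve-then-generate idea concrete, here is a minimal, illustrative sketch in Python. The toy corpus, the naive keyword-overlap retriever and the call_llm() stand-in are all assumptions made for illustration; real systems would typically use embedding-based vector search and a hosted LLM API.

```python
# Minimal sketch of the basic RAG flow: retrieve relevant context,
# augment the prompt with it, then ask the LLM to answer.
# Retrieval here is naive keyword overlap purely for illustration.

CORPUS = [
    "Our refund policy allows returns within 30 days of purchase.",
    "Support is available Monday to Friday, 9am to 5pm GMT.",
    "Enterprise customers can request a dedicated account manager.",
]

def retrieve(query: str, corpus: list[str], top_k: int = 2) -> list[str]:
    """Score each document by word overlap with the query and return the top_k."""
    query_terms = set(query.lower().split())
    scored = sorted(
        corpus,
        key=lambda doc: len(query_terms & set(doc.lower().split())),
        reverse=True,
    )
    return scored[:top_k]

def call_llm(prompt: str) -> str:
    """Hypothetical stand-in for an LLM API call (e.g. a hosted chat endpoint)."""
    return f"<answer generated from prompt of {len(prompt)} characters>"

def rag_answer(question: str) -> str:
    # Augment the prompt with retrieved domain context before generation.
    context = "\n".join(retrieve(question, CORPUS))
    prompt = (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    return call_llm(prompt)

print(rag_answer("How long do I have to return a product?"))
```

The key design point the sketch illustrates is that the LLM is constrained to answer from retrieved domain context rather than from its parametric knowledge alone, which is what makes RAG attractive for knowledge-based enterprise systems.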

RAG Architecture Patterns

The table below highlights current RAG architecture patterns together with the pros, cons and emerging considerations for each.

[Table: RAG Architectural Patterns]


Comments

Shubrashankh Chatterjee
Building scalable AI systems | Ex-JP Morgan | Ex-Amex | Startmate S25 Coach | Data Scientist | Machine Learning Engineer
2 months ago

My biggest learning after implementing multiple RAG systems last year: in 90% of use cases you are bottlenecked by the quality of your retrieval system and by how good the pre-production corpus storage and upstream pipelines are. The basics of retrieval systems still outweigh any UX gain you might get from LLMs.
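
The retrieval bottleneck the comment describes can be measured directly. Below is a minimal sketch of a recall@k check, assuming a small labelled evaluation set of (query, relevant document id) pairs; the toy corpus and keyword-overlap retriever are hypothetical placeholders for whatever retriever is actually being evaluated.

```python
# Minimal sketch of measuring retrieval quality with recall@k,
# assuming a labelled evaluation set of (query, relevant_doc_id) pairs
# and a retrieve() function that returns ranked document ids.

def recall_at_k(eval_set, retrieve, k: int = 5) -> float:
    """Fraction of queries whose relevant document appears in the top-k results."""
    hits = 0
    for query, relevant_doc_id in eval_set:
        if relevant_doc_id in retrieve(query)[:k]:
            hits += 1
    return hits / len(eval_set)

# Hypothetical usage: a toy keyword-overlap retriever over a three-document corpus.
DOCS = {"d1": "refund policy 30 days", "d2": "support hours", "d3": "account manager"}

def retrieve(query: str) -> list[str]:
    terms = set(query.lower().split())
    return sorted(DOCS, key=lambda d: len(terms & set(DOCS[d].split())), reverse=True)

eval_set = [
    ("what is the refund policy", "d1"),
    ("what are the support hours", "d2"),
]
print(f"recall@1 = {recall_at_k(eval_set, retrieve, k=1):.2f}")
```

Tracking a metric like this against a held-out evaluation set is one way to check whether retrieval, rather than the LLM, is the limiting factor before investing in prompt or model changes.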
