The Ins and Outs of Retrieval-Augmented Generation (RAG)
Photo by Frank Zhang on Unsplash

When accessible large language models first came on the scene, the excitement was impossible to miss: beyond their sheer novelty, they came with the promise to completely transform numerous fields and lines of work.

Almost a year after the launch of ChatGPT, we’re far more aware of LLMs’ limitations, and of the challenges we face when we try to integrate them into real-world products. We’ve also, by now, come up with powerful strategies to complement and enhance LLMs’ potential; among these, retrieval-augmented generation (RAG) has emerged as—arguably—the most prominent. It gives practitioners the power to connect pre-trained models to external, up-to-date information sources, so that the models can generate more accurate and more useful outputs.
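To make the pattern concrete, here’s a minimal sketch of the retrieve-then-generate loop at the heart of RAG. The documents, the toy bag-of-words “embedding,” and the helper names are illustrative stand-ins; a real system would use a trained embedding model, a vector store, and an actual LLM call in place of the final print.

```python
# Minimal RAG sketch: retrieve the most relevant documents for a query,
# then fold them into the prompt sent to the language model.
from collections import Counter
import math

# Stand-in knowledge base; in practice this would be your own documents.
DOCUMENTS = [
    "Our return policy allows refunds within 30 days of purchase.",
    "The ProWidget 3000 ships with a two-year limited warranty.",
    "Support is available by email from 9am to 5pm EST on weekdays.",
]

def embed(text: str) -> Counter:
    # Toy bag-of-words representation; a real system would use an embedding model.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, k: int = 2) -> list[str]:
    # Rank documents by similarity to the query and keep the top k.
    q = embed(query)
    ranked = sorted(DOCUMENTS, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query: str) -> str:
    # Augment the user question with retrieved context before generation.
    context = "\n".join(retrieve(query))
    return f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"

if __name__ == "__main__":
    # The resulting prompt would then be passed to an LLM of your choice.
    print(build_prompt("What is the warranty on the ProWidget 3000?"))
```

The key design point is that the model never needs retraining: freshness and accuracy come from what you retrieve at query time, which is why RAG pairs so naturally with off-the-shelf pre-trained models.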

This week, we’ve gathered a potent lineup of articles that explain the intricacies and practical considerations of working with RAG. Whether you’re deep in the ML trenches or approaching the topic from the perspective of a data scientist or product manager, gaining a deeper familiarity with this approach can help you prepare for whatever the future of AI tools brings.

If you’re still in the earlier stages of your data science journey and need some expert guidance before you can jump into more specialized topics like RAG, we’ve got you covered, too. From our partners, we’re thrilled to share the AI and Data Scientist Roadmap. Check out this step-by-step guide to becoming an AI or Data Scientist in 2023, along with all the resources you’ll need to help you learn.


  • Add Your Own Data to an LLM Using Retrieval-Augmented Generation (RAG). For a beginner-friendly introduction to the topic, Beatriz Stollnitz’s recent deep dive is a terrific resource to visit and bookmark for future reference. It goes through the theoretical foundations of RAG before transitioning to a hands-on basic implementation, showing how you can create a chatbot to help customers find information about the products a company sells.
  • 10 Ways to Improve the Performance of Retrieval Augmented Generation Systems. If you’ve already started tinkering with RAG in your projects, you’ve likely observed that setting it up is one thing, but making it work consistently and produce the intended results is another: “RAG is easy to prototype, but very hard to productionize.” Matt Ambrogi’s guide provides pragmatic insights on bridging the gap between the framework’s potential and more tangible benefits.

  • RAG vs Finetuning – Which Is the Best Tool to Boost Your LLM Application? There are more than a few alternatives to RAG when it comes to building better AI products. Heiko Hotz offers a nuanced and thorough comparison of RAG and model fine-tuning, another prominent strategy for upgrading the performance of generic LLMs. Ultimately, as Heiko eloquently puts it, “There is no one-size-fits-all solution; success lies in aligning the optimisation method with the specific requirements of the task.”


For other excellent reads on topics ranging from counterfactual insights to dynamic pricing, we hope you explore some of our other recent highlights:

Thank you for supporting our authors’ work! If you enjoy the articles you read on TDS, consider becoming a Medium member, which unlocks our entire archive (and every other post on Medium, too).
