CAG vs RAG: Which One to Use?
Louis-Fran?ois Bouchard
Making AI accessible. ?? What's AI on YouTube. Co-founder at Towards AI. ex-PhD Student.
If you're using ChatGPT or other AI models, you've probably noticed they sometimes give incorrect information or hallucinate.
RAG helps solve this by searching through external documents, but this new approach takes a completely different approach - and it might just be what you need!
Good morning everyone! As always, this is Louis-Fran?ois, co-founder and CTO at Towards AI, and today, we'll dive deep into something really exciting: Cache-Augmented Generation, or CAG.
In the early days of LLMs, context windows, which is what we send them as text, were small, often capped at just 4,000 tokens (or 3,000 words), making it impossible to load all relevant context.
This limitation gave rise to approaches like Retrieval-Augmented Generation (RAG) in 2023, which dynamically fetches the necessary context.
As LLMs evolved to support much larger context windows—up to 100k or even millions of tokens—new approaches like caching, or CAG, began to emerge, offering a true alternative to RAG...
领英推荐
And that's it for this iteration! I'm incredibly grateful that?the What's AI newsletter?is now read by over 20,000 incredible human beings. Click here to share this iteration with a friend if you learned something new!
Looking for more cool AI stuff? ??
Want to share a product, event or course with my AI community? Reply directly to this email, or visit my Passionfroot profile to see my offers.
Thank you for reading, and I wish you a fantastic week! Be sure to have?enough sleep and physical activities next week!
Louis-Fran?ois Bouchard
Expertise in full IT service life cycle, focusing testing, delivery and analysis to Improve infrastructure and application observability, performance, availability and reliability.
4 周CAG make more sense needing less GPU/TPUs, response is faster with less memory depending upon rule requirements , embedding, fine tuning and how much context aware response is needed, surly faster but more effectively and efficient for specialized area/ jobs
I help teams navigate and negotiate change. Applying real-time alerts to align products, influencers and customers is my forte. ???? ??
4 周?? KNOWLEDGE, just like TRUST, is a simple vendor MYTH ?? ???????? ?????? ???? ??