DFS-RAG

Priyesh Srivastava

Published: September 14, 2024

Since OpenAI O1 came out, I have been seeing a lot of discussion around Chain of Thought + Reinforcement Learning fine-tuning, actor-critic models, Monte Carlo tree search, and so on.

But if you are working with structured content, as in code generation or legal circulars, you can hotfix a lot of this with DFS RAG.

DFS RAG follows the same process you see in the OpenAI O1 model's chain of thought, but because each step is a distinct model call, the flow is much more parallelizable in production. It is also more constrained when using temperature > 0.2 and top-p < 0.95, which may be required when the model needs some freedom.
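One way to read the parallelism point, as a minimal sketch: if every step of the depth-first expansion is its own model call, then sibling branches can be issued concurrently while each parent still waits for its children. Everything named here is an assumption for illustration: `call_model` is a hypothetical stub standing in for a real LLM client, and the `SUBTASKS` tree is made up rather than taken from the original flow.

```python
import asyncio

# Hypothetical task tree: each step lists the sub-steps it depends on.
SUBTASKS = {
    "generate_handler": ["resolve_request_type", "resolve_response_type"],
    "resolve_request_type": [],
    "resolve_response_type": ["resolve_error_type"],
    "resolve_error_type": [],
}


async def call_model(step: str, child_results: list[str]) -> str:
    """Stub for one independent model call (assumption: swap in a real client)."""
    await asyncio.sleep(0)  # yield control, as a real network call would
    return f"{step}({', '.join(child_results)})"


async def solve(step: str) -> str:
    # Depth-first: children are resolved before the parent is called.
    # Sibling branches are awaited together, so they can run concurrently.
    children = await asyncio.gather(*(solve(child) for child in SUBTASKS[step]))
    return await call_model(step, list(children))


def run(root: str) -> str:
    """Synchronous entry point for the sketch."""
    return asyncio.run(solve(root))
```

Calling `run("generate_handler")` resolves the two type branches concurrently and only then issues the top-level call. With a real client, sampling settings such as temperature and top-p would be set per call inside `call_model`.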

Here is a flow for code-generation-style use cases:

As you can see, there are 2 databases. You can prepare them by extracting, annotating, and linking the type definitions of custom structs. If you have nested custom types in structs, you need to flatten them out when generating context.
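The flattening step can be sketched as a depth-first walk over linked type definitions. This is only an illustration under stated assumptions: the `TYPE_DEFS` dictionary is a hypothetical stand-in for a store of extracted type definitions, and the struct names, `flatten_types`, and `build_context` are invented for the example rather than taken from the actual pipeline.

```python
# Hypothetical type-definition store: struct name -> {field name: field type}.
TYPE_DEFS = {
    "Order": {"id": "int", "customer": "Customer", "items": "LineItem"},
    "Customer": {"name": "str", "address": "Address"},
    "Address": {"city": "str", "zip": "str"},
    "LineItem": {"sku": "str", "qty": "int"},
}


def flatten_types(root: str, type_defs: dict[str, dict[str, str]]) -> list[str]:
    """Return custom types reachable from root, dependencies first (DFS post-order)."""
    ordered: list[str] = []
    seen: set[str] = set()

    def visit(name: str) -> None:
        # Skip built-in types and anything already visited (this also guards cycles).
        if name in seen or name not in type_defs:
            return
        seen.add(name)
        for field_type in type_defs[name].values():
            visit(field_type)
        ordered.append(name)

    visit(root)
    return ordered


def build_context(root: str, type_defs: dict[str, dict[str, str]]) -> str:
    """Render every reachable definition as one flat block of prompt context."""
    lines = []
    for name in flatten_types(root, type_defs):
        fields = ", ".join(f"{f}: {t}" for f, t in type_defs[name].items())
        lines.append(f"struct {name} {{ {fields} }}")
    return "\n".join(lines)
```

Here `build_context("Order", TYPE_DEFS)` emits `Address`, `Customer`, `LineItem`, and `Order` in that order, so every nested type is defined in the context before the struct that uses it.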

We have adapted a similar depth-first flow to work with compliance documents at OnFinance AI, at 1/20th the cost of OpenAI O1, using open source models. This makes sure you can track versions of regulator documents without any manual intervention, no matter how complex the gazette/regulation.

Email:[email protected] to get a demo of our compliance copilot.
