StructRAG Explained: Revolutionizing Structured Data Reasoning

Jose Luis Latorre

IT & Dev Community Lead & Software Architect at Swiss Life AG | Generative AI & Agentic AI Engineer & Enthusiast | LinkedIn Learning Course Author | Helping people understand and apply AI | Microsoft AI MVP | Speaker

Published: January 12, 2025

The field of AI has been revolutionized by Retrieval-Augmented Generation (RAG) techniques, enabling models to combine the power of retrieval systems with generative AI. This approach addresses a key limitation of language models: their inability to stay updated with external, dynamic knowledge. By fetching relevant data and synthesizing it with generative capabilities, RAG has opened new possibilities for knowledge-intensive reasoning. Over time, RAG has evolved into more specialized methods, including GraphRAG and the latest innovation, StructRAG.

In this article, I will introduce these RAG methods, dive into how StructRAG builds upon its predecessors, and explore Kévin Beaugrand's open-source implementation of StructRAG.

Traditional RAG: The Foundation

Traditional RAG is a two-step process that enhances the capabilities of large language models (LLMs):

Retrieve: Using a retrieval system, the most relevant documents are fetched based on a user’s query. This retrieval step leverages embeddings and similarity metrics to locate relevant knowledge.
Generate: The LLM processes the retrieved data, combining it with its internal knowledge to generate an accurate and context-aware response.

While effective, traditional RAG assumes a linear and flat structure of documents. It is well-suited for general-purpose queries but struggles when dealing with complex relationships, hierarchies, or structured data such as graphs or tables.
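The two steps above can be sketched in a few lines of Python (used here for brevity; the repository discussed later is .NET). This is a minimal illustration under stated assumptions: the toy corpus is invented, a bag-of-words count vector stands in for a real embedding model, and the LLM call is left out, so the sketch stops at the prompt that would be sent.

```python
import math
import re
from collections import Counter

# Toy corpus (invented); in a real system these would be chunks of your own documents.
DOCUMENTS = [
    "StructRAG routes each query to a suitable structure such as a table or graph.",
    "GraphRAG models relationships between documents as nodes and edges.",
    "Traditional RAG retrieves relevant text chunks and passes them to an LLM.",
]


def embed(text: str) -> Counter:
    """Stand-in embedding: a bag-of-words count vector (an assumption, not a real model)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, documents: list[str], top_k: int = 2) -> list[str]:
    """Step 1 (Retrieve): rank documents by similarity to the query, keep the best top_k."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:top_k]


def build_prompt(query: str, context: list[str]) -> str:
    """Step 2 (Generate), input side only: the prompt an LLM would receive."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"


if __name__ == "__main__":
    question = "How does GraphRAG model relationships between documents?"
    print(build_prompt(question, retrieve(question, DOCUMENTS)))
```

Each document is scored on its own, which is the "flat" assumption described above: nothing in the ranking knows how one document relates to another.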

GraphRAG: Navigating Complex Relationships

GraphRAG extends traditional RAG by incorporating graph-based relationships between pieces of information. Instead of treating documents as independent entities, GraphRAG models the relationships between them, enabling reasoning over connected knowledge. This approach is particularly useful in domains like academic research, where papers cite each other, or in enterprise settings with linked datasets.

By treating documents and their connections as nodes and edges in a graph, GraphRAG allows for:

Contextualized retrieval, leveraging document relationships.
Enhanced reasoning by understanding how knowledge pieces influence each other.

However, GraphRAG is limited when dealing with multi-modal or highly structured information, such as detailed tables, catalogs, or algorithms.
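As a rough Python illustration of the idea of retrieval over connected documents, the sketch below first finds seed papers by keyword match and then follows citation edges to pull in their neighbors. The paper titles, the citation graph, and the function names are all hypothetical, and real GraphRAG systems build and query their graphs in far more elaborate ways.

```python
from collections import deque

# Hypothetical mini-corpus of papers and a citation graph between them.
PAPERS = {
    "A": "Survey of retrieval-augmented generation",
    "B": "Dense passage retrieval for open-domain question answering",
    "C": "Knowledge graphs for enterprise search",
    "D": "Evaluating long-context language models",
}
CITES = {"A": ["B", "C"], "B": [], "C": ["D"], "D": []}  # edges: paper -> papers it cites


def seed_matches(query: str) -> list[str]:
    """Plain retrieval step: papers whose title shares a word with the query."""
    words = set(query.lower().split())
    return [pid for pid, title in PAPERS.items() if words & set(title.lower().split())]


def expand(seeds: list[str], max_hops: int = 1) -> list[str]:
    """Graph step: breadth-first walk along citation edges, up to max_hops away."""
    seen = list(seeds)
    queue = deque((s, 0) for s in seeds)
    while queue:
        node, hops = queue.popleft()
        if hops == max_hops:
            continue  # do not walk further than the hop limit
        for neighbor in CITES[node]:
            if neighbor not in seen:
                seen.append(neighbor)
                queue.append((neighbor, hops + 1))
    return seen


if __name__ == "__main__":
    seeds = seed_matches("survey")
    print(seeds, "->", expand(seeds, max_hops=2))
```

The hop limit is the main design knob in this sketch: a larger value brings in more context through the edges, at the cost of more loosely related material.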

StructRAG: Structured Knowledge Reasoning

StructRAG, introduced in the paper “StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization” (https://arxiv.org/abs/2410.08815), pushes the boundaries further. Unlike GraphRAG, StructRAG is designed to handle structured data alongside unstructured documents.

How StructRAG Works

StructRAG operates through a structured pipeline that dynamically adapts to different data structures and scenarios by intelligently routing queries to the appropriate format, such as graphs, tables, or catalogs, ensuring optimal processing and reasoning for each case.

Here's how its main components work, explained simply:

Router:

Decides the type of data structure needed to answer the query (e.g., graph, table, catalog, chunk, or algorithm).
Think of it as a traffic controller that directs questions to the most relevant type of data.

Structurizer:

Takes the chosen data and organizes it in a way that makes sense for the query.
For example, it might extract rows from a table, highlight connections in a graph, or sort items in a catalog.

Utilizer:

Combines all the processed data and generates a clear, complete answer.
It’s like a storyteller weaving the data into a coherent and useful response.
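The three components above can be wired together as in the following Python sketch. It is an illustration under loud assumptions, not the paper's method: keyword rules stand in for the router, only the table structure is actually built, the "name: value" input format is invented, and the final LLM call is omitted.

```python
from dataclasses import dataclass


def route(query: str) -> str:
    """Router: pick table, graph, algorithm, catalog, or chunk.

    Keyword rules are a stand-in (assumption) for a learned routing model.
    """
    q = query.lower()
    if any(w in q for w in ("compare", "average", "total", "highest")):
        return "table"
    if any(w in q for w in ("related", "connected", "cites")):
        return "graph"
    if any(w in q for w in ("steps", "procedure", "how do i")):
        return "algorithm"
    if any(w in q for w in ("list", "categories", "overview")):
        return "catalog"
    return "chunk"


@dataclass
class Structured:
    kind: str
    content: str


def structurize(kind: str, documents: list[str]) -> Structured:
    """Structurizer: reshape raw text into the chosen form (only 'table' is built here)."""
    if kind == "table":
        # Hypothetical input format: one "name: value" fact per document.
        rows = [doc.split(":", 1) for doc in documents if ":" in doc]
        lines = [f"{k.strip()} | {v.strip()}" for k, v in rows]
        return Structured("table", "\n".join(lines))
    return Structured("chunk", "\n".join(documents))  # fallback: plain text chunks


def utilize(query: str, knowledge: Structured) -> str:
    """Utilizer: build the final prompt over the structured knowledge (LLM call omitted)."""
    return f"Using this {knowledge.kind}:\n{knowledge.content}\n\nAnswer: {query}"


def answer(query: str, documents: list[str]) -> str:
    """Run router, structurizer, and utilizer in sequence."""
    return utilize(query, structurize(route(query), documents))


if __name__ == "__main__":
    print(answer("Which region has the highest revenue?", ["EMEA: 120", "APAC: 95"]))
```

Keeping the three stages as separate functions mirrors the separation described above, so that any one of them could be swapped for a model-backed version without touching the others.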

[Figure: the StructRAG process, from the original paper]

Key Advantages of StructRAG

StructRAG excels in scenarios where structured and semi-structured data are critical. Examples include:

Financial Reports: Analyzing tables of metrics and generating insights.
Scientific Research: Extracting algorithm descriptions and reasoning across multi-modal datasets.
Enterprise Knowledge: Synthesizing catalogs, hierarchical documents, and structured reports into actionable outputs.

StructRAG’s ability to handle complex, structured data while still leveraging unstructured content makes it a strong fit for knowledge-intensive reasoning tasks.

Comparison of RAG Methods

Of the three methods, StructRAG is the most flexible: it selects and builds a suitable structure for each query through its routing and structuring stages, whereas GraphRAG relies on pre-constructed graphs and is therefore limited to the relationships defined in advance.

[Figure: benchmark results, from the original paper]

In short, the paper reports that StructRAG outperforms the other techniques, exceeding all baselines. It also achieves the best average performance, with its latency compared in the following image, also from the original paper:

[Figure: average performance and latency, from the original paper]

Kévin Beaugrand’s StructRAG Implementation

Kévin Beaugrand’s repository, KernelMemory.StructRAG, provides a .NET implementation of StructRAG that builds on Microsoft’s KernelMemory for retrieval and reasoning and extends its functionality. The repository emphasizes modularity, allowing developers to customize the routing, structuring, and reasoning processes for specific use cases. Familiarity with KernelMemory is essential for understanding and using this implementation effectively.

Implementation Highlights

AskAsync: This method retrieves relevant records and orchestrates the RAG pipeline. It:

Uses a “Router” to determine the type of structure required for the query.
Processes the retrieved information via the “ConstructAsync” and “DecomposeAsync” methods to handle structured data.
Generates the final response by merging synthesized knowledge.

Router and Structurizer: These modules play a critical role in identifying the appropriate structure (graph, table, or catalog) and organizing the information for subsequent reasoning.

Integration with Prompts: Prompts are defined for each stage (e.g., “Route,” “ConstructGraph,” “Decompose”). They guide the model’s reasoning process, ensuring contextually relevant outputs.
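To make the flow of stage prompts concrete, here is a hedged Python sketch of an orchestration of this shape. It is not the repository's code: the stage names “Route,” “ConstructGraph,” and “Decompose” come from the description above, while the prompt wording, the "Answer" stage, the fake_llm stub, and the choice to run construction and decomposition concurrently are all assumptions made for illustration.

```python
import asyncio

# Hypothetical prompt templates keyed by stage name. The wording is invented.
PROMPTS = {
    "Route": "Choose one structure (table, graph, catalog, algorithm, chunk) for: {query}",
    "ConstructGraph": "Extract (subject, relation, object) triples from:\n{records}",
    "Decompose": "Split into simpler sub-questions: {query}",
    "Answer": "Knowledge:\n{knowledge}\n\nSub-questions:\n{subquestions}\n\nAnswer: {query}",
}


async def fake_llm(prompt: str) -> str:
    """Stand-in for a chat-completion call: says 'graph' when routing, else echoes line one."""
    await asyncio.sleep(0)  # yield control, as a real network call would
    if prompt.startswith("Choose one structure"):
        return "graph"
    return prompt.splitlines()[0]


async def ask(query: str, records: list[str], llm=fake_llm) -> str:
    """Run the stages in order: route, then construct and decompose, then answer."""
    structure = (await llm(PROMPTS["Route"].format(query=query))).strip().lower()
    # Pick the construction prompt for the routed structure; only the graph one exists here.
    construct = PROMPTS.get(f"Construct{structure.capitalize()}", PROMPTS["ConstructGraph"])
    # In this sketch the two middle stages do not depend on each other, so they overlap.
    knowledge, subquestions = await asyncio.gather(
        llm(construct.format(records="\n".join(records))),
        llm(PROMPTS["Decompose"].format(query=query)),
    )
    return await llm(
        PROMPTS["Answer"].format(knowledge=knowledge, subquestions=subquestions, query=query)
    )


if __name__ == "__main__":
    print(asyncio.run(ask("Who cites whom?", ["Paper A cites paper B."])))
```

Passing the model call in as a parameter keeps the orchestration testable without a live endpoint, which is why the stub can be replaced by a real client without changing ask.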

How to Use the Repository

Setup: Clone the repository and configure the necessary settings using environment variables or the appsettings.json file. For environment variables, you can use commands like setx on Windows or export on Linux/Mac. For example:

Windows Command Prompt (Persistent):

setx AzureOpenAIEvaluationChatCompletion__APIKey your-api-key
setx AzureOpenAIEvaluationChatCompletion__Endpoint https://your-endpoint.openai.azure.com/

Linux/Mac (Temporary):

export AzureOpenAIEvaluationChatCompletion__APIKey=your-api-key
export AzureOpenAIEvaluationChatCompletion__Endpoint=https://your-endpoint.openai.azure.com/

These environment variables map to the configuration keys in the application. This approach suits deployment scenarios where file-based configuration is impractical, and it avoids committing secrets to your codebase. If you use the appsettings.json file instead, the section looks like this:

{
  "AzureOpenAICompletion": {
    "APIKey": "your-api-key",
    "Endpoint": "https://your-endpoint.openai.azure.com/"
  }
}
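The double underscore in the environment variable names above follows the .NET configuration convention, where it stands for the section separator, so a flat variable name addresses a nested key. The Python sketch below illustrates that mapping only; it is not how the .NET provider is implemented, the function name nest_settings is hypothetical, and details such as case-insensitive keys are not modeled.

```python
def nest_settings(environ: dict[str, str], separator: str = "__") -> dict:
    """Turn flat NAME__Key=value pairs into nested dictionaries, one level per separator."""
    config: dict = {}
    for name, value in environ.items():
        if separator not in name:
            continue  # not a hierarchical setting; ignored in this sketch
        *parents, leaf = name.split(separator)
        node = config
        for part in parents:
            node = node.setdefault(part, {})  # create the section on first use
        node[leaf] = value
    return config


if __name__ == "__main__":
    env = {
        "AzureOpenAIEvaluationChatCompletion__APIKey": "your-api-key",
        "AzureOpenAIEvaluationChatCompletion__Endpoint": "https://your-endpoint.openai.azure.com/",
        "PATH": "/usr/bin",
    }
    print(nest_settings(env))
```

The result has the same shape as a JSON configuration section: one object named after the part before the separator, with APIKey and Endpoint as its members.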

Query Execution: Use the AskAsync method to pass a query and receive structured responses. Be sure to check the sample project in the repository for a practical demonstration. Additionally, the KernelMemory.Evaluation package from Kévin Beaugrand is a useful resource that simplifies evaluation tasks and complements StructRAG's capabilities.

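// memoryDb, textGenerator, config, and loggerFactory are assumed to be set up beforehand (see the sample project).
var client = new StructRAGSearchClient(memoryDb, textGenerator, config, loggerFactory);
// Ask a question against the "index-name" index and print the generated answer.
var response = await client.AskAsync("index-name", "What are the main insights from the sales data?");
Console.WriteLine(response.Result);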

As mentioned, for the full experience, go to the sample project's Program.cs and study how it uses the client together with the Evaluator.

Conclusion

StructRAG represents a significant advancement in RAG methodologies, enabling models to reason over structured and unstructured data together. Kévin Beaugrand’s implementation provides an excellent foundation for exploring this paradigm. Whether you’re working with financial data, academic research, or technical documentation, StructRAG offers a powerful toolset to extract and synthesize complex knowledge.

For more insights and updates, explore the linked resources and start experimenting with StructRAG today!

Curious about how Generative and Agentic AI are shaping the future, perhaps along with Semantic Kernel and AutoGen?

Follow José Luis Latorre for real insights and practical examples of these technologies in action.

Anisha Mane

2 months ago

This is a great topic. Hadn't read about it before. Thank you!

Kévin BEAUGRAND

Developer and craftsman in the field of information technology - Microsoft MVP AI Platform & Azure AI Services

2 months ago

Thank you very much, Jose Luis Latorre, for this article and for sharing it. I'm very excited about discussing this RAG method live with you; I'm pretty sure interesting things will come up during our session together.

Jordi Gonzalez Segura

CEO/CIO greenYng & Co-founder at greenYng & greenYng energY. #YoutúYou #YoudecideYourwasteisVALUE #YoudecideYourwasteisENERGY

2 months ago

I don't know much about StructRAG, and it is surely nothing new, but we've found an 'additional' value in RAG; we call it greensemantYcnet. We developers establish flow routing through synthetic programming, but what if we were to take the networking of actions, processes, and agents to the semantic level with RAG? Surely it is nothing new, but it's very exciting.

Jose Luis Latorre

2 months ago

And of course, it wouldn't be complete without some contribution love, so I already went deep into the repo code. Check out https://github.com/kbeaugrand/KernelMemory.StructRAG

Jose Luis Latorre

2 months ago

Dimitrios Toulakis, I expect your feedback and some resharing love - you asked me for it, so here it is.
