GraphRAG: A Powerful but Expensive and Slow Solution

Microsoft's GraphRAG architecture represents a significant advancement in Retrieval-Augmented Generation (RAG) systems, offering a comprehensive solution for handling both specific and broad queries. Traditional RAG systems, which retrieve a limited number of document chunks as context for language models, often fall short when answering high-level questions that require a full understanding of the content.

GraphRAG enhances the traditional approach by integrating a vector store with a knowledge graph that captures entities, relationships, hierarchical communities, community reports, and covariates (claims). Summarizing information at these different hierarchical levels lets the system produce detailed and accurate responses to both narrow and broad questions.
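
At query time, the two sides complement each other: the vector store finds entry-point entities for a question, and the knowledge graph supplies their relationships and community context. The sketch below is only a conceptual illustration of that idea, not GraphRAG's actual query implementation; the embed() placeholder and the in-memory toy graph are assumptions.

    import numpy as np
    import networkx as nx

    def embed(text: str) -> np.ndarray:
        # Placeholder embedding (assumption); a real system would call an embedding model.
        rng = np.random.default_rng(abs(hash(text)) % (2**32))
        return rng.standard_normal(64)

    # Toy knowledge graph: entity nodes with descriptions, edges with relationship summaries.
    graph = nx.Graph()
    graph.add_node("GraphRAG", description="RAG system that combines vector search with a knowledge graph")
    graph.add_node("Knowledge Graph", description="Entities, relationships, and hierarchical communities")
    graph.add_edge("GraphRAG", "Knowledge Graph", summary="GraphRAG builds and queries a knowledge graph")

    # Toy vector store: entity name -> embedding of its description.
    entity_vectors = {name: embed(data["description"]) for name, data in graph.nodes(data=True)}

    def build_context(query: str, top_k: int = 1) -> str:
        # 1) vector search finds the closest entities; 2) the graph adds their neighbours.
        q = embed(query)

        def similarity(name: str) -> float:
            v = entity_vectors[name]
            return float(np.dot(q, v) / (np.linalg.norm(q) * np.linalg.norm(v)))

        lines = []
        for entity in sorted(entity_vectors, key=similarity, reverse=True)[:top_k]:
            lines.append(f"Entity: {entity} - {graph.nodes[entity]['description']}")
            for neighbour in graph.neighbors(entity):
                lines.append(f"Related: {neighbour} ({graph.edges[entity, neighbour]['summary']})")
        return "\n".join(lines)

    print(build_context("How does GraphRAG use a knowledge graph?"))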

The workflow of GraphRAG involves chunking documents, creating embeddings, extracting and resolving entities and relationships, detecting hierarchical communities, and mapping text chunks to these entities. The indexing pipeline can be summarized in six phases (see the reference at the end of this article for details):

Phase 1: Compose text units

[Document -> Chunk -> Text Units (TU)]
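
A minimal sketch of this phase, splitting a document into overlapping token windows with tiktoken; the chunk size and overlap values here are illustrative stand-ins for GraphRAG's chunking configuration.

    import tiktoken

    def make_text_units(document: str, chunk_tokens: int = 300, overlap_tokens: int = 100) -> list[str]:
        # Split a document into overlapping token windows ("text units").
        # Chunk size and overlap are illustrative; GraphRAG exposes both as configuration.
        enc = tiktoken.get_encoding("cl100k_base")
        tokens = enc.encode(document)
        units, start = [], 0
        while start < len(tokens):
            units.append(enc.decode(tokens[start:start + chunk_tokens]))
            if start + chunk_tokens >= len(tokens):
                break
            start += chunk_tokens - overlap_tokens
        return units

    sample = "GraphRAG builds a knowledge graph from source documents. " * 100
    print(len(make_text_units(sample)), "text units")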

Phase 2: Graph Extraction

[Text Units -> Entity/Relationship Extraction -> ER Summarization -> Entity Resolution -> Claim Extraction -> Graph Tables (GT)]
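
GraphRAG drives this phase with few-shot LLM prompts per text unit (plus optional "gleaning" passes that ask the model whether it missed anything), which accounts for much of the token spend. Below is a hedged sketch of a single extraction call, assuming an OpenAI-compatible client and a simplified JSON output; the real prompts and delimited output format are more elaborate.

    import json
    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    def extract_graph(text_unit: str) -> dict:
        # One entity/relationship extraction pass over a single text unit (simplified prompt).
        prompt = (
            "Extract all entities and relationships from the text below. "
            'Return JSON with keys "entities" (name, type, description) and '
            '"relationships" (source, target, description).\n\nText:\n' + text_unit
        )
        response = client.chat.completions.create(
            model="gpt-4o-mini",  # model choice is an assumption
            messages=[{"role": "user", "content": prompt}],
            response_format={"type": "json_object"},
        )
        return json.loads(response.choices[0].message.content)

    fragment = extract_graph("Microsoft released GraphRAG, which builds a knowledge graph from documents.")
    print(fragment["entities"])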

Phase 3: Graph Augmentation

[GT -> Community Detection -> Graph Embedding -> Augmented Graph Tables (AGT)]
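
GraphRAG applies hierarchical Leiden clustering (via the graspologic library) to split the entity graph into communities at several levels, and can also compute node embeddings (e.g. node2vec). The sketch below uses networkx's built-in Louvain partitioning as a widely available stand-in for Leiden.

    import networkx as nx
    from networkx.algorithms import community

    # Toy entity graph; in GraphRAG the nodes and edges come from the extraction phase.
    g = nx.Graph()
    g.add_edges_from([
        ("GraphRAG", "Knowledge Graph"),
        ("GraphRAG", "Microsoft"),
        ("Knowledge Graph", "Leiden"),
        ("Leiden", "Community Detection"),
    ])

    # Louvain is a stand-in here; GraphRAG itself uses hierarchical Leiden (graspologic).
    communities = community.louvain_communities(g, seed=42)
    for community_id, members in enumerate(communities):
        print(f"community {community_id}: {sorted(members)}")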

Phase 4: Community Summarization

[AGT -> Community Embedding -> Community Summarization]
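
Each community's entities, relationships, and claims are handed back to the LLM, which writes a "community report" that can later be used to answer broad, corpus-level questions. A hedged sketch with a simplified prompt, assuming the same OpenAI-compatible client as in the extraction example.

    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    def summarize_community(entities: list[str], relationships: list[str]) -> str:
        # Ask the LLM for a "community report" over one detected community (simplified prompt).
        context = "Entities:\n" + "\n".join(entities) + "\n\nRelationships:\n" + "\n".join(relationships)
        response = client.chat.completions.create(
            model="gpt-4o-mini",  # model choice is an assumption
            messages=[{"role": "user", "content":
                       "Write a short report summarizing this community of related entities.\n\n" + context}],
        )
        return response.choices[0].message.content

    report = summarize_community(
        ["GraphRAG - a graph-based RAG system", "Leiden - a community detection algorithm"],
        ["GraphRAG uses Leiden to detect hierarchical communities of entities"],
    )
    print(report)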

Phase 5: Document Processing

[TU -> Links to TU -> Doc Embedding -> Doc Graph Creation -> Doc Tables (DT)]
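
Documents are linked back to the text units derived from them, and a document-level embedding can be obtained by aggregating the embeddings of those text units, for example by averaging them. A small sketch of that aggregation step; the per-unit vectors are assumed to come from whatever embedding model the pipeline is configured with.

    import numpy as np

    # Assume each text unit already has an embedding from the configured embedding model.
    text_unit_embeddings = {
        "tu-001": np.array([0.1, 0.3, 0.5]),
        "tu-002": np.array([0.2, 0.1, 0.4]),
    }
    doc_to_units = {"doc-1": ["tu-001", "tu-002"]}

    def document_embedding(doc_id: str) -> np.ndarray:
        # Average the embeddings of a document's text units into a single document vector.
        vectors = [text_unit_embeddings[unit_id] for unit_id in doc_to_units[doc_id]]
        return np.mean(vectors, axis=0)

    print(document_embedding("doc-1"))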

Phase 6: Network Visualization

[DT, AGT -> Nodes Table]
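
For visualization, node embeddings are projected down to 2D (GraphRAG uses UMAP), so each entity ends up with x/y coordinates in a nodes table. A minimal sketch with umap-learn and pandas, using random placeholder embeddings.

    import numpy as np
    import pandas as pd
    import umap

    # Placeholder node embeddings; in the real pipeline these come from the graph-embedding step.
    names = [f"entity-{i}" for i in range(20)]
    embeddings = np.random.default_rng(0).standard_normal((len(names), 128))

    # Project to 2D; the result is the nodes table consumed by the visualization layer.
    coords = umap.UMAP(n_components=2, random_state=0).fit_transform(embeddings)
    nodes_table = pd.DataFrame({"entity": names, "x": coords[:, 0], "y": coords[:, 1]})
    print(nodes_table.head())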

This comprehensive process, although powerful, comes with significant drawbacks: high computational costs and slow processing times. For instance, indexing a single book can cost around $10 and take considerable time.
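
A back-of-envelope calculation shows where that cost comes from: every text unit goes through at least one long extraction prompt before entity summarization and community reports add further calls. All numbers below are illustrative assumptions, not measurements.

    # Illustrative estimate only; every number here is an assumption.
    book_tokens = 200_000                 # a full-length book
    chunk_tokens, overlap_tokens = 300, 100
    text_units = book_tokens // (chunk_tokens - overlap_tokens)

    prompt_overhead = 1_500               # few-shot extraction prompt tokens per call (assumed)
    output_per_unit = 500                 # extracted entities/relationships per unit (assumed)

    input_tokens = text_units * (chunk_tokens + prompt_overhead)
    output_tokens = text_units * output_per_unit

    # Assumed GPT-4-class pricing per 1K tokens (input / output); check current rates.
    cost = input_tokens / 1000 * 0.0025 + output_tokens / 1000 * 0.01
    print(f"{text_units} text units, roughly ${cost:.2f} for the extraction pass alone")
    # A second "gleaning" pass and the summarization phases would add to this.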

That's why Microsoft promptly released a solution accelerator: https://github.com/Azure-Samples/graphrag-accelerator. However, the TPM (tokens-per-minute) quota thresholds it expects are quite high.

[Quota threshold table omitted; see the accelerator repository for the required limits.]

Despite these challenges, GraphRAG's ability to provide detailed and comprehensive answers makes it a valuable tool for complex queries and data retrieval needs. Future developments may focus on optimizing the cost and speed, potentially incorporating open-source models to make the system more accessible and efficient.

Ref: https://microsoft.github.io/graphrag/

Comments

Ashutosh Gupta, PhD

Student | AI / ML / Data Scientist | Industry + Academia

7 months ago

There are already many applications for this, and hopefully prices will come down soon too. Open-source / local models can also help cut costs somewhat. Thanks again. :)

Ashutosh Gupta, PhD

Student | AI / ML / Data Scientist | Industry + Academia

7 months ago

Thanks for sharing Jayant! This is indeed helpful. GraphRAG seems like a powerful tool for enhancing LLM performance with knowledge graphs.
