Agentic Reasoning: Reasoning LLMs with Tools for Deep Research


1. Introduction

Agentic Reasoning is a framework that enhances Large Language Model (LLM) reasoning by integrating external tool-using agents, including web search, code execution, and structured reasoning-context memory.

Unlike traditional LLM-based reasoning, which relies solely on internal inference, Agentic Reasoning dynamically engages external sources to improve logical deduction, fact retrieval, and problem-solving accuracy.

The framework introduces the Mind Map agent, which constructs a structured knowledge graph to track logical relationships, enhancing deductive reasoning.

It also integrates web search and coding agents to retrieve real-time information and perform computational analysis, significantly outperforming retrieval-augmented generation (RAG) systems and closed-source LLMs on complex research tasks.


2. Core Methodology

Agentic Reasoning follows a multi-agent architecture where LLMs interact with external tools. The reasoning process dynamically integrates four key components:

  • Task Instruction (o): Defines the reasoning objective.
  • Query (q): Represents the complex question requiring multi-step reasoning.
  • External Tool Outputs (e): Information retrieved from web search, coding execution, or memory graphs.
  • Reasoning Memory (k): Structured knowledge stored from previous reasoning steps.

The system uses a probability model:

P(r, a | o, q, e, k)

where r represents the chain of reasoning steps and a is the final answer, both conditioned on the task instruction, query, external tool outputs, and reasoning memory. The model optimizes both through structured retrieval and external agent interactions.
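To make the notation concrete, here is a minimal Python sketch (all names are my own, not from the paper) of how the four components could be bundled into a state that conditions each reasoning step:

```python
from dataclasses import dataclass, field

@dataclass
class ReasoningState:
    """Bundles the four inputs that condition each reasoning step."""
    task_instruction: str                                   # o: the reasoning objective
    query: str                                              # q: the multi-step question
    tool_outputs: list = field(default_factory=list)        # e: web/code/memory results
    reasoning_memory: dict = field(default_factory=dict)    # k: knowledge from prior steps

def next_step(llm, state: ReasoningState, partial_chain: list) -> str:
    """Sample the next reasoning step r_t conditioned on (o, q, e, k) and the chain so far."""
    prompt = (
        f"Task: {state.task_instruction}\n"
        f"Question: {state.query}\n"
        f"Tool outputs so far: {state.tool_outputs}\n"
        f"Memory: {state.reasoning_memory}\n"
        f"Reasoning so far: {' '.join(partial_chain)}\n"
        "Next step:"
    )
    return llm(prompt)  # `llm` is any callable mapping a prompt string to generated text
```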


3. The Agentic Reasoning Pipeline

The framework enables LLMs to autonomously determine when additional information is required, triggering specialized tokens that call external agents:

  • Web-search token: Retrieves real-time information.
  • Coding token: Executes calculations and simulations.
  • Mind Map token: Stores and organizes reasoning context.

This agent-based interaction ensures that the reasoning model retrieves, refines, and structures information dynamically, rather than relying solely on pre-trained knowledge.
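A hedged sketch of this token-triggered loop, assuming placeholder token strings and agent callables (the paper's actual tokens and interfaces may differ):

```python
# Minimal sketch of the agentic reasoning loop: the model emits special tokens,
# and the loop pauses generation, calls the matching agent, and feeds the result back.
WEB_SEARCH = "<search>"
CODE = "<code>"
MIND_MAP = "<mind_map>"

def agentic_reasoning(llm, web_agent, code_agent, mind_map_agent, prompt, max_steps=20):
    context = prompt
    for _ in range(max_steps):
        chunk = llm(context)          # generate until a tool token or a final answer appears
        context += chunk
        if WEB_SEARCH in chunk:
            query = chunk.split(WEB_SEARCH, 1)[1].strip()
            context += f"\n[search result] {web_agent(query)}\n"
        elif CODE in chunk:
            task = chunk.split(CODE, 1)[1].strip()
            context += f"\n[code output] {code_agent(task)}\n"
        elif MIND_MAP in chunk:
            question = chunk.split(MIND_MAP, 1)[1].strip()
            context += f"\n[memory] {mind_map_agent(question)}\n"
        else:
            return context            # no tool requested: treat the chunk as the final answer
    return context
```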


4. Key Components of Agentic Reasoning

  • Mind Map Agent

- Constructs a real-time knowledge graph by structuring logical relationships from reasoning chains.
- Uses community clustering to group reasoning contexts and generate summaries.
- Functions as an external memory tool, allowing LLMs to track arguments, clarify ambiguities, and retrieve past deductions.

  • Web-Search Agent

- Retrieves real-time and context-aware information from the web.
- Extracts concise summaries that match the reasoning task, such as:
  - Numerical values (e.g., “What is the population of the US in 2024?”).
  - Nuanced perspectives for open-ended topics.
  - Evidence validation for hypothesis-driven queries.

  • Coding Agent

- Offloads computation-heavy tasks to a specialized coding LLM.
- Executes quantitative analysis and returns structured outputs.
- Ensures separation of reasoning and execution, improving coherence (a combined sketch of all three agents follows below).
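Below is a minimal, illustrative sketch of how the three agents could plug into the reasoning loop above; the graph structure, naive clustering, and helper callables (`search_fn`, `summarize_fn`, `coder_llm`) are assumptions for illustration, not the paper's implementation:

```python
from collections import defaultdict

class MindMapAgent:
    """Toy external memory: stores (subject, relation, object) triples from reasoning steps
    and groups nodes into naive 'communities' by direct connectivity for summarization."""
    def __init__(self):
        self.edges = defaultdict(list)

    def add(self, subject, relation, obj):
        self.edges[subject].append((relation, obj))

    def query(self, node):
        # Return everything directly connected to a node (stand-in for graph retrieval).
        return self.edges.get(node, [])

    def communities(self):
        # Extremely naive clustering: each subject plus its direct neighbours forms one group.
        return [{s, *(o for _, o in rels)} for s, rels in self.edges.items()]

def web_search_agent(query, search_fn, summarize_fn):
    """Retrieve raw pages with a search backend, then return a concise task-relevant summary."""
    pages = search_fn(query)
    return summarize_fn(query, pages)

def coding_agent(task, coder_llm):
    """Delegate a computation to a dedicated coding model, execute it, and return `result`."""
    code = coder_llm(f"Write Python that computes: {task}\nAssign the answer to a variable named result.")
    scope = {}
    exec(code, scope)            # in practice this should run in a sandbox, not bare exec
    return scope["result"]
```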

5. Main Findings and Insights

  1. Minimal Tool Selection Improves Performance: a small, carefully chosen toolset (web search, code execution, and the Mind Map memory) works better than piling on many overlapping tools.
  2. Delegating Tasks to Specialized LLMs Enhances Efficiency: offloading computation to a dedicated coding agent keeps the main reasoning chain focused and coherent.
  3. Test-Time Scaling and Verifiability: allocating more tool calls and reasoning steps at inference time improves accuracy, and the externalized intermediate steps make the final answers easier to verify.


6. Experimental Results

Evaluation on GPQA (PhD-Level Scientific Reasoning Benchmark):

  • Agentic Reasoning significantly outperformed state-of-the-art LLMs in physics, chemistry, and biology on this benchmark.

Case Study: Medical Decision-Making

  • The model computed FiO2 (Fraction of Inspired Oxygen) via the coding agent.
  • It retrieved PEEP (Positive End-Expiratory Pressure) values via web search.
  • Combined insights for an optimal treatment plan, demonstrating real-world applicability.
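To illustrate the division of labour only (not clinical guidance, and not the paper's actual computation), here is a sketch where the coding agent evaluates a common rule-of-thumb approximation for FiO2 from nasal-cannula flow, while the PEEP value stands in for something the web-search agent retrieved:

```python
def estimate_fio2(flow_l_per_min: float) -> float:
    """Rule-of-thumb FiO2 for nasal cannula: ~21% room air plus roughly 4% per L/min of flow.
    This approximation is an assumption for illustration, not taken from the paper."""
    return min(0.21 + 0.04 * flow_l_per_min, 1.0)

# Value the web-search agent might have retrieved (illustrative only).
peep_cm_h2o = 5.0

# Coding-agent side: compute FiO2 for an assumed 4 L/min flow rate.
fio2 = estimate_fio2(4.0)
print(f"Estimated FiO2: {fio2:.2f}, retrieved PEEP: {peep_cm_h2o} cmH2O")
```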

Comparison with Human Experts:

  • Surpassed human experts in physics, chemistry, and biology in the GPQA Extended Set.
  • Higher pass rates in deep research tasks across finance, law, and medicine.


7. Future Implications

  1. Scaling to Multimodal Data: Future work will integrate images, charts, and tabular data for complex reasoning.
  2. Reinforcement Learning with Agentic Tools: Using tool usage as a reward signal could further optimize reasoning strategies.
  3. Enhanced Human-AI Collaboration: Agentic frameworks could power research assistants, scientific discovery, and automated expertise synthesis.


8. Conclusion

Agentic Reasoning redefines LLM reasoning by integrating external tools dynamically. It outperforms traditional models in expert-level knowledge tasks and research-driven problem-solving by leveraging structured memory, real-time search, and computational agents.

Future improvements in multimodal reasoning, reinforcement learning, and domain-specific tools will further enhance its ability to tackle complex real-world challenges.

#AI #DataScience #Data #GenerativeAI #ReinforcementLearningOptimization #ModelOptimizationTechniques #FineTuningLLMs

Follow me on LinkedIn: www.dhirubhai.net/comm/mynetwork/discovery-see-all?usecase=PEOPLE_FOLLOWS&followMember=florentliu
