Advanced Agentic Reasoning with Structure & Optimisation

Kai Xin Thia

Head of AI & Analytics, Group Tech Office, ST Engineering

Published: 13 February 2025

LLMs are moving beyond simple text generation toward complex problem-solving and expert-level reasoning. This shift is driven by innovations such as Agentic Reasoning, which equips LLMs with external tools like web search and code execution, and by Topology DSPy, an approach to multi-agent systems that optimizes collaboration through automated prompt and topology design.

These advancements are not merely incremental improvements; they represent a fundamental change in how LLMs operate. They enable LLMs to handle intricate tasks requiring in-depth research, logical deduction, and real-time data analysis, sometimes outperforming human experts.

The ability to reason effectively is crucial because it underpins intelligent decision-making, allowing AI to move from pattern recognition to insightful analysis, unlocking new possibilities in areas from science and medicine to finance and beyond. By enhancing reasoning capabilities, we are making AI systems more reliable, adaptable, and capable of solving the world's most complex problems.

Special thanks to Michal Polanowski, MBA, PhD, Srikrishna Iyer, and Ouyang Ruofei for assisting with the research.

AI Podcast Discussion

This week's podcast provides an excellent summary, especially for the challenging technical details and their significance.

Podcast (12.5min)

Why Does This Technology Matter?

Enhanced Problem-Solving: Traditional LLMs often struggle with complex tasks requiring multi-step reasoning, in-depth research, or real-time data analysis. These new approaches equip LLMs with the ability to use external tools and structure their reasoning processes, leading to more accurate and comprehensive solutions.
Expert-Level Performance: By integrating external tools and optimizing multi-agent collaboration, LLMs can now achieve performance levels that rival human experts in various domains, from science and medicine to finance and law.
Automation of Complex Tasks: These technologies enable the automation of complex, labor-intensive tasks, such as in-depth research, data analysis, and strategic planning, freeing up valuable human resources.
Scalability and Efficiency: Optimised multi-agent systems and agentic frameworks lead to more efficient use of computational resources and enable more scalable solutions.
Competitive Advantage: Implementing these advanced LLM technologies can provide a significant competitive advantage by allowing us to innovate faster, make more informed decisions, and deliver superior products and services.

Deep Dive: Agentic Reasoning

Oxford's "Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research" framework introduces a novel approach to enhancing LLM reasoning by integrating external LLM-based agents as tools. Key components include:

External Tool Integration: Instead of relying solely on their internal knowledge, LLMs can dynamically interact with external tools, including web search engines and code execution environments.

Web-search Agent: Retrieves relevant information from the internet, supplementing the model's knowledge and providing real-time data.
Code Agent: Performs computational analysis and coding tasks, enabling quantitative reasoning and complex data manipulation.

Mind Map Agent: This agent constructs a structured knowledge graph to track logical relationships, improving deductive reasoning and helping the model organize its reasoning process. It clusters reasoning context, provides concise summaries, and allows the model to query previous reasoning steps.
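
As a rough illustration of the Mind Map idea, the sketch below stores reasoning steps as labelled edges in a small graph and lets a caller look up earlier steps by entity. The class name `MindMap`, its methods, and the example facts are hypothetical; the paper's agent builds and summarizes the graph with an LLM, which is not reproduced here.

```python
from collections import defaultdict


class MindMap:
    """Toy knowledge graph over reasoning steps (hypothetical, not the paper's code)."""

    def __init__(self):
        # entity -> list of (relation, other_entity) edges
        self.edges = defaultdict(list)

    def add(self, subject, relation, obj):
        """Record one reasoning step as a directed, labelled edge."""
        self.edges[subject].append((relation, obj))

    def query(self, entity):
        """Return every recorded step that starts at the given entity."""
        return [f"{entity} {rel} {obj}" for rel, obj in self.edges.get(entity, [])]

    def reachable(self, start):
        """Follow edges transitively to collect every entity linked from `start`."""
        seen, stack = set(), [start]
        while stack:
            node = stack.pop()
            for _, nxt in self.edges.get(node, []):
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        return seen


# Example facts are invented for illustration.
mind_map = MindMap()
mind_map.add("Alice", "is taller than", "Bob")
mind_map.add("Bob", "is taller than", "Carol")
```

Here `mind_map.query("Alice")` returns the single step recorded for Alice, while `mind_map.reachable("Alice")` follows the chain through Bob to Carol, which is the kind of multi-hop relationship a logic puzzle typically needs.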

Mind Maps also help to clarify complex logical relationships, enabling the model to solve tricky logic-based questions and enhance deductive reasoning in strategic games.
Dynamic and Iterative Reasoning: The model can proactively decide when additional information is required, embedding specialized tokens to call external agents. This allows for an iterative retrieval and reasoning cycle until a thoroughly reasoned answer is reached.
Less is More: The research found that a few well-chosen tools are more effective than a large toolset; adding more tools can degrade performance.
Delegation of Tasks: The framework delegates specific tasks to specialized LLM-based agents, ensuring that auxiliary tasks do not disrupt the primary reasoning model, allowing for longer and more coherent reasoning chains. This also leverages the strengths of different LLMs.
Performance: Agentic Reasoning has demonstrated superior performance on expert-level questions, achieving impressive accuracy rates on the GPQA dataset (58% in chemistry, 88% in physics, and 79% in biology). It has also outperformed human experts in deep research tasks.
Test-Time Scaling: The frequency of tool use can be leveraged as a test-time reasoning verifier to filter out weaker outputs.
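
The retrieval-and-reasoning cycle and the tool-use verifier described in the list above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the `<search>`/`<code>` tag format, the function names, and the stub model and tool are all hypothetical, and picking the candidate with the most tool calls is an assumed reading of the test-time scaling point.

```python
import re

# Hypothetical token format; the paper's actual special tokens may differ.
TOOL_CALL = re.compile(r"<(search|code)>(.*?)</\1>", re.DOTALL)


def run_agentic_loop(model, tools, question, max_steps=5):
    """Alternate between model output and tool calls until no tool is requested.

    `model` maps the running context to new text; `tools` maps a tool name to a
    callable. Returns (final_text, number_of_tool_calls).
    """
    context, calls, output = question, 0, ""
    for _ in range(max_steps):
        output = model(context)
        match = TOOL_CALL.search(output)
        if match is None:
            return output, calls  # the model answered without asking for a tool
        name, query = match.group(1), match.group(2).strip()
        result = tools[name](query)
        calls += 1
        # Feed the tool result back so the next step can reason over it.
        context += f"\n{output}\n<result>{result}</result>"
    return output, calls  # step budget exhausted; return the last output as-is


def pick_by_tool_use(candidates):
    """Keep the answer with the most tool calls (assumed direction of the heuristic)."""
    return max(candidates, key=lambda pair: pair[1])[0]


def stub_model(context):
    """Stand-in for an LLM: asks for one calculation, then answers."""
    if "<result>" not in context:
        return "I need to compute this. <code>6 * 7</code>"
    value = re.search(r"<result>(.*?)</result>", context).group(1)
    return f"The answer is {value}."


def stub_code_tool(expression):
    """Stand-in code agent: multiplies two integers written as 'a * b'."""
    left, right = expression.split("*")
    return str(int(left) * int(right))


answer, tool_calls = run_agentic_loop(
    stub_model, {"code": stub_code_tool}, "What is 6 times 7?"
)
```

In a real system the stub model would be an LLM call and each entry in `tools` would be a separate agent, which keeps auxiliary work out of the main reasoning context. `pick_by_tool_use` expects a list of `(answer, tool_calls)` pairs, such as the tuples returned by several runs of the loop on the same question.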

Deep Dive: Optimised Multi-Agent Systems

The video "Topology DSPy: Prompting the Swarm (Multi-Agents)" by Discover AI on YouTube describes a new approach to multi-agent systems using a three-step optimization process:

Block Level Prompt Optimisation: Ensures that each single agent is optimally configured, including optimizing instructions and examples. This is the dominant factor in the system's performance.
Topology Optimisation: Focuses on the structural arrangement of agents and the workflow between them. Different topologies, like parallel aggregations, hierarchical reflection, or debate structures, can be selected for the optimal interaction between agents.
Workflow Level Prompt Optimisation: The final step is to optimize prompts for each agent within the best-found topology, considering the dependence of prompts within the system.

Automated Design: The system automatically designs and optimizes multi-agent configurations, removing the need to craft topologies manually. The system uses a mathematical optimization process to search for possible configurations and determine the best one to solve the problem.
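
To make the idea of searching over configurations concrete, here is a minimal sketch that enumerates a tiny, made-up space of topologies and keeps the best-scoring one. It uses plain exhaustive search rather than whatever optimization procedure the described system uses, and the option names, the scoring function, and its numbers are all hypothetical.

```python
from itertools import product

# Hypothetical search space: which optional agents to enable, and how many
# parallel predictors feed the final aggregation step.
AGENT_OPTIONS = {
    "reflector": [False, True],
    "debater": [False, True],
    "num_predictors": [1, 3],
}


def enumerate_topologies(options):
    """Yield every combination of the options as a configuration dict."""
    keys = list(options)
    for values in product(*(options[k] for k in keys)):
        yield dict(zip(keys, values))


def search_best_topology(options, evaluate):
    """Score each topology once and return (best_config, best_score)."""
    best_config, best_score = None, float("-inf")
    for config in enumerate_topologies(options):
        score = evaluate(config)
        if score > best_score:
            best_config, best_score = config, score
    return best_config, best_score


def toy_evaluate(config):
    """Made-up validation score in percentage points.

    A real run would execute the agents on held-out tasks instead.
    """
    score = 50
    score += 10 if config["reflector"] else 0
    score += 5 * (config["num_predictors"] - 1)
    score -= 2 if config["debater"] else 0  # pretend debate hurts on this task
    return score


best_config, best_score = search_best_topology(AGENT_OPTIONS, toy_evaluate)
```

Exhaustive search only works because this space has eight configurations; a realistic space grows combinatorially, which is why an automated optimizer is needed in the first place.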

Agent Types: Different agent types can be incorporated into the system, including predictor, reflector, summarizer, and debater. Each agent has a specific function in the process.
Baseline Importance: A baseline predictor agent is used to evaluate the impact of more complex configurations and to measure how much each added agent improves performance.
Prompt Templates: Specific prompt templates have been developed for each agent type, using "let's think step by step" as a base prompt. These templates can be further optimized using tools like DSPy.
Performance: This approach has shown significant performance gains, particularly in mathematical reasoning and coding tasks.
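
The agent types, prompt templates, and baseline comparison above could look roughly like the sketch below. The template wording, function names, and accuracy figures are hypothetical placeholders, not the templates or results from the video; only the "Let's think step by step" base prompt comes from the source.

```python
BASE_PROMPT = "Let's think step by step."

# Hypothetical templates, one per agent type named in the article.
TEMPLATES = {
    "predictor": "{base}\nQuestion: {question}\nAnswer:",
    "reflector": (
        "{base}\nQuestion: {question}\nDraft answer: {draft}\n"
        "Critique the draft and give a corrected answer:"
    ),
    "summarizer": "{base}\nSummarize the key facts in:\n{text}",
    "debater": (
        "{base}\nQuestion: {question}\nOther agents answered: {answers}\n"
        "Argue for the best answer:"
    ),
}


def build_prompt(agent_type, **fields):
    """Fill the template for one agent type with the shared base prompt."""
    return TEMPLATES[agent_type].format(base=BASE_PROMPT, **fields)


def incremental_gains(results):
    """Given [(config_name, accuracy), ...] starting with the baseline,
    return how much each step changed accuracy relative to the previous one."""
    return [
        (name, acc - results[i][1])
        for i, (name, acc) in enumerate(results[1:])
    ]


# Made-up accuracies in percentage points, purely to show the calculation.
example_results = [("predictor", 60), ("+reflector", 68), ("+debater", 67)]
gains = incremental_gains(example_results)
```

Starting every comparison from the lone predictor makes it visible when an added agent does not pay for itself, as with the invented debater step here, whose gain is negative.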

Key Learnings

Importance of External Tools: Both sources emphasize the critical role of external tools in enhancing LLM reasoning. Agentic Reasoning uses tools like web search and code execution, while multi-agent systems can incorporate task-specific tools.
Structured Reasoning: The Mind Map in Agentic Reasoning and the topology optimization in multi-agent systems demonstrate the importance of structured reasoning processes.
Automated Optimisation: Both approaches highlight the value of automated processes for optimizing LLM performance, whether for prompt optimization, agent configuration, or tool usage.
Modular Design: Breaking down complex problems into subtasks and delegating them to specialized agents is a common theme.
Iterative Refinement: Both approaches employ iterative processes, whether the retrieval-and-reasoning cycle of Agentic Reasoning or the multi-step optimization of the multi-agent system.
Sensitivity to Task: The performance of both systems is sensitive to the specific task, necessitating careful selection of tools and configurations.

Conclusion

As demonstrated in the Agentic Reasoning framework and the optimized multi-agent system, these emerging technologies in LLM reasoning hold immense potential. By embracing these advancements, we can significantly enhance our problem-solving capabilities, automate complex processes, and gain a competitive edge in the market.
