Building a Multi-Agent Orchestrator: A Step-by-Step Guide
Zahiruddin Tavargere
Senior Principal Software Engineer@Dell | Opinions are my own
Today, we’re diving into an exciting project: creating a Multi-Agent Orchestrator.
If you’re new here, I recommend revisiting the previous post in this series to get up to speed, as we’ll build upon its concepts and code.
In this project, we’ll tackle orchestrating actions between multiple agents, enabling seamless execution of tasks such as fetching weather information and the current time. Let’s jump in!
If you prefer video, a full walkthrough of this project is available in the accompanying video.
What Is a Multi-Agent Orchestrator?
A Multi-Agent Orchestrator is a system that classifies a user’s intent, routes each task to the right specialized agent, and combines the results into a single response.
Think of it as a manager assigning tasks to specialized team members. This orchestration ensures complex queries involving multiple tasks are handled efficiently.
What We’ll Build
We’ll create an Agent class that can call tools, an AgentOrchestrator that coordinates multiple agents, and a small demo that wires up a Weather Agent and a Time Agent.
Key Components of an Agent
An agent has three main components: a short conversation memory, a set of tools it can call, and an LLM model that decides which tool to use.
Our agents will dynamically decide which tool to use, making them highly adaptable.
The Agent Class
Here’s a high-level breakdown of the Agent class: it keeps a short memory of the conversation, builds a prompt listing its available tools, asks the LLM whether to call a tool or respond directly, and then executes the chosen tool.
Agents also handle parsing JSON responses from the LLM to ensure smooth execution.
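Concretely, here are illustrative examples of the two response shapes the agent expects back from the LLM. The values are made up; the keys match the response_format dictionary used in the class below.
# Illustrative examples of what the agent expects back from the LLM.

# Tool call: the agent routes "args" to the matching tool's use() method.
tool_call = {"action": "Weather Tool", "args": "New York"}

# Direct answer: no tool matches, so the dict is returned to the caller.
direct_reply = {"action": "respond_to_user", "args": "Hello! How can I help you?"}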
import ast
import json

from llm.llm_ops import query_llm
from tools.base_tool import Tool


class Agent:
    def __init__(self, Name: str, Description: str, Tools: list[Tool], Model: str):
        self.memory = []
        self.name = Name
        self.description = Description
        self.tools = Tools
        self.model = Model
        self.max_memory = 10

    def json_parser(self, input_string):
        # The LLM often returns a Python-style dict string, so parse it with
        # ast.literal_eval and round-trip it through json to normalize it.
        python_dict = ast.literal_eval(input_string)
        json_string = json.dumps(python_dict)
        json_dict = json.loads(json_string)
        if isinstance(json_dict, (dict, list)):
            return json_dict
        raise ValueError("Invalid JSON response")

    def process_input(self, user_input):
        self.memory.append(f"User: {user_input}")
        # Keep only the most recent turns so the prompt stays small.
        self.memory = self.memory[-self.max_memory:]
        context = "\n".join(self.memory)
        tool_descriptions = "\n".join(
            f"- {tool.name()}: {tool.description()}" for tool in self.tools
        )
        response_format = {"action": "", "args": ""}

        prompt = f"""Context:
{context}

Available tools:
{tool_descriptions}

Based on the user's input and context, decide if you should use a tool or respond directly.
If you identify an action, respond with the tool name and the arguments for the tool.
If you decide to respond directly to the user, make the action "respond_to_user" with args as your response, in the following format.
Response Format:
{response_format}
"""

        response = query_llm(prompt)
        self.memory.append(f"Agent: {response}")
        response_dict = self.json_parser(response)

        # Check if any tool can handle the requested action
        for tool in self.tools:
            if tool.name().lower() == response_dict["action"].lower():
                return tool.use(response_dict["args"])
        return response_dict
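The agent delegates all model calls to query_llm from llm/llm_ops.py, which isn’t shown in this post. Here is a minimal sketch of what it could look like, assuming the official OpenAI Python client and the gpt-4o-mini model used in the demo; the real implementation may differ.
# llm/llm_ops.py -- assumed sketch of query_llm; the original implementation isn't shown
import os
from openai import OpenAI

client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

def query_llm(prompt: str, model: str = "gpt-4o-mini") -> str:
    # Send a single-turn chat request and return the raw text of the reply.
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content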
The Orchestrator
The orchestrator coordinates multiple agents: it classifies the user’s intent, rewrites the input for the selected agent, and keeps looping until every task in the request has been handled.
Core features of the orchestrator include LLM-based intent classification, agent selection and input rewriting, a shared memory of reasoning and action steps, and a feedback loop that lets multiple agents handle a multi-part request.
import ast
import json

from llm.llm_ops import query_llm
from agents.base_agent import Agent
from logger import log_message


class AgentOrchestrator:
    def __init__(self, agents: list[Agent]):
        self.agents = agents
        self.memory = []  # Stores the reasoning and action steps taken
        self.max_memory = 10

    def json_parser(self, input_string):
        # Parse the Python-style dict returned by the LLM and normalize it to JSON.
        python_dict = ast.literal_eval(input_string)
        json_string = json.dumps(python_dict)
        json_dict = json.loads(json_string)
        if isinstance(json_dict, (dict, list)):
            return json_dict
        raise ValueError("Invalid JSON response")

    def orchestrate_task(self, user_input: str):
        # Keep only the most recent steps so the prompt stays small.
        self.memory = self.memory[-self.max_memory:]
        context = "\n".join(self.memory)
        print(f"Context: {context}")
        response_format = {"action": "", "input": "", "next_action": ""}

        def get_prompt(user_input):
            return f"""
Use the context from memory to plan next steps.
Context:
{context}

You are an expert intent classifier.
You will use the context provided and the user's input to classify the intent and select the appropriate agent.
You will rewrite the input for the agent so that the agent can efficiently execute the task.
Here are the available agents and their descriptions:
{", ".join([f"- {agent.name}: {agent.description}" for agent in self.agents])}

User Input:
{user_input}

###Guidelines###
- Sometimes you might have to use multiple agents to solve the user's input. You have to do that in a loop.
- The original user input could have multiple tasks; use the context to understand the previous actions taken and the next steps you should take.
- Read the context carefully, check whether there were multiple tasks, and whether you executed them all.
- If there are no actions to be taken, make the action "respond_to_user" with your final thoughts combining all previous responses as input.
- Respond with "respond_to_user" only when there are no agents to select from or there is no next_action.
- You will return the agent name in the form of {response_format}
- Always return valid JSON like {response_format} and nothing else.
"""

        prompt = get_prompt(user_input)
        llm_response = query_llm(prompt)
        llm_response = self.json_parser(llm_response)
        print(f"LLM Response: {llm_response}")
        self.memory.append(f"Orchestrator: {llm_response}")

        action = llm_response["action"]
        user_input = llm_response["input"]
        print(f"Action identified by LLM: {action}")

        if action == "respond_to_user":
            return llm_response

        # Route the rewritten input to the agent the LLM selected.
        for agent in self.agents:
            if agent.name == action:
                agent_response = agent.process_input(user_input)
                print(f"{action} response: {agent_response}")
                self.memory.append(f"Agent Response for Task: {agent_response}")
                return agent_response

    def run(self):
        print("LLM Agent: Hello! How can I assist you today?")
        user_input = input("You: ")
        self.memory.append(f"User: {user_input}")
        while True:
            if user_input.lower() in ["exit", "bye", "close"]:
                print("See you later!")
                break
            response = self.orchestrate_task(user_input)
            print(f"Final response of orchestrator: {response}")
            if isinstance(response, dict) and response["action"] == "respond_to_user":
                log_message(f"Response from Agent: {response['input']}", "RESPONSE")
                user_input = input("You: ")
                self.memory.append(f"User: {user_input}")
            elif response == "No action or agent needed":
                print("Response from Agent: ", response)
                user_input = input("You: ")
            else:
                # Feed the agent's result back in so the orchestrator can plan the next step.
                user_input = response
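The orchestrator imports log_message from a logger module that isn’t shown in the post. Here is a minimal sketch of what it might look like, assuming a simple timestamped console logger; the function name and level argument match the import above, everything else is an assumption.
# logger.py -- assumed sketch of the log_message helper used above
from datetime import datetime

def log_message(message: str, level: str = "INFO"):
    # Print a timestamped, level-tagged line to the console.
    timestamp = datetime.now().strftime("%Y-%m-%d %H:%M:%S")
    print(f"[{timestamp}] [{level}] {message}")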
Tools in Action
Agents use tools to perform tasks. For example, the WeatherTool below fetches current weather for a location, and the TimeTool used in the demo returns the current time for a city. Each tool includes a name(), a description() the LLM reads when deciding which tool to call, and a use() method that does the actual work.
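Both tools subclass Tool from tools/base_tool.py, which the post doesn’t show. Here is a minimal sketch of that base class, assuming it is an abstract class exposing the name(), description(), and use() methods the agents rely on.
# tools/base_tool.py -- assumed shape of the Tool base class (not shown in the original post)
from abc import ABC, abstractmethod

class Tool(ABC):
    @abstractmethod
    def name(self) -> str:
        """Short name the LLM uses to select this tool."""

    @abstractmethod
    def description(self) -> str:
        """Description shown to the LLM so it knows when to use the tool."""

    @abstractmethod
    def use(self, payload: str):
        """Execute the tool with the argument string chosen by the LLM."""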
import os
import requests

from tools.base_tool import Tool


class WeatherTool(Tool):
    def name(self):
        return "Weather Tool"

    def description(self):
        return "Provides weather information for a given location. The payload is just the location. Example: New York"

    def use(self, location: str):
        # Call the OpenWeatherMap current-weather endpoint for the given location.
        api_key = os.getenv("OPENWEATHERMAP_API_KEY")
        url = f"https://api.openweathermap.org/data/2.5/weather?q={location}&appid={api_key}&units=metric"
        response = requests.get(url)
        data = response.json()
        if data["cod"] == 200:
            temp = data["main"]["temp"]
            description = data["weather"][0]["description"]
            result = f"The weather in {location} is currently {description} with a temperature of {temp}°C."
            print(result)
            return result
        else:
            return f"Sorry, I couldn't find weather information for {location}."
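The demo below also imports a TimeTool, which the post doesn’t show. Here is a minimal sketch under the same Tool interface, assuming the standard library zoneinfo and a small hard-coded city-to-timezone map for illustration; the real tool may resolve cities differently.
# tools/time_tool.py -- illustrative sketch only; the real TimeTool isn't shown in the post
from datetime import datetime
from zoneinfo import ZoneInfo

from tools.base_tool import Tool

# Hypothetical lookup table; a real implementation might use a geocoding or timezone API.
CITY_TIMEZONES = {
    "new york": "America/New_York",
    "london": "Europe/London",
    "tokyo": "Asia/Tokyo",
}

class TimeTool(Tool):
    def name(self):
        return "Time Tool"

    def description(self):
        return "Provides the current time for a given city. The payload is just the city name. Example: Tokyo"

    def use(self, city: str):
        tz_name = CITY_TIMEZONES.get(city.strip().lower())
        if tz_name is None:
            return f"Sorry, I don't know the timezone for {city}."
        now = datetime.now(ZoneInfo(tz_name))
        return f"The current time in {city} is {now.strftime('%H:%M')} ({tz_name})."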
Demo: Running the Orchestrator
Here’s a quick demonstration that wires two agents into the orchestrator and runs it:
from agents.base_agent import Agent
from tools.weather_tool import WeatherTool
from tools.time_tool import TimeTool
from orchestrator import AgentOrchestrator
from dotenv import load_dotenv

# Load environment variables from .env file
load_dotenv()

# Create Weather Agent
weather_agent = Agent(
    Name="Weather Agent",
    Description="Provides weather information for a given location",
    Tools=[WeatherTool()],
    Model="gpt-4o-mini"
)

# Create Time Agent
time_agent = Agent(
    Name="Time Agent",
    Description="Provides the current time for a given city",
    Tools=[TimeTool()],
    Model="gpt-4o-mini"
)

# Create AgentOrchestrator
agent_orchestrator = AgentOrchestrator([weather_agent, time_agent])

# Run the orchestrator
agent_orchestrator.run()
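For the demo to run, the .env file needs the API keys the code reads at runtime. The OpenWeatherMap key is read by the WeatherTool above; the OpenAI key is an assumption tied to the query_llm sketch and the gpt-4o-mini model name.
# .env (example values only, not real keys)
OPENWEATHERMAP_API_KEY=your_openweathermap_key_here
OPENAI_API_KEY=your_openai_key_here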
What’s Next?
This orchestrator is just the beginning. You can extend it in many directions, for example with more agents and tools, persistent memory, or more robust error handling.
Full Code
Final Thoughts
Building a Multi-Agent Orchestrator showcases the power of combining LLMs with task-specific agents. By modularizing tasks and leveraging context effectively, you can create systems that are both scalable and intelligent.
Stay tuned for more updates, and feel free to share your thoughts or ask questions in the comments below. Don’t forget to check out the accompanying video for a detailed walkthrough of the code.
Happy coding!