登录查看更多内容

OpenAI Introduces Swarm, a Framework for Building Multi-Agent Systems

Aditi Khare

AWS & AI Research Specialist-Principal Machine Learning Scientist & AI Architect | IIM-A | Author | AI Research [Portfolio] Build Production-Grade AI Products from Scratch | Vision Transformers??Open-Source Contributor

发布日期: 2024年10月12日

+ 关注

#openai #ai #airesearch #airesearchpapers #researchskills

For more information on AI Research Papers you can visit my Github Profile -

https://github.com/aditikhare007/AI_Research_Junction_Aditi_Khare

For Receving latest updates on Latest Advancements in AI Research Papers Summaries @Generative AI @Quantum AI @GPUs Optimzation @Deep Learning @Vision you can subscribe to my AI Research Papers Summaries Newsletter using below link -

https://www.dhirubhai.net/newsletters/7152631955203739649/

Thank you & Happy Reading !!

Swarm vs Assistant API - Why Swarm

Swarm is lightweight, scalable, and highly customizable by design. It is best suited for situations dealing with a large number of independent capabilities and instructions that are difficult to encode into a single prompt.

The Assistants API is a great option for developers looking for fully-hosted threads and built in memory management and retrieval. where as Swarm is optimal for developers who want full transparency and fine-grained control over context, steps, and tool calls. Swarm runs (almost) entirely on the client and, much like the Chat Completions API, does not store state between calls.

Examples

basic: Simple examples of fundamentals like setup, function calling, handoffs, and context variables
triage_agent: Simple example of setting up a basic triage step to hand off to the right agent
weather_agent: Simple example of function calling
airline: A multi-agent setup for handling different customer service requests in an airline context.
support_bot: A customer service bot which includes a user interface agent and a help center agent with several tools
personal_shopper: A personal shopping agent that can help with making sales and refunding orders.

Running Swarm -

Start by instantiating a Swarm client (which internally just instantiates an OpenAI client).

from swarm import Swarm

client = Swarm()

client.run()

Swarm's run() function is analogous to the chat.completions.create() function in the Chat Completions API – it takes messages and returns messages and saves no state between calls. Importantly, however, it also handles Agent function execution, hand-offs, context variable references, and can take multiple turns before returning to the user.

At its core, Swarm's client.run() implements the following loop:

Get a completion from the current Agent
Execute tool calls and append results
Switch Agent if necessary
Update context variables, if necessary
If no new function calls, return

Agents

An Agent simply encapsulates a set of instructions with a set of functions (plus some additional settings below), and has the capability to hand off execution to another Agent.

While it's tempting to personify an Agent as "someone who does X", it can also be used to represent a very specific workflow or step defined by a set of instructions and functions (e.g. a set of steps, a complex retrieval, single step of data transformation, etc). This allows Agents to be composed into a network of "agents", "workflows", and "tasks", all represented by the same primitive.

Functions

Swarm Agents can call python functions directly.
Function should usually return a str (values will be attempted to be cast as a str).
If a function returns an Agent, execution will be transfered to that Agent.
If a function defines a context_variables parameter, it will be populated by the context_variables passed into client.run().

领英推荐

The Weekend @ ...

Generative AI 1 年前

OpenAI's Latest AI Model Can Perform Some Human-Like…

Bloomberg News 2 个月前

This AI newsletter is all you need #95

Towards AI 7 个月前

Function Schemas

Swarm automatically converts functions into a JSON Schema that is passed into Chat Completions tools.

Docstrings are turned into the function description.
Parameters without default values are set to required.
Type hints are mapped to the parameter's type (and default to string).
Per-parameter descriptions are not explicitly supported, but should work similarly if just added in the docstring. (In the future docstring argument parsing may be added.)

def greet(name, age: int, location: str = "New York"):
   """Greets the user. Make sure to get their name and age before calling.

   Args:
      name: Name of the user.
      age: Age of the user.
      location: Best place on earth.
   """
   print(f"Hello {name}, glad you are {age} in {location}!")

{
   "type": "function",
   "function": {
      "name": "greet",
      "description": "Greets the user. Make sure to get their name and age before calling.\n\nArgs:\n   name: Name of the user.\n   age: Age of the user.\n   location: Best place on earth.",
      "parameters": {
         "type": "object",
         "properties": {
            "name": {"type": "string"},
            "age": {"type": "integer"},
            "location": {"type": "string"}
         },
         "required": ["name", "age"]
      }
   }
}

Streaming

stream = client.run(agent, messages, stream=True)
for chunk in stream:
   print(chunk)

Uses the same events as Chat Completions API streaming . See process_and_print_streaming_response in /swarm/repl/repl.py as an example.

Two new event types have been added:

{"delim":"start"} and {"delim":"start"}, to signal each time an Agent handles a single message (response or function call). This helps identify switches between Agents.
{"response": Response} will return a Response object at the end of a stream with the aggregated (complete) response, for convenience.

Evaluations

Evaluations are crucial to any project, and we encourage developers to bring their own eval suites to test the performance of their swarms. For reference, we have some examples for how to eval swarm in the airline, weather_agent and triage_agent quickstart examples. See the READMEs for more details.

Utils

Use the run_demo_loop to test out your swarm! This will run a REPL on your command line. Supports streaming.

from swarm.repl import run_demo_loop
...
run_demo_loop(agent, stream=True)

Summary -

Swarm is a Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.

References -

Reference Reading Link -

https://github.com/openai/swarm

OpenAI Introduces Swarm, a Framework for Building Multi-Agent Systems

Aditi Khare

AWS & AI Research Specialist-Principal Machine Learning Scientist & AI Architect | IIM-A | Author | AI Research [Portfolio] Build Production-Grade AI Products from Scratch | Vision Transformers??Open-Source Contributor

Swarm vs Assistant API - Why Swarm

Examples

Running Swarm -

client.run()

Agents

Functions

领英推荐

Function Schemas

Streaming

Evaluations

Utils

Swarm is a Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.

AI Research Junction

1,564 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

Exploring the Future of AI with DBRX: What You Need to Know

This AI newsletter is all you need #36

LLMs for Simulated User Feedback, Causal AI, AI Slide Decks from ODSC East, and Low Code Time Series Analysis

Artificial Intelligence #230

Artificial Intelligence #230

The rise of AI agents

AI/ML news summary: week 33

What’s the future of IT Services? With Vadim Peskov. CEO of Diffco.

Gemini 1.5 Pro: Google's Latest Leap Forward in AI Technology!

Trip Advisor reports 3X revenue on ai users

Swarm vs Assistant API - Why Swarm

Examples

Running Swarm -

client.run()

Agents

Functions

领英推荐

Function Schemas

Streaming

Evaluations

Utils

Swarm is a Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.

AI Research Junction

1,564 位关注者

OpenAI's AI Powered Search Engine Into ChatGPT

2024年11月1日

Introducing Anthropic's Claude 3.5 Sonnet, and Claude 3.5 Haiku

2024年10月23日

Architecture Search Framework for Inference-Time Techniques & Designing Priors for Better Few-Shot Image Synthesis

2024年10月7日

Meta's Llama 3.2 - Edge AI & Vision with Open, Customizable Models

2024年9月28日

Agents in Software Engineering-Survey, Landscape, and Vision & Qwen2.5-Coder

2024年9月24日

Anthropic Introduces Contextual Retrieval Using Prompt Caching & Contextual Embeddings & Reranking Techniques

2024年9月23日

Google's Training Language Models to Self-Correct via Reinforcement Learning & Iteration of Thought - Autonomous Large Language Model Reasoning

2024年9月22日

Learning to Reason with LLMs - Introducing OpenAI o1

2024年9月14日

LongCite - Enabling LLMs to Generate Fine-grained Citations in Long-context QA

2024年9月10日

Role of RAG Noise in Large Language Models & Strategic Chain-of-Thought

2024年9月9日

社区洞察

其他会员也浏览了

Exploring the Future of AI with DBRX: What You Need to Know

This AI newsletter is all you need #36

LLMs for Simulated User Feedback, Causal AI, AI Slide Decks from ODSC East, and Low Code Time Series Analysis

Artificial Intelligence #230

Artificial Intelligence #230

The rise of AI agents

AI/ML news summary: week 33

What’s the future of IT Services? With Vadim Peskov. CEO of Diffco.

Gemini 1.5 Pro: Google's Latest Leap Forward in AI Technology!

Trip Advisor reports 3X revenue on ai users