OpenAI Dev Digest: New tools for building agents

Today, we released our first set of tools to help you accelerate building agents. These building blocks will help you design and scale the complex orchestration logic required to build agents and enable agents to interact with tools to make them truly useful.


Introducing the Responses API

The Responses API is a new API primitive that combines the best of both the Chat Completions and Assistants APIs. It’s simpler to use and includes built-in tools provided by OpenAI that execute tool calls and automatically add the results to the conversation context. As model capabilities continue to evolve, we believe the Responses API will provide a more flexible foundation for developers building agentic applications.

New tools to help you build useful agents

Web Search delivers accurate, clearly cited answers from the web. It uses the same tool as search in ChatGPT, so it handles conversation and follow-up questions well, and you can integrate it with just a few lines of code. Web Search is available in the Responses API as a tool for the gpt-4o and gpt-4o-mini models, and can be paired with other tools. In the Chat Completions API, web search is available through two dedicated models, gpt-4o-search-preview and gpt-4o-mini-search-preview.
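As a rough illustration of what "a few lines of code" might look like, here is a minimal sketch that builds a Responses API request with web search enabled, using only the Python standard library. The endpoint URL, the `web_search_preview` tool type, and the payload field names reflect the preview documentation as we understand it and should be treated as assumptions; the helper names `build_web_search_request` and `send` are hypothetical.

```python
import json
import os
import urllib.request

# Assumption: the Responses API endpoint as documented at launch.
API_URL = "https://api.openai.com/v1/responses"


def build_web_search_request(question: str, model: str = "gpt-4o") -> dict:
    """Build a Responses API payload that enables the built-in web search tool."""
    return {
        "model": model,
        # Assumption: tool type name used during the preview period.
        "tools": [{"type": "web_search_preview"}],
        "input": question,
    }


def send(payload: dict) -> dict:
    """POST the payload. Requires OPENAI_API_KEY and network access."""
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


if __name__ == "__main__":
    payload = build_web_search_request("What changed in the OpenAI API this week?")
    print(json.dumps(payload, indent=2))
    # Uncomment to call the API for real:
    # print(json.dumps(send(payload), indent=2))
```

Keeping the payload builder separate from the network call makes it easy to inspect or log the request before spending tokens on it.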

File Search is an easy-to-use retrieval tool that delivers fast, accurate search results with a few lines of code. It supports multiple file types, reranking, attribute filtering, and query rewriting. File Search is available in the Responses API and remains available through the Assistants API.
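To make attribute filtering more concrete, the sketch below builds a Responses API payload that attaches the file search tool with a single equality filter. The field names (`vector_store_ids`, `max_num_results`, `filters`) follow the preview documentation as we understand it and are assumptions; the vector store ID and the `region` attribute are hypothetical placeholders, and `build_file_search_request` is an illustrative helper, not part of any SDK.

```python
import json
from typing import Optional


def build_file_search_request(
    question: str,
    vector_store_id: str,
    attribute: Optional[tuple] = None,
    max_results: int = 5,
    model: str = "gpt-4o-mini",
) -> dict:
    """Build a Responses API payload that searches one vector store."""
    tool = {
        "type": "file_search",
        "vector_store_ids": [vector_store_id],
        "max_num_results": max_results,
    }
    if attribute is not None:
        key, value = attribute
        # Assumption: equality filter shape for attribute filtering.
        tool["filters"] = {"type": "eq", "key": key, "value": value}
    return {"model": model, "tools": [tool], "input": question}


if __name__ == "__main__":
    payload = build_file_search_request(
        "What is our refund policy?",
        vector_store_id="vs_example",   # hypothetical ID
        attribute=("region", "us"),     # hypothetical attribute
    )
    print(json.dumps(payload, indent=2))
```

The payload can be sent to the API the same way as any other Responses API request.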

Computer Use is the fastest way to build computer-using agents with CUA, the same model that powers Operator in ChatGPT. You can use this tool to control computers or virtual machines that you operate. You pass screenshots to the tool, and it responds with the next action to take, such as click, scroll, or type. The model is available to select developers as a research preview in the Responses API.
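The screenshot-in, action-out loop described above can be sketched locally without calling the API. In this illustration, `RecordingComputer` is a hypothetical stand-in for a real browser or VM driver, `next_action` stands in for the model call, and the action dictionary shapes (`type`, `x`, `y`, `scroll_x`, `scroll_y`, `text`) are assumptions rather than the confirmed wire format.

```python
from typing import Callable, Optional


class RecordingComputer:
    """Hypothetical stand-in for a real browser or VM driver; it only logs calls."""

    def __init__(self) -> None:
        self.log = []

    def screenshot(self) -> bytes:
        return b"fake-png-bytes"

    def click(self, x: int, y: int) -> None:
        self.log.append(f"click({x}, {y})")

    def scroll(self, dx: int, dy: int) -> None:
        self.log.append(f"scroll({dx}, {dy})")

    def type_text(self, text: str) -> None:
        self.log.append(f"type({text!r})")


def execute(computer: RecordingComputer, action: dict) -> None:
    """Apply one suggested action. The dict shape is an assumption."""
    kind = action["type"]
    if kind == "click":
        computer.click(action["x"], action["y"])
    elif kind == "scroll":
        computer.scroll(action["scroll_x"], action["scroll_y"])
    elif kind == "type":
        computer.type_text(action["text"])
    else:
        raise ValueError(f"unsupported action: {kind}")


def run_loop(
    computer: RecordingComputer,
    next_action: Callable[[bytes], Optional[dict]],
    max_steps: int = 10,
) -> int:
    """Send a screenshot, execute the suggested action, repeat until None."""
    steps = 0
    while steps < max_steps:
        action = next_action(computer.screenshot())
        if action is None:
            break
        execute(computer, action)
        steps += 1
    return steps


if __name__ == "__main__":
    # Scripted actions play the role of the model's responses.
    scripted = iter([
        {"type": "click", "x": 120, "y": 48},
        {"type": "type", "text": "agents"},
        {"type": "scroll", "scroll_x": 0, "scroll_y": 300},
    ])
    computer = RecordingComputer()
    run_loop(computer, lambda _shot: next(scripted, None))
    print(computer.log)
```

The `max_steps` cap is a design choice for the sketch: it keeps a misbehaving loop from acting on the machine indefinitely.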

Agents SDK is an orchestration framework that abstracts the complexity involved in designing and scaling agents. It includes built-in observability tooling that allows developers to log, visualize, and analyze agent performance to identify issues and areas for improvement. Inspired by Swarm, the Agents SDK is also open source and supports other model and tracing providers.


More articles by Edwin Wee Arbus

  • Recap of what's new in the OpenAI API

    Happy New Year! We ended last year with DevDays in SF, London, Singapore, and a holiday edition online. You can watch…

  • Seven years at Stripe

    Seven years ago today, I walked through the front door of Stripe. Patrick Collison held the door open for me—literally…
