Multi-Agent Week Recap - Microsoft Copilot Agents, Salesforce Agentforce, Windows Agent Arena, PaperQA2
Victor Dibia, PhD
Principal RDSE at Microsoft Research (Generative AI, Agents) | Carnegie Mellon Alumnus
The last week (Sept 8 - 16) has been eventful for agents and multi-agent systems, with a few interesting announcements.
Learn more at multiagentbook.com/news.
Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents
Microsoft has launched the next wave of Microsoft 365 Copilot, introducing significant updates to enhance AI-powered productivity.
Key features:
These enhancements are based on feedback from nearly 1,000 customers, resulting in over 700 product updates and 150 new features. The release blog post is chock-full of details, including speed and customer satisfaction updates:
Copilot responses are more than two times faster on average, and response satisfaction has improved by nearly three times.
IMO, this update represents a significant step towards integrating AI agents into everyday productivity tools, potentially transforming how we collaborate and work with AI assistance.
Salesforce Agentforce
Salesforce is introducing Agentforce, ushering in what they call the "third wave" of the AI revolution. Agentforce represents a shift towards autonomous AI agents capable of reasoning and tackling multi-faceted projects with minimal human oversight.
Key points:
IMO, efforts like this represent careful rollouts of pipelines where LLMs drive some controlled actions. We will need advances in model performance and architecture, plus a lot of engineering and experimentation, to get value from more autonomous deployments.
Windows Agent Arena
Some of my colleagues at Microsoft recently released the Windows Agent Arena, an open-source benchmark for developing and testing AI agents on Windows operating systems.
Highlights:
IMO, this project provides critical infrastructure for benchmarking interface agents: agents that address tasks by driving user interfaces designed for humans.
PaperQA2
PaperQA2 is an advanced AI agent designed for conducting comprehensive scientific literature reviews autonomously, reportedly outperforming PhD and postdoc-level researchers in biology.
Key features:
These developments showcase the rapid advancement of AI agents across different domains, from business operations to scientific research, indicating a trend towards more autonomous and capable AI systems in various fields.
OpenAI o1: A New Era of AI Reasoning
And of course, OpenAI introduced a new series of AI models called OpenAI o1, designed to excel at complex reasoning tasks. The first model in this series, o1-preview, is now available in ChatGPT and via API access.
Key features of OpenAI o1:
IMO, models like this could have significant implications for multi-agent systems: better planning, better reasoning, fewer dumb mistakes.
Knowledge Engineer | AI Engineer | Ontologist | Semantic Architect | Knowledge Graph Engineer | Information Architect | Python AI
6 months ago: Thanks for this. I find them useful. Keep it up. BTW I am passionate about agentic frameworks and task-oriented architecture.
Principal RDSE at Microsoft Research (Generative AI, Agents) | Carnegie Mellon Alumnus
6 months ago: Update: Link to Microsoft Copilot Agents announcement - https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/