Multi-Agent Week Recap - Microsoft Copilot Agents, Salesforce Agentforce, WindowsArena, PaperQA2

Victor Dibia, PhD

Principal RDSE at Microsoft Research (Generative AI, Agents) | Carnegie Mellon Alumnus

Published: September 16, 2024

The last week (Sept 8 - 16) has been eventful with respect to agents and multi-agent systems, with a few interesting announcements.

Learn more at multiagentbook.com/news.

Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents

Microsoft has launched the next wave of Microsoft 365 Copilot, introducing significant updates to enhance AI-powered productivity.

Unveiling Copilot agents built with Microsoft Copilot Studio to supercharge your business.

Key features:

Copilot Pages: A dynamic, persistent canvas for multiplayer AI collaboration, designed as the first new digital artifact for the AI age.
Major improvements to Copilot in Microsoft 365 apps: advanced data analysis in Excel with Python support, dynamic storytelling in PowerPoint, and enhanced meeting capabilities in Teams.
Introduction of Copilot agents built in Copilot Studio: Automate and execute business processes, enabling teams to scale their efforts.
New agent builder: Allows creating custom Copilot agents right in BizChat or SharePoint.

These enhancements are based on feedback from nearly 1,000 customers, resulting in over 700 product updates and 150 new features. The release blog post is chock-full of details, including speed and customer satisfaction updates:

Copilot responses are more than two times faster on average, and response satisfaction has improved by nearly three times.

IMO, this update represents a significant step towards integrating AI agents into everyday productivity tools, potentially transforming how we collaborate and work with AI assistance.

Salesforce Agentforce

Salesforce is introducing Agentforce, ushering in what they call the "third wave" of the AI revolution. Agentforce represents a shift towards autonomous AI agents capable of reasoning and tackling multi-faceted projects with minimal human oversight.

Key points:

Built on the Salesforce Platform, leveraging existing customization capabilities
Includes ready-to-deploy agents like Sales Development Rep (SDR) and Service Agent
Utilizes the Einstein Trust Layer for secure and responsible AI deployment
Aims to free up human workers for more strategic, relationship-building tasks
Customizable through low-code Agent Builder, allowing organizations to tailor agents to specific needs

IMO, efforts like this represent careful rollouts of pipelines where LLMs drive some controlled actions. We will need advances in model performance and architecture, plus lots of engineering/experimentation, to achieve value from more autonomous deployments.

Windows Agent Arena

Some of my colleagues at Microsoft recently released the Windows Agent Arena, an open-source benchmark for developing and testing AI agents on Windows operating systems.

Highlights:

Provides a scalable framework for evaluating AI agents that can reason, plan, and act on a PC
Includes over 150 agent tasks across various applications and domains
Allows for parallelized evaluation in Azure, significantly speeding up testing
Uses OmniParser to process screenshots and GPT-4V for decision-making
Current best agent solves 19.5% of tasks, compared to 74.5% human performance

IMO, this project provides critical infrastructure for benchmarking interface agents - agents that address tasks by driving user interfaces designed for humans.
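To make the "parallelized evaluation" and "percent of tasks solved" ideas concrete, here is a minimal sketch in Python. It is not the Windows Agent Arena API: the Task class, toy_agent, evaluate, and the four example tasks are all hypothetical stand-ins. The actual benchmark parallelizes in Azure, whereas this toy version only uses local threads.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Task:
    """A hypothetical benchmark task: an instruction plus a success check."""
    name: str
    instruction: str
    check: Callable[[str], bool]  # inspects the agent's final output


def toy_agent(instruction: str) -> str:
    """Stand-in for a real agent; it only 'solves' instructions that mention 'open'."""
    return "done" if "open" in instruction.lower() else "gave up"


def evaluate(agent: Callable[[str], str], tasks: list[Task], workers: int = 4) -> float:
    """Run every task in parallel and return the fraction of tasks solved."""
    def run_one(task: Task) -> bool:
        return task.check(agent(task.instruction))

    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(run_one, tasks))
    return sum(results) / len(results)


# Illustrative tasks only; these are not taken from the benchmark.
TASKS = [
    Task("notepad", "Open Notepad and type hello", lambda out: out == "done"),
    Task("settings", "Switch the system theme to dark", lambda out: out == "done"),
    Task("browser", "Open the browser and load a page", lambda out: out == "done"),
    Task("files", "Rename report.txt to final.txt", lambda out: out == "done"),
]

success_rate = evaluate(toy_agent, TASKS)
print(f"Success rate: {success_rate:.1%}")
```

Because each task is scored independently, the runs can be farmed out to as many workers as are available, which is why parallelizing a benchmark like this speeds up testing so much. The headline number (for example 19.5% for the best agent versus 74.5% for humans) is just this solved fraction computed over the full task set.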

PaperQA2

PaperQA2 is an advanced AI agent designed for conducting comprehensive scientific literature reviews autonomously, reportedly outperforming PhD and postdoc-level researchers in biology.

Key features:

Capable of finding, summarizing, and synthesizing relevant scientific literature
Refines search parameters based on initial findings
Provides cited, factually grounded answers
Achieves state-of-the-art performance on LitQA2, part of the LAB-Bench benchmark
Open-sourced code available for further research and development
Represents a significant step towards AI-assisted scientific research

These developments showcase the rapid advancement of AI agents across different domains, from business operations to scientific research, indicating a trend towards more autonomous and capable AI systems in various fields.

OpenAI o1: A New Era of AI Reasoning

And of course, OpenAI introduced a new series of AI models called OpenAI o1, designed to excel at complex reasoning tasks. The first model in this series, o1-preview, is now available in ChatGPT and via API access.

Key features of OpenAI o1:

Enhanced reasoning: The models are trained to spend more time thinking through problems, refining their thought processes, and recognizing mistakes.
Improved performance: In tests, the upcoming model update performs similarly to PhD students on challenging tasks in physics, chemistry, and biology. It also shows exceptional abilities in math and coding.
Safety focus: A new safety training approach leverages the model's reasoning capabilities to better adhere to safety and alignment guidelines.
Collaboration with safety institutes: OpenAI has formalized agreements with U.S. and U.K. AI Safety Institutes, granting them early access for evaluation and testing.
Specialized versions: Along with o1-preview, OpenAI is releasing o1-mini, a faster and cheaper model optimized for coding tasks.

IMO, models like this could have significant implications for multi-agent systems - better planning, better reasoning, fewer dumb mistakes.

Learn more at multiagentbook.com/news.

Update: Link to Microsoft Copilot Agents announcement - https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
