Is the World Ready for Autonomous AI Agents, and Are They Autonomous, or Even Intelligent?
Craig Martin
Director | Strategic Innovation & Experimentation | Guiding clients to test and refine business decisions before investment and execution
If you’ve had a chance to explore OpenAI’s new Operator functionality, you’re probably as intrigued as I am about how these intelligent agents attempt to solve problems for us logically and autonomously. It’s fascinating, and in many ways it echoes technological promises we’ve heard before.
Many years ago, we experimented with ambitious tasks like: “Find me my top 10 customers, arrange customer visits, book flights, hotels, and taxis, and keep my boss and my wife updated about my travel plans.” At that time, Service-Oriented Architecture (SOA) was hailed as the solution to seamlessly automate these processes. Unfortunately, reality couldn’t quite live up to the hype.
Fast forward to today, and we find ourselves once again exploring the promise of intelligent automation, now through “Operators” and intelligent agents. Observing the OpenAI Operator, I saw first-hand the classic problem faced by agent automation: integrating smoothly with real-world applications. Currently, Operator agents take a practical approach, automating interactions via a web browser, an integration style that closely resembles Robotic Process Automation (RPA).
For instance, when I asked Operator to find and book the cheapest tickets for a Hans Zimmer concert in London, I watched, part amused and part frustrated, as it navigated multiple sites until it hit a roadblock: gatekeeping systems explicitly designed to stop automated bots. The irony wasn’t lost on me. Our modern internet landscape is littered with protective measures like CAPTCHAs, specifically designed to verify human identity. Yet we’re rapidly moving towards a future where sophisticated AI agents or avatars perform tasks on our behalf, blurring the line between human and bot interactions.
This introduces multiple layers of irony. We’ve previously witnessed disintermediation (remember the disappearance of travel agents?), effectively turning each of us into our own travel agents. Now we’re automating these tasks again, reinstating digital agents as intermediaries. But to realise this future smoothly, we’ll need to evolve gatekeeping systems from simplistic “human or bot” distinctions to more nuanced approaches acknowledging legitimate agent interactions.
This evolution raises critical governance and legal challenges. What’s the legal status of an AI agent making financial transactions or purchases on my behalf? If an agent makes a mistake or conducts unauthorised activities, who bears the responsibility? These challenges closely mirror debates surrounding autonomous vehicles—where liability for accidents remains an unresolved issue. Is responsibility held by the developer, the manufacturer, the vehicle owner, or the software itself? As always, regulatory and governance frameworks significantly lag behind technological advancements. Addressing these challenges demands urgent regulatory clarity, thoughtful policy-making, and robust governance mechanisms.
We may soon reach a tipping point where current gatekeeping measures—such as CAPTCHAs—become obsolete. Perhaps a new business opportunity lies in developing mechanisms for agents to securely identify and authenticate themselves through digital handshakes with updated, intelligent CAPTCHAs. Imagine websites confidently identifying specific named users via trusted agents. Monetising that authentication process through blockchain technologies could create genuinely exciting possibilities, where users willingly trade verified identities for discounts or perks—effectively “selling” their digital selves securely and transparently.
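To make the “digital handshake” idea slightly more concrete, here is a minimal sketch of a challenge-response exchange a site could run in place of a CAPTCHA. Everything in it is an assumption for illustration: the agent identifier, the shared secret, and the use of HMAC. A real scheme would more likely rely on key pairs or verifiable credentials, with any blockchain-based monetisation layered on top.

```python
import hashlib
import hmac
import secrets

# Hypothetical shared secret issued to a user's trusted agent when it registers
# with a site. In a real scheme this would more likely be an asymmetric key
# pair or a verifiable credential; HMAC keeps the sketch short.
AGENT_SECRET = secrets.token_bytes(32)

def issue_challenge() -> bytes:
    """Site side: generate a one-time nonce instead of showing a CAPTCHA."""
    return secrets.token_bytes(16)

def agent_respond(nonce: bytes, agent_id: str, secret: bytes) -> bytes:
    """Agent side: prove possession of the secret registered for this identity."""
    return hmac.new(secret, agent_id.encode() + nonce, hashlib.sha256).digest()

def site_verify(nonce: bytes, agent_id: str, response: bytes, secret: bytes) -> bool:
    """Site side: recompute the expected response and compare in constant time."""
    expected = hmac.new(secret, agent_id.encode() + nonce, hashlib.sha256).digest()
    return hmac.compare_digest(expected, response)

# A single handshake round trip for a hypothetical named agent.
nonce = issue_challenge()
proof = agent_respond(nonce, "craig-martin/operator-agent", AGENT_SECRET)
print("agent verified:", site_verify(nonce, "craig-martin/operator-agent", proof, AGENT_SECRET))
```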
If you’ve experimented with agent-based offerings in the market, you will also have noticed how linear they are. Truth be told, they’re quite similar to the IFTTT setups that have been around for a while, albeit more “intelligent” in determining the next step. However, true agent-based systems need to become more decentralised, autonomous, and self-learning.
This will leverage some of the more traditional branches of agent-based technology, such as Multi-Agent Systems (MAS), Complex Adaptive Systems (CAS), and Reinforcement Learning (RL). MAS isn’t new; it has been around since at least the late 80s and early 90s. It’s a field of artificial intelligence in which multiple autonomous agents interact within a shared environment, communicating, cooperating, negotiating, or competing to solve complex problems collectively rather than individually.

Recent advancements have also given us MCP (Model Context Protocol), an emerging approach to standardising how AI models, particularly autonomous agents, manage and share context. It defines a structured way for agents to access, maintain, and synchronise contextual data, enabling better coordination, continuity, and more coherent interactions. Think of MCP as the “middleware for context”: it allows AI agents to remain synchronised, aware, and consistent as they independently and autonomously move across different environments, executing tasks and interacting not only with each other but also with internal and external systems.
Let’s make this practical by extending my current Operator use case into a future-oriented version. Returning to my problem of automatically booking seats for a sold-out Hans Zimmer concert in London, an AI agent faces multiple challenges, some of which I experienced in the current iteration of OpenAI’s Operator.
Solving for this scenario requires sophisticated interaction and coordination among multiple autonomous agents—precisely where MAS, CAS, MCP, and RL become powerful allies.
1. Multi-Agent Systems (MAS):
MAS involves multiple autonomous agents working together. Here’s how it helps:
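As a deliberately tiny, hypothetical illustration (the agent classes, the offer structure, and the mock listings below are my own assumptions, not any real platform’s API), one agent could scan ticket sources and publish offers while another decides which offer to act on, the two coordinating through a shared message channel:

```python
from dataclasses import dataclass
from queue import Queue

# Hypothetical offer format; real ticket sites expose nothing this tidy.
@dataclass
class Offer:
    site: str
    price: float

class SearchAgent:
    """Scans (mock) ticket sources and publishes offers for other agents."""
    def __init__(self, outbox: Queue):
        self.outbox = outbox

    def run(self, listings: list[Offer]) -> None:
        for offer in listings:
            self.outbox.put(offer)
        self.outbox.put(None)  # signal that the search is finished

class BookingAgent:
    """Consumes offers and decides which one to act on, within a budget."""
    def __init__(self, inbox: Queue, budget: float):
        self.inbox = inbox
        self.budget = budget

    def run(self) -> Offer | None:
        best = None
        while (offer := self.inbox.get()) is not None:
            if offer.price <= self.budget and (best is None or offer.price < best.price):
                best = offer
        return best

# Wire the two agents together through a shared message queue.
channel: Queue = Queue()
SearchAgent(channel).run([Offer("resale-site-a", 180.0), Offer("resale-site-b", 145.0)])
print(BookingAgent(channel, budget=160.0).run())  # -> Offer(site='resale-site-b', price=145.0)
```

In a fuller MAS, many more specialised agents (payment, calendar, notification) would negotiate over the same kind of channel rather than a single linear flow.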
2. Complex Adaptive Systems (CAS):
Concert ticket availability, prices, and user behaviour form a complex adaptive system—continuously evolving and unpredictable:
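A toy simulation illustrates the point; the numbers and the pricing rule below are entirely made up, but they show how a feedback loop between demand and price keeps shifting the ground under any fixed buying rule, which is why the agent itself has to keep adapting:

```python
import random

random.seed(42)  # reproducible run of an intentionally simplified market

price = 120.0   # current resale price for one seat
demand = 50     # number of buyers (human or agent) watching the listing
budget = 160.0  # our agent's ceiling per seat

# A toy feedback loop: demand shifts, price adapts to demand, and the agent
# adapts its own behaviour (wait or buy) to the price it observes.
for hour in range(12):
    demand = max(0, demand + random.randint(-10, 15))
    price *= 1 + 0.004 * (demand - 50)            # more watchers push the price up
    should_buy = price <= budget and demand > 60  # crude rule: buy before demand prices us out
    print(f"hour {hour:2d}: demand={demand:3d} price={price:7.2f} buy={should_buy}")
    if should_buy:
        break
```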
3. Model Context Protocol (MCP):
MCP brings standardised, structured context management, significantly enhancing agents’ real-world effectiveness:
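The snippet below is not the actual MCP specification or SDK; the field names are invented. It simply sketches the underlying idea: one structured, versioned context object handed from agent to agent, so that each one starts with the goal, the constraints, and what has already been tried:

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# A made-up context record. The field names are illustrative only and are not
# taken from the actual Model Context Protocol specification.
@dataclass
class TaskContext:
    goal: str
    constraints: dict
    history: list = field(default_factory=list)
    version: int = 0

    def update(self, agent: str, note: str) -> None:
        """Record what an agent did, so the next agent starts fully informed."""
        self.history.append({
            "agent": agent,
            "note": note,
            "at": datetime.now(timezone.utc).isoformat(),
        })
        self.version += 1

# One shared, versioned context object instead of each agent rediscovering the
# goal, the budget, and what has already been attempted.
ctx = TaskContext(
    goal="Book two seats for the Hans Zimmer concert in London",
    constraints={"max_price_per_seat": 160, "seats_together": True},
)
ctx.update("search-agent", "Found seats on resale-site-b at 145 each")
ctx.update("booking-agent", "Held the seats; awaiting payment approval")
print(ctx.version, ctx.history[-1]["note"])
```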
4. Reinforcement Learning (RL):
RL enables agents to improve strategies over time through continuous trial and error:
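One of the simplest RL formulations, a multi-armed bandit with an epsilon-greedy policy, already captures the idea. The site names and hit rates below are fabricated for illustration; the point is that the agent learns from its own outcomes which ticket source is worth polling, rather than being told in advance:

```python
import random

random.seed(7)

# Hypothetical average chance that polling a given source turns up an available
# seat. In reality these rates are unknown; the agent has to learn them.
TRUE_HIT_RATE = {"official-resale": 0.05, "fan-exchange": 0.15, "marketplace": 0.10}

estimates = {site: 0.0 for site in TRUE_HIT_RATE}  # learned value per source
counts = {site: 0 for site in TRUE_HIT_RATE}
EPSILON = 0.1  # fraction of polls spent exploring instead of exploiting

for _ in range(2000):
    if random.random() < EPSILON:
        site = random.choice(list(TRUE_HIT_RATE))   # explore a random source
    else:
        site = max(estimates, key=estimates.get)    # exploit the best so far
    reward = 1.0 if random.random() < TRUE_HIT_RATE[site] else 0.0
    counts[site] += 1
    # Incremental mean: nudge the estimate towards the latest observed reward.
    estimates[site] += (reward - estimates[site]) / counts[site]

print({site: round(v, 3) for site, v in estimates.items()})
print("preferred source:", max(estimates, key=estimates.get))
```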
This comprehensive integration is where agent-based technologies really begin to deliver. From what I’ve observed, we still have some distance to cover before platforms offer this level of integrated multi-agent functionality. While certain pieces exist, no single integrated platform currently delivers it all. I reckon the first to master this integration will dominate the market.
It’s a fascinating new world we’re stepping into, full of potential—but also demanding careful consideration, strategic design, and well-defined governance. Are we ready for it?