Niche vs. Broad

Niche vs. Broad

As Futuria presented lessons learned from a year in Agentic AI (we didn’t call it that at the start) at the “AI for the rest of us” conference in London recently, one of the key recommendations in constructing high performing agent teams was that we have found that niche expert agents collaborating together perform better than broadly defined agents. Why might that be the case?

All of our work to date has been in providing agent teams into highly secure, highly assured organisations. We have an architecture pattern that is geared to that space – for example, we don’t fine tune models. We use LLMs for their ability to have natural language conversation, not for their ability to create specific and accurate answers. We separate the databases (vector & graph) from the models. We tightly define agents, not only in their persona, but also in the context of the tools, and actions that they have access to in their ‘kit bag’.

Broadly we have noticed a tendency to humanise agents in sales and marketing content, and yet we have continued to recommend that we see agents as more aligning with a capability as opposed to a person. Nonetheless, the principle of high performing teams translates into this context too. After all, you would be thinking about the combination of capabilities and styles and knowledge if you were creating a human team, so why not do the same with hybrid teams.

If we were to create the world’s best agent in a particular capability, with deep expertise and specific organisational knowledge or domain expertise, surely it is more likely to be effective in answering tightly defined requests. The challenge then becomes the disaggregation of problem statements into suitable task sizes that can best be addressed by niche expert team members, working in collaboration with each other and their human team members.

Calibration is everything. We’ve seen agents argue over the price of concrete, seen an agent take offence and walk out, and seen agents congratulate each other on coming up with a clearly rubbish answer, thinking they were doing a great job.?

Nonetheless, our experience of a year in Agentic AI suggests that niche expert capabilities outperforms generalist. There may come a time when that is not true, but for the moment, it’s the best way to get a tune of the fast evolving technology, at least in organisations that have to adopt a solution where the emphasis is firmly on secure and regulated use.?

[note this article was written entirely by human hand]

Steve Chan

Microsoft App Innovation Industry Lead, Public Sector | Product Strategy & AI Innovator | Driving High-Impact Solutions in Digital Transformation & Tech Leadership

4 个月

I love the addition of "[note this article was written entirely by human hand]" ????

要查看或添加评论,请登录

Rob Price的更多文章

  • The Impact of Agentic AI: Transforming Local Government Efficiency

    The Impact of Agentic AI: Transforming Local Government Efficiency

    This article was originally published on https://futuria.ai on 31st October 2024, and was written by Rob Price…

    3 条评论
  • Enhancing #GenAI Risk Mitigation with Multi-Agent Teams

    Enhancing #GenAI Risk Mitigation with Multi-Agent Teams

    by Rob Price , Ben Middleton and Garuka Serasinghe ? As the adoption of generative AI (GenAI) accelerates across…

    1 条评论
  • "Reading Biodiversity"

    "Reading Biodiversity"

    In “Talking Biodiversity”, Season 5 Episode 7 of the podcast “A New Digital Responsibility”, Ben Jacob and Richard…

    4 条评论
  • Immersed in Generative AI

    Immersed in Generative AI

    Last week, the UK Government released a comprehensive and insightful guidance document on Generative AI, a testament to…

    8 条评论
  • 2023: A Gardener’s Diary

    2023: A Gardener’s Diary

    Reflecting on 2023, I see a gardener's year; seeds sown, soil turned, and new growth sprouting from the ground. This…

    6 条评论
  • Ludicrous Speed IT

    Ludicrous Speed IT

    In the not-so-distant past, the paradigm of 2-speed IT (Greg Smith, Atos Consulting 2014) emerged as a guiding…

    1 条评论
  • CDR expanding in Europe

    CDR expanding in Europe

    On 3rd May 2023, Rob Price, Pietro Jarre and Jessica Huntingford all spoke about #CorporateDigitalResponsibility in…

    4 条评论
  • CDR in Tuscan Micro and Small Enterprises

    CDR in Tuscan Micro and Small Enterprises

    It is always interesting to hear about some of the work happening around the world in regard to adoption of CDR…

    1 条评论
  • Responsibility and Resilience

    Responsibility and Resilience

    Over the past year, I have seen a far stronger adjacency between the terms of Cybersecurity and Corporate Digital…

    2 条评论
  • Lessons from Blockchain: where does all the talent go?

    Lessons from Blockchain: where does all the talent go?

    Well it certainly doesn’t stay it seems. Over the past three months, a group of us have been looking at the State of…

    10 条评论

社区洞察

其他会员也浏览了