
In a world of Large ACTION Models, what does the future of computing hold?

Izam Ryan

Associate Director @ K3 Advantage | Driving Value Creation with Strategic, Data-Driven Insights

Published: 21 April 2024

So last night I was reading a bedtime story to my daughter - we come up with a prompt together and set the task for Google Gemini. We had some great success with "Tell me a bedtime story of Warrior Princess Alysha. She meets an old lady on her way to the market. Alysha has empathy for the old lady and was rewarded with three magical beans. Alysha has an adventure in the clouds after climbing the resulting magical beanstalk."

But one time we had a typo and Gemini was prompted, "Tell me the bedtime story of Process Alysha who meets a dashing prince from another kingdom".

Our protagonist quickly gained analytical and engineering skills and, by applying those STEM skills, brought progress and joy to her loyal subjects through rural electrification projects, high-efficiency railway infrastructure investments and improvements to regional governance models.

It got me thinking about this world of increasingly sophisticated Foundation Models (FMs) - you probably know them as "Large Language Models" like ChatGPT's GPT-4 (Turbo!) and Google's Gemini (and Anthropic's Claude!). What will the future hold for us consultants? What would an LLM embedded in our daily operating systems look like?

Well, for starters, the paradigm of mouse and keyboard will slowly evolve into an augmented M-A-K UI: Mouse-and-Agent-and-Keyboard. After watching us interact with computers for long enough, a LAM (Large Action Model) embedded in the OS's UI would build a library of Actions (not just Language). What do I mean by that?

"Get the latest news and summarise the headlines for me." It'll learn to open your favourite news sites, ingest the data and aggregate it.

"Book train tickets for my weekend trip" and it'll open the price comparison sites, your diary, get the right routes and book everything.

"Suggest ideas for meal prep next week, summarise suggested ingredients that meet my nutritional needs, and prepare a shopping list with my preferred suppliers" and it'll run the workflow.

After all, to the outside world my interactions with the Internet are just that: clicks, forms, emails and so on. If all of that were intermediated by a Foundation Model (FM), what would the future look like?
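
Before speculating further, here is one way to picture that library of Actions. This is only a toy sketch in Python: the `Action` and `ActionLibrary` names, their fields and the exact-match lookup are all my own assumptions for illustration, not how any real LAM works. A real model would learn the steps by watching the user and would generalise across differently worded requests.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    """One low-level UI step (hypothetical schema, for illustration only)."""
    kind: str    # e.g. "open", "read", "summarise"
    target: str  # e.g. a site, an app or a piece of content


@dataclass
class ActionLibrary:
    """Maps a request to a recorded sequence of Actions."""
    workflows: dict[str, list[Action]] = field(default_factory=dict)

    def learn(self, request: str, steps: list[Action]) -> None:
        # A real LAM would infer these steps from observation;
        # here they are simply recorded by hand.
        self.workflows[request.lower()] = steps

    def plan(self, request: str) -> list[Action]:
        # Case-insensitive exact match stands in for the model's
        # ability to generalise; unknown requests yield no steps.
        return self.workflows.get(request.lower(), [])


library = ActionLibrary()
library.learn(
    "Get the latest news and summarise the headlines for me",
    [
        Action("open", "favourite news sites"),
        Action("read", "headlines"),
        Action("summarise", "headlines"),
    ],
)

for step in library.plan("get the latest news and summarise the headlines for me"):
    print(f"{step.kind}: {step.target}")
```

The point of the sketch is the shape of the data: a request in plain language on one side, and a replayable sequence of clicks, reads and form fills on the other.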

The way we work will change subtly. Introductions at the beginning of work calls will become more important, as will synthesising action points and summaries. Because you never know which attendees are running with FMs, you need to help out their AIs by introducing yourself (so that the FM can build a model of what your voice sounds like) and summarising the key points (inevitably someone's going to put the call transcript through an FM asking "who was assigned which tasks, and by when?"). Ever get that feeling when you're on a call but the other party has you on loudspeaker and someone else in the room at their end is listening in? Get used to that feeling! With audio transcription and LLMs being invited along to work calls, expect that an FM somewhere is going to be analysing your speech and video feeds.

Will we be using Agents to represent us in our online interactions? With FMs being able to generate video and audio as well as text - could you imagine a time where we instruct our FM-based Agents to intermediate for us in the world? With the amount of AI-generated content being put out there - what's the future role of hand-crafted, tailored, specific content? Conversely - what about the role of Bad Actors in a world dominated by AI-generated content? With the rise of false news and extreme opinions - how hard would it be to stage a performance for the "news" outlets and social media feeds to sway popular opinion?

Will we be using Agents to proxy for us in learning environments? Imagine an FM that is tailored to early years education: a storybook that is constantly learning from the child's lived environment and adapts its curriculum and lessons accordingly. Kid needs to pick up a second or third language? Imagine the FM automatically generating Duolingo-style interactive language classes, but drawing on themes and subject matter from the child's lived environment.

What will be the role of traditionally "people-oriented" service professions? Sports management, public relations, media & communications, design, consulting, accountancy and the legal professions? I uploaded a video of a fencing duel to Gemini and asked for its analysis - surprisingly, Gemini was able to (after a bit of prompting) produce a well-informed analysis and summary, roughly on the level of a graduate learner with some modicum of skill. I asked Gemini to break down the techniques used in the video and asked for feedback - the answers were actually not too far off.

In a world of FMs, LLMs and LAMs - what's a professional to do? Well, now more than ever, qualities like empathy, creativity, judgement and critical thinking will matter. Where FMs can handle the gruntwork, it now falls to us to direct the FMs, and even architect workflows around them (organising FMs to coordinate the work of other FMs!).

Just my musings on a Sunday evening - feel free to comment or speculate further in the comments section below!

Vladimir Parkov

Program & Project Leader | Strategic Planning, Data Analytics, Digital & AI Transformation | I Help Businesses Boost ROI with Data-Driven Strategies

10 months ago

Izam Ryan, great stuff. It's really thought-provoking. I'm adding my two cents here: for LAMs to be subtle and widely adopted, ironically, they need to be disruptive at the same time. That can be achieved only if these agents gather a lot of contextual information about you—all of your interactions with digital surfaces and beyond. Will you give them permission to do so to become the AI-augmented man of tomorrow? Embedded artificial intelligence means changing natural intelligence a lot :)


