The Challenge of the AI Demo
The AI demo isn’t easy. Many of the major AI companies have demoed their AI systems, starting with pre-recorded videos & now pushing into live demos. They don’t always work.
Multiply Murphy’s Law by a non-deterministic system & it’s not unreasonable to expect AI demos to nearly always hiccup.
Demo disruptions aren’t disasters. These systems are early & changing rapidly. Hiccups suggest the system requires work & tuning, not that it faces a fundamental challenge.
But they can be problematic in proofs-of-concept.
Proofs of concept are extended demonstrations of the software. Well-structured PoCs align on success criteria at the outset. These criteria enable vendors & customers to agree on what success looks like.
Workflow proofs-of-concept are relatively straightforward. They are deterministic. Can I process a loan application in 5 minutes? Yes or no.
But as AI applications shift to selling outcomes, implicitly or explicitly, the PoC becomes a testing ground for those outcomes. Non-determinism means sometimes the PoC won’t produce the required wow moment. This also means the PoC criteria must be more flexible.
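One flexible criterion is to judge the PoC on success rate over repeated trials rather than on a single run. The sketch below illustrates the idea; the 90% target rate, trial counts, & confidence level are hypothetical, not figures from any actual PoC.

```python
import math

def poc_passes(successes: int, trials: int, target_rate: float = 0.9,
               confidence_z: float = 1.645) -> bool:
    """Judge a non-deterministic PoC by its success rate over many
    trials, not a single wow-moment run (illustrative sketch)."""
    rate = successes / trials
    # Normal-approximation lower bound on the true success rate
    stderr = math.sqrt(rate * (1 - rate) / trials)
    lower_bound = rate - confidence_z * stderr
    return lower_bound >= target_rate

# 46 good runs of 50 is 92% observed, but at this sample size the
# lower confidence bound falls below a 90% target, so it fails
print(poc_passes(46, 50))
# 49 of 50 clears the bar
print(poc_passes(49, 50))
```

Framing the criterion this way lets vendor & customer agree up front that some individual runs will hiccup, while still holding the system to a measurable bar.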
How does a buyer evaluate a probabilistic system?
Do we compare it to human performance? Practitioners have shared with us that human labelers typically agree 60-70% of the time. Does an AI system need to be as accurate as a human, assuming it will be much less expensive? Or will we expect more, as we do with self-driving cars?
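The human-labeler comparison can be made concrete by measuring pairwise agreement: how often does the model agree with each human, relative to how often the humans agree with each other? The labels below are invented for illustration.

```python
from itertools import combinations

def pairwise_agreement(label_sets):
    """Mean fraction of items on which each pair of labelers agrees."""
    scores = [
        sum(x == y for x, y in zip(a, b)) / len(a)
        for a, b in combinations(label_sets, 2)
    ]
    return sum(scores) / len(scores)

# Hypothetical labels on 10 items from three human reviewers
humans = [
    list("AABABBABAA"),
    list("AABBBBABAA"),
    list("AABABBBBAA"),
]
model = list("AABABBABAA")  # the system's labels on the same items

human_baseline = pairwise_agreement(humans)
model_score = sum(pairwise_agreement([model, h]) for h in humans) / len(humans)
print(round(human_baseline, 2), round(model_score, 2))
```

If the model agrees with the humans at least as often as they agree with each other, "human parity" is a defensible claim even when the humans themselves disagree 30-40% of the time.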
If AI systems require human assistance, then the ROI of the system must include some human operating expense - whether explicit or implicit.
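That human expense can be folded into a simple ROI model where some share of tasks escalates to a person. Every figure here is a made-up placeholder to show the shape of the calculation.

```python
def agent_roi(tasks_per_month: int, value_per_task: float,
              agent_cost_per_task: float, escalation_rate: float,
              human_cost_per_escalation: float) -> float:
    """Monthly ROI of an AI system that escalates a fraction of
    tasks to a human reviewer (all inputs hypothetical)."""
    value = tasks_per_month * value_per_task
    agent_cost = tasks_per_month * agent_cost_per_task
    human_cost = tasks_per_month * escalation_rate * human_cost_per_escalation
    total_cost = agent_cost + human_cost
    return (value - total_cost) / total_cost

# 10k tasks worth $2 each; $0.10 agent cost per task;
# 15% escalated to a human at $3 per escalation
print(round(agent_roi(10_000, 2.0, 0.10, 0.15, 3.0), 2))
```

Note that in this toy example the implicit human cost dwarfs the agent cost, which is why leaving it out of the ROI math overstates the system’s value.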
Some teams will want to benchmark systems in parallel to determine the relative performance. With most startups building atop existing models & setting aside differences in fine-tuning, the ultimate performance should be relatively comparable, provided they use the same data sets. Will startups compete on access to different data sets?
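A parallel benchmark is straightforward to sketch: run every candidate system over the identical data set so the comparison isolates the vendor, not the data. The vendor names & toy task below are stand-ins.

```python
def benchmark(models, dataset):
    """Score competing systems on the identical data set (sketch;
    `models` maps a vendor name to a hypothetical predict function)."""
    return {
        name: sum(predict(x) == y for x, y in dataset) / len(dataset)
        for name, predict in models.items()
    }

# Toy classification task: is the number even?
dataset = [(n, n % 2 == 0) for n in range(100)]
vendors = {
    "vendor_a": lambda n: n % 2 == 0,            # perfect on this data
    "vendor_b": lambda n: n % 2 == 0 or n == 7,  # one spurious positive
}
print(benchmark(vendors, dataset))
```

With startups built atop the same underlying models, scores like these should cluster tightly, which is exactly why differentiated data access may matter more than the benchmark itself.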
Today, there are more questions than answers about how to sell AI agent systems. We’re hosting an event on the evening of Sep 10th in San Francisco, moderated by Dave Morse, former CRO at Hebbia & VPS/VPCS at ScaleAI, to interview leaders in the space about some of these questions.
If you’re interested to attend, see the details here.
Data Science & AI Expert: Advancing Self-Organizing Systems and Artificial Intelligence for Innovative Solutions
1 mo
Add to the non-determinism of the AI system that you are demoing the open and often surprising dynamics that a real-time environment throws at you! As we shift from pre-trained models to continuously and autonomously executing agents, scripting a demo *with* them takes extensive practice. Nathaniel Green
Data, Gen AI and Agentic AI Science Engineering
1 mo
Error acceptance might be use-case dependent; also, build vs. buy is becoming an obvious question for enterprise use cases. So in my opinion, for those startups selling solutions, finding the right niche and market will be a good bet.
CEO @ Prodigal | Lending intelligence
1 mo
Love this quote in Morgan Housel's book. That becomes especially true when buyers need to commit dollars and/or time. Even then, as the industry goes through an early adoption phase (or is it over already?!), it is almost necessary to work with those who are comfortable with the future and be 100% honest with them.
Tomasz, I like your points here on the challenges with live demos. As you discuss proofs-of-concept (PoCs) and measuring success, I would propose that what we should be looking at is proofs-of-value (PoVs). My partner, Seth Earley, and I believe this is the path to real impact and to confirming opportunity in the AI space. While PoVs are not quick demos, as they require time with the raw data, real data architecture work, and real implementation effort for the AI itself, they are still short projects that directly demonstrate the success (or failure) of the AI applied to a discrete business problem. Measuring this impact is directly comparable to the process step it replaces.