To Be or Not To Be an Agent

To Be or Not To Be an Agent

Should software assist humans or act on their behalf?

In 2016, the question was easy to answer : sell Ironman not Robocop. Technology hadn’t reached the level of sophistication we have attained today where AI is 90% as capable as a high-school student, the MMLU benchmark for AI is precisely this.

The next generation of software startups have a strategic question with different terminology & potentially a different conclusion.

To be or not to be an agent, acting on behalf of workers?

Copilots, like Github’s, complete their humans’ sentences in code, an AI pair programmer. Copilots have proven to increase productivity by 50-75% according to data points from Microsoft & ServiceNow.

Devin AI, the world’s first AI software engineer aka agent, authors software in place of a human.

We don’t yet know how productive agents will be, but if Devin is any indication, the productivity gains could mirror that of robots in manufacturing.

A 2020 MIT study found 1 robot in a manufacturing facility replaced 3.3 workers. Instead of a 50% improvement, these robots were 2.3x more productive. Robots can work 24h in contrast to a human’s shift of 8 hours, which increases productivity 2.0x simply through longer shifts.1

Can a startup capture more value selling a copilot with 50% performance improvement or an agent with a 230% performance improvement?

It hinges on a key development in GTM - the subject of tomorrow’s post.


1I’m ignoring maintenance & downtime in this naive calculation.

Bogdan Grigorescu

Sr Tech Lead | Engineering | Automation

11 个月

Do we want machines to have agency? Acting on someone's behalf means having agency.

Christel-Silvia Fischer

DER BUNTE VOGEL ?? Internationaler Wissenstransfer - Influencerin bei Corporate Influencer Club | Wirtschaftswissenschaften Universit?t Münster

11 个月

Vielen Dank Tomasz Tunguz

Scott Gooding

CEO | Sr. Sales Manager, VP of Sales

11 个月

GOP equipment holdings LLC is looking for consultants and agents in every country in the world, except United States, Nigeria, and the Middle East

Khyati Sundaram

CEO @ Applied | B2B SaaS, Venture funding, M&A execution

11 个月

Ironman vs. Robocop is highly dependent on use cases. High risk use case I def want ironman - jury still out what the world will see manifested

要查看或添加评论,请登录

Tomasz Tunguz的更多文章

  • Four Marketing Principles That Redefine Markets from Klaviyo's Former CMO

    Four Marketing Principles That Redefine Markets from Klaviyo's Former CMO

    During a recent Theory Office Hours with Kady Srinivasan, former CMO at Lightspeed Commerce, Dropbox, and Klaviyo, we…

    4 条评论
  • The Complete Guide to SaaS Pricing Strategy

    The Complete Guide to SaaS Pricing Strategy

    Most startups play defense when discussing pricing with customers. They dance between asking for too little, leaving…

    18 条评论
  • What Happened to My Traffic?

    What Happened to My Traffic?

    Chegg filed suit against Google for changes in their algorithm forcing the company to consider a sale. They allege the…

    4 条评论
  • AI Fluency : The Next Interviewing Skill

    AI Fluency : The Next Interviewing Skill

    Algorithms needed for unpredictable journey. Significant compute costs, endless data processing, long periods of…

    7 条评论
  • Auctions in AI : Cost of Capital as a Strategic Advantage

    Auctions in AI : Cost of Capital as a Strategic Advantage

    A decade ago, most startup pitches ended with a calculation justifying the amount they sought to raise. In other words,…

    7 条评论
  • The AI Elbow's Impact : What Reasoning Means for Business

    The AI Elbow's Impact : What Reasoning Means for Business

    October 2024 marked a critical inflection point in AI development. Hidden in the performance data, a subtle elbow…

    8 条评论
  • Theory is Looking for a Head of AI

    Theory is Looking for a Head of AI

    Theory’s name isn’t just a name - it’s our ethos. We develop & test theories about the future of technology, business…

    5 条评论
  • Fast-Track Your Growth: GTM & Marketing Office Hours for SF Founders

    Fast-Track Your Growth: GTM & Marketing Office Hours for SF Founders

    For pre-seed to Series B founders, navigating GTM strategy, marketing, and positioning can be challenging. When should…

    8 条评论
  • Faster Sales Cycles & Software Buyer Confidence

    Faster Sales Cycles & Software Buyer Confidence

    Cloudflare’s earnings last week revealed something more significant than just company optimism: a fundamental shift in…

    4 条评论
  • AI Impact Curves

    AI Impact Curves

    What is the impact of AI across different levels of seniority? Over the weekend, I read Sergey Tselovalnikov’s post on…

    10 条评论

社区洞察

其他会员也浏览了