When 1 is Bigger than 4 for AI

Tomasz Tunguz

Published: May 3, 2023

I asked ChatGPT about the numbers 1 & 4. Which one is bigger?

Sometimes, 1 was bigger. Other times, 4 was bigger.

Sharon Zhou ran this experiment at scale, showing that the order of yes & no matters in the response.

This is called a non-deterministic or stochastic answer. Similar inputs do not consistently produce identical outputs. The answers have inconsistent logic.
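One way to picture why the same question can get different answers: a language model scores each candidate reply and then samples among them, rather than always taking the top one. The Python sketch below is a toy illustration under that assumption, not a description of how ChatGPT is built. The scores in `toy_scores` and the helper names are invented for the example.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a lower temperature sharpens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def sample_answer(scores, temperature, rng):
    """Draw one answer at random, weighted by its softmax probability."""
    answers = list(scores)
    probs = softmax([scores[a] for a in answers], temperature)
    return rng.choices(answers, weights=probs, k=1)[0]

def greedy_answer(scores):
    """Always return the highest-scoring answer, so the result never varies."""
    return max(scores, key=scores.get)

# Hypothetical scores for the question "Which is bigger, 1 or 4?"
toy_scores = {"4": 1.2, "1": 0.8}

rng = random.Random(0)
samples = [sample_answer(toy_scores, temperature=1.0, rng=rng) for _ in range(1000)]
print("sampled:", samples.count("4"), "x '4',", samples.count("1"), "x '1'")
print("greedy: ", greedy_answer(toy_scores))
```

With sampling, both answers show up across repeated runs; with the greedy rule, the answer is always the same. That difference is the gap between a stochastic and a deterministic system.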

We live with stochastic systems daily: weather reports, ETAs on Google Maps, stock portfolio construction. We are stochastic ourselves: humans can be moody, err in our calculations, or change our minds with new information.

In these conversations, the robot is sometimes wrong, but never in doubt. When a system produces an answer, we should verify the answer is correct. It’s not just logical errors that occur: hallucinations, when the system invents answers that don’t exist, plagued about half of Bing chat results in this Stanford study.

We haven’t calibrated ourselves to the level of doubt to express, yet. Like working with a new colleague, we need to understand their strengths & weaknesses.
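A rough way to start calibrating that doubt is to ask the same question several times and see how often the answers agree. The sketch below assumes a hypothetical `ask` function that sends a prompt to some model and returns its reply as text; here a stand-in `fake_model` replays canned replies so the example runs on its own. Agreement measures consistency only. A model can be consistently wrong, so this complements verification rather than replacing it.

```python
from collections import Counter
from itertools import cycle
from typing import Callable, Tuple

def agreement(ask: Callable[[str], str], prompt: str, runs: int = 10) -> Tuple[str, float]:
    """Ask the same prompt `runs` times; return the most common answer and its share."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    # Normalize whitespace and case so "4" and " 4 " count as the same answer.
    answers = [ask(prompt).strip().lower() for _ in range(runs)]
    top, count = Counter(answers).most_common(1)[0]
    return top, count / runs

# Stand-in for a real model call (an assumption for this sketch):
# it replays a fixed cycle of replies to mimic inconsistent answers.
_canned = cycle(["4", "4", "1", "4"])

def fake_model(prompt: str) -> str:
    return next(_canned)

print(agreement(fake_model, "Which is bigger, 1 or 4?", runs=8))  # ('4', 0.75)
```

A share well below 1.0 is a signal to treat the answer with more doubt, or to send the question to a person or a deterministic tool.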

For consumers, the universe of acceptable outcomes can be quite broad. A rabbit on top of a fire truck has many acceptable answers.

But in the B2B world, consistency matters. Businesses using genAI will demand consistent answers to prompts like these: What is the company’s revenue by region? How do I reset my password? How much would I pay if I used 1000 units of a product?

GenAI will need to write, create, & calculate with a significantly better error rate than humans.
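For a question like the pricing one above, one option is to let the model handle the language and hand the arithmetic to ordinary code, which returns the same result every time. The sketch below is a minimal illustration of that idea. The tiers and prices in `TIERS` are hypothetical and not taken from any real product.

```python
# Hypothetical graduated price tiers: (upper bound in units, price per unit in cents).
# None marks the final, open-ended tier.
TIERS = [(100, 500), (1000, 400), (None, 300)]

def cost_in_cents(units: int) -> int:
    """Compute the bill for `units` under graduated tier pricing, in integer cents."""
    if units < 0:
        raise ValueError("units must be non-negative")
    total = 0
    lower = 0  # number of units already billed by earlier tiers
    for upper, price in TIERS:
        if upper is None or units <= upper:
            # The remaining units all fall inside this tier.
            return total + (units - lower) * price
        total += (upper - lower) * price
        lower = upper
    return total  # not reached while the last tier is open-ended

# The first 100 units cost 500 cents each, the next 900 cost 400 cents each.
print(cost_in_cents(1000))  # 410000
```

Working in integer cents avoids floating-point rounding, so the answer to "how much would I pay for 1000 units?" is identical on every call.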

I’m working with ProductBoard to understand how different B2B startups are planning to leverage AI with a survey. If you’re integrating GenAI into your product & interested to hear others’ plans, please fill it out, & we’ll send you the anonymized raw data. Look for the results to be published in a few weeks.

Nevenka Vuk

Analyst

1 yr

Let me just tell you.. 4 doesn't exist at all.. but 4 plus 1 does..

1 reaction

Karin Maquet

Passionate business coach for start ups and scale ups based in Belgium. Focus 360° - growth strategy, marketing, finance, funding, HR, technology, "Feet on the ground, Head in the sky"

1 yr

Thanks for sharing. Quite confronting for many tech companies jumping on AI. Your survey outcome will be useful.

1 reaction

Leon Bombotas

Analytics and AI Exec | Innovative mindset, technical depth, and business leadership

1 yr

Tomasz Tunguz The prompting challenges are solvable by using/calling plugins like the Python code interpreter. The bigger problem for Enterprise I think is that the underlying data will need to be highly abstracted (cleansed, merged etc, outliers removed, field headings clearly labeled) in order for it to be usable by a gpt interface. Data analysts/scientists refer to this data prep as the "80%" of the work and it remains (for now) a highly manual task.

1 reaction

Wojciech Gryc

Founder, advisor, investor | Working on LLM-powered products

1 yr

Great point. I think developer tools for LLMs, especially around robustness and predictability, are so critical and yet very underdeveloped today. We need these to actually integrate LLMs directly into our workflows.

1 reaction
