How Often Does AI Hallucinate?
John Andrews
Creative Problem Solver | Retail Co-Innovation Leader | Marketing Technologist
It Depends On The Platform.
Generative AI makes things up sometimes. Vectara, an AI search platform, has created an open-source chatbot Hallucination Leaderboard on GitHub. Hallucination rates ranged from about 3% for OpenAI's GPT-4 to over 27% for Google's PaLM 2 Chat.
This is predictably disconcerting for users and for businesses seeking to integrate these models into their workflows. We are used to computer systems being precise and the information they provide being reliable, so when we get incorrect, false, or just plain made-up details, we are understandably uncomfortable.
Like humans, generative platforms aren't precision systems. They are the sum of the content on which they have been trained, which itself isn't always correct or factual and is sometimes intentionally misleading. If we have learned anything from social media, it is that 'facts' are open to interpretation. Even fact-checking is open to interpretation, depending on the entity doing the checking. And there is simply the need to dig deeper into AI-provided information, as the infamous legal brief filled with hallucinated case citations demonstrated.
Vectara's approach is an encouraging step toward building safeguards and checks around the information generative AI creates. How often do the platforms return hallucinations from sets of known data, and how skewed is the data being returned?
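To make that measurement concrete, here is a minimal, hypothetical sketch of such a harness: each model summarizes known documents, a factual-consistency scorer grades each summary against its source, and the hallucination rate is the fraction of summaries falling below a threshold. The `summarize` and `consistency_score` functions are stand-in stubs, not Vectara's actual pipeline or any vendor's API, and the 0.5 threshold is an arbitrary assumption.

```python
# Minimal sketch of a hallucination-rate harness in the spirit of
# Vectara's leaderboard. Both stub functions are hypothetical
# placeholders, not Vectara's real pipeline or any vendor API.

def summarize(model_name: str, document: str) -> str:
    """Stub: ask the named model to summarize the document.
    A real harness would call the model's API here."""
    return document[:100]  # placeholder output

def consistency_score(document: str, summary: str) -> float:
    """Stub: 1.0 means the summary is fully supported by the document,
    lower means it introduces unsupported claims. A real harness would
    use a trained factual-consistency classifier here."""
    return 1.0 if summary in document else 0.0  # placeholder logic

def hallucination_rate(model_name: str, documents: list[str],
                       threshold: float = 0.5) -> float:
    """Fraction of summaries judged hallucinated, i.e., whose
    consistency score falls below the threshold."""
    flagged = sum(
        1 for doc in documents
        if consistency_score(doc, summarize(model_name, doc)) < threshold
    )
    return flagged / len(documents)

if __name__ == "__main__":
    docs = ["The meeting is on Tuesday at 3pm in Room 204."]
    print(f"hallucination rate: {hallucination_rate('some-model', docs):.1%}")
```

A published leaderboard is essentially this loop run over many documents and many models, and the choice of scorer and threshold itself becomes a point of debate, which is exactly the question below.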
I imagine we'll see more models like this developing quickly, which creates another dynamic: who decides what is a hallucination and what is not? As with the factual debates in the social media realm, who is the arbiter of truth, and on what information is that truth based?
Cambridge Dictionary recently picked 'hallucinate' as its word of the year, citing the word's new AI sense. Some people object to the term because it anthropomorphizes AI, but it has taken hold in the common lexicon. At this point, most users are aware that AI isn't infallible and will even agree with false premises when challenged. Most platforms now label this behavior, and newer AI products like Snapchat's My AI make light of it by apologizing in advance.
The bottom line is that AI is developing at a blistering pace, and it will not only hallucinate but also be managed in ways that may or may not align with the facts. As AI platforms begin to interact with one another, this phenomenon will morph in unexpected ways. In that sense, they are becoming more human than not.
Web3 Educator | Storyteller | Digital Growth Strategist
10mo
These are profound thoughts. I hope developers are looking into checking these errors to prevent the machines from morphing into us.
Web3 Educator | Storyteller | Digital Growth Strategist
10mo
Very interesting topic.
Storytelling; Analytics; Measurement; Multimedia Content Strategy | Teaching all of the above
11mo
We're dealing with this quite a bit in higher ed. You have to know what the truth is before you can tell whether that's what you're getting.
Meeting customer needs globally by managing Professional Services, Technical Support, and Customer Success Management | Recognizing the Voice of the Customer and enhancing the Customer Experience
11mo
Hallucination is not a purely random or uncontrolled event. If you are working on putting an LLM generative AI component into your workflow, you need to know not only the source of the model but also the context window size (how much the model can process in a single query), the training/tuning methods, and other factors. You can design your solution so that hallucination is NOT a factor. That said, one risk of relying on a supplier like OpenAI for a product like ChatGPT-4 is that they won't divulge some of the key information you need for a more deterministic design, can change the tuning at any time, and can become unavailable or overloaded at any time.
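As a concrete illustration of the kind of design this comment describes, here is a minimal, hypothetical sketch of grounding a query in retrieved context while respecting a fixed context-window budget. The 4-characters-per-token estimate, the 4,096-token budget, and the prompt wording are all illustrative assumptions, not any specific model's real tokenizer, limit, or API.

```python
# Minimal sketch: build a grounded prompt that fits a context window.
# The token estimator and the 4,096-token budget are illustrative
# assumptions, not any specific model's real tokenizer or limit.

def estimate_tokens(text: str) -> int:
    """Rough heuristic: roughly 4 characters per token for English text."""
    return max(1, len(text) // 4)

def build_grounded_prompt(question: str, passages: list[str],
                          budget_tokens: int = 4096) -> str:
    """Pack as many retrieved passages as fit in the budget, and
    instruct the model to answer ONLY from them. Constraining the
    model to supplied context is one common way to reduce (not
    eliminate) hallucination."""
    instructions = (
        "Answer using ONLY the context below. "
        "If the answer is not in the context, say 'I don't know.'\n\n"
    )
    remaining = budget_tokens - estimate_tokens(instructions + question)
    context_parts = []
    for passage in passages:
        cost = estimate_tokens(passage)
        if cost > remaining:
            break  # stop before overflowing the context window
        context_parts.append(passage)
        remaining -= cost
    context = "\n---\n".join(context_parts)
    return f"{instructions}Context:\n{context}\n\nQuestion: {question}"

if __name__ == "__main__":
    prompt = build_grounded_prompt(
        "When is the meeting?",
        ["The meeting is on Tuesday at 3pm in Room 204.",
         "Parking is available in Lot B."],
    )
    print(prompt)
```

The design choice here is to make the model's job verification-friendly: when the answer must come from supplied context, a reviewer (human or automated) can check the output against that context rather than against the open world.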
Logistics Ambassador who is Logistically Obsessed | Co-Founder MonarKonnect
11 个月You know I read these post have a great thought go to comment and there is John's comments which echos my thoughts lol