How Often Does AI Hallucinate?

It Depends On The Platform.

Generative AI makes things up sometimes. Vectara, an AI search platform, maintains an open-source chatbot hallucination leaderboard on GitHub. Hallucination rates ranged from about 3% for OpenAI's GPT-4 to over 27% for Google's PaLM 2 Chat.
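Under the hood, the leaderboard scores whether a model's summary of a document is actually supported by that document. Here is a minimal sketch of that idea, assuming the CrossEncoder interface published on the evaluation model's Hugging Face card (treat the exact loading code as an assumption and verify it against the current card):

```python
# Sketch: score factual consistency between a source text and a generated
# summary, assuming Vectara's open hallucination evaluation model exposes
# the sentence-transformers CrossEncoder interface from its model card.
from sentence_transformers import CrossEncoder

model = CrossEncoder("vectara/hallucination_evaluation_model")

source = "The company reported revenue of $10M in Q3."
summary = "The company reported revenue of $12M in Q3."

# predict() returns a consistency score in [0, 1]; scores near 0 suggest
# the summary is not supported by the source, i.e. a likely hallucination.
score = model.predict([[source, summary]])[0]
print(f"Consistency score: {score:.3f}")
```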

This is predictably disconcerting for users and for businesses seeking to integrate these models into their workflows. We are used to computer systems being precise and to the information they provide being reliable, so when we get incorrect, false, or simply made-up details, we are understandably uncomfortable.

Like humans, generative platforms aren't precision systems. They are the sum of the content on which they were trained, which itself isn't always correct or factual and is sometimes intentionally misleading. If we have learned anything from social media, it is that 'facts' are open to interpretation. Even fact-checking is open to interpretation, depending on the entity doing the checking. And there is a plain need to dig deeper into AI-provided information, as the infamous legal brief submitted with hallucinated case citations demonstrated.

Vectara's approach is a promising step toward building checks and balances for the information generative AI creates. How often do the platforms return hallucinations when working from sets of known data, and how skewed is the information they return?
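Measured this way, a platform's hallucination rate is simply the fraction of its outputs that a consistency scorer flags as unsupported by the source material. A hedged sketch of that bookkeeping (the scorer and the 0.5 threshold are illustrative placeholders, not Vectara's actual pipeline):

```python
# Illustrative only: estimate a hallucination rate over (source, summary)
# pairs the way a leaderboard-style benchmark might. score_consistency and
# the 0.5 threshold are hypothetical placeholders.
from collections.abc import Callable

def hallucination_rate(
    pairs: list[tuple[str, str]],
    score_consistency: Callable[[str, str], float],
    threshold: float = 0.5,
) -> float:
    """Fraction of summaries whose consistency score falls below threshold."""
    if not pairs:
        return 0.0
    flagged = sum(
        1 for source, summary in pairs
        if score_consistency(source, summary) < threshold
    )
    return flagged / len(pairs)

# Usage with the CrossEncoder from the previous sketch:
# rate = hallucination_rate(pairs, lambda s, h: float(model.predict([[s, h]])[0]))
```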

I imagine we'll see more models like this developing quickly, which creates another dynamic: who decides what is a hallucination and what is not? As with the factual debates in the social media realm, who is the arbiter of truth, and on what information is that truth based?

Who decides what is a hallucination and what is not?

Cambridge Dictionary recently picked 'hallucinate' as its word of the year, citing the word's new AI sense. Some people object to the term because it anthropomorphizes AI, but it has taken hold in the common lexicon. At this point, most users are aware that AI isn't infallible and that it will even agree with false premises when challenged. Most platforms now label this possibility, and newer AI products like Snap's My AI make light of the behavior by apologizing in advance.

The bottom line is that AI is developing at a blistering pace, and it will not only hallucinate but also be managed in ways that may or may not align with facts. As AI platforms begin to interact with one another, this phenomenon will morph in unexpected ways. In this way, they are becoming more human than not.



Oluranti Owoseeni

Web3 Educator | Storyteller | Digital Growth Strategist

11 months ago

These are profound thoughts, I hope developers are looking into checking these errors to prevent the machines from morphing into us

Oluranti Owoseeni

Web3 Educator | Storyteller | Digital Growth Strategist

11 months ago

Very interesting topic

Amanda Sturgill

Storytelling; Analytics; Measurement; Multimedia Content Strategy | Teaching all of the above

1 year ago

We're dealing with this quite a bit in higher ed. You have to know what truth is before you can tell if it is what you are getting.

Andrés Susarret

Meeting customer needs globally by managing Professional Services, Technical Support, and Customer Success Management | Recognizing the Voice of the Customer and enhancing the Customer Experience

1 year ago

Hallucination is not a purely random or uncontrolled event. If you are working on putting an LLM-based generative AI component into your workflow, you need to know not only the source of the model but also the context window size (how much the model can process in a single query), the training/tuning methods, and other factors. You can design your solution so that hallucination is simply not a factor. That being said, one risk of relying on a supplier like OpenAI for a product like GPT-4 is that they won't divulge some of the key information you need for a more deterministic design, can change the tuning at any time, and can become unavailable or overloaded at any time.
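A minimal sketch of the grounded design this comment describes: constrain the model to answer only from supplied context, and check the prompt against the context window before sending it. Everything here is a hypothetical placeholder (the completion function, the token budget, and the token estimate), not any vendor's actual API:

```python
# Hypothetical sketch of a grounded-generation guard. llm_complete,
# CONTEXT_WINDOW_TOKENS, and estimate_tokens are placeholders; substitute
# your provider's real client, limit, and tokenizer.
CONTEXT_WINDOW_TOKENS = 8_192  # assumed budget; check your model's real limit

def estimate_tokens(text: str) -> int:
    # Crude ~4-characters-per-token heuristic; use the provider's
    # tokenizer for an exact count.
    return len(text) // 4

def build_grounded_prompt(question: str, context_docs: list[str]) -> str:
    context = "\n\n".join(context_docs)
    return (
        "Answer ONLY from the context below. If the answer is not in the "
        "context, reply exactly: 'Not found in the provided documents.'\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )

def llm_complete(prompt: str) -> str:
    raise NotImplementedError("Replace with your provider's completion call.")

def ask(question: str, context_docs: list[str]) -> str:
    prompt = build_grounded_prompt(question, context_docs)
    if estimate_tokens(prompt) > CONTEXT_WINDOW_TOKENS:
        raise ValueError("Prompt exceeds the assumed context window; "
                         "retrieve fewer or shorter documents.")
    return llm_complete(prompt)
```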

Matthew Z.

Logistics Ambassador who is Logistically Obsessed | Co-Founder MonarKonnect

1 year ago

You know, I read these posts, have a great thought, go to comment, and there are John's comments, which echo my thoughts lol
