Avoid model hallucinations

This week in "2 minutes for...", we'll look at how to minimize hallucinations in language model responses. That's no mean feat, as recent news coverage of OpenAI in the EU attests. This post is the third in a series on prompt engineering, following "Mastering the Art of Prompting" and "5 key prompt engineering techniques using Claude".


What are hallucinations and why do models hallucinate?

Generative AI models like ChatGPT and DALL-E have revolutionized content creation, but a concerning phenomenon called "hallucinations" has emerged. Hallucinations occur when a generative AI confidently fabricates content that doesn't align with reality. A non-exhaustive list of factors behind these fabricated realities:

  • Misinformation in the training dataset: The internet contains a lot of inaccuracies, biases and flawed data.
  • Lack of understanding: While incredibly fluent, current AI models don't truly comprehend the information they process. They predict what text is likely to come next based on patterns, not knowledge.
  • Overconfidence: Instead of saying "I don't know," models may confidently hallucinate an answer just to match the query.

The worst part is probably the confidence the model projects, which is often misleading: the output sounds too believable to question, and is sometimes too complex to verify.


How to deal with hallucinations

Apart from avoiding pills with dubious effects, hallucinations can unfortunately never be completely eliminated. But I'm going to give you 4 techniques to apply in your prompts to minimize them as much as possible (a combined prompt sketch follows the list).

  1. Ask your language model to say "I don't know" if it doesn't know. You can also do this with your children, it works very well ;)
  2. Tell it to answer only if it is very confident in the answer.
  3. As indicated in the "5 key prompt techniques", ask your model to think step by step. This gives the model a chance to double-check itself. For example, you can ask it to reason inside <think> tags that you then strip from the final response.
  4. Ask the model to find relevant quotes from long documents, then answer using only those quotes.
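
Below is a minimal sketch of how these four techniques can be combined in a single prompt. It assumes the official anthropic Python SDK; DOCUMENT, QUESTION and the model name are placeholders for illustration, not something prescribed by this post.

import re
import anthropic  # pip install anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

DOCUMENT = "..."   # placeholder: the long document you want to query
QUESTION = "..."   # placeholder: the user's question

# One prompt combining the four techniques:
#   1. permission to say "I don't know"
#   2. answer only when confident
#   3. step-by-step reasoning inside <think> tags
#   4. extract supporting quotes before answering
prompt = f"""Here is a document:
<document>
{DOCUMENT}
</document>

Answer the question below using only the document.
- First, copy the most relevant quotes into <quotes> tags.
- Then reason step by step inside <think> tags.
- Only give an answer if you are confident the quotes support it; otherwise reply exactly "I don't know".
- Put your final answer in <answer> tags.

Question: {QUESTION}"""

response = client.messages.create(
    model="claude-3-5-sonnet-latest",  # example model name, use whichever you have access to
    max_tokens=1024,
    messages=[{"role": "user", "content": prompt}],
)
raw = response.content[0].text

# Strip the intermediate <quotes> and <think> blocks before showing anything to the user,
# keeping only what is inside the <answer> tags.
match = re.search(r"<answer>(.*?)</answer>", raw, re.DOTALL)
final_answer = match.group(1).strip() if match else "I don't know"
print(final_answer)

Falling back to "I don't know" when no <answer> tag is found is a deliberate choice: even if the model ignores the format, the user never sees unverified reasoning.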



Conclusions

You want to put genAI into your products, but hallucinations are bad for your startup. By implementing these techniques in your prompts, you'll be able to take your application from a not-so-reliable prototype to a production version that doesn't talk nonsense.


About Me

I help startups get through their journey: architecture on AWS, security, cost optimization, business development. In short, if you've got a great idea and a good team, don't hesitate to message me!
