A Survey on Hallucination in Large Language Models. In this paper, the authors examine recent efforts to detect, explain, and mitigate hallucinations in Large Language Models (LLMs). They introduce a taxonomy of LLM hallucination phenomena, review evaluation benchmarks, analyze current strategies for reducing LLM hallucination, and suggest potential avenues for future research.
I read the paper out of curiosity to understand the causes of hallucination, whether there's a remedy, and if prompting can reduce or even eliminate hallucination.
To access the paper, click here: https://arxiv.org/abs/2309.01219
Despite their impressive performance across a range of tasks, LLMs occasionally produce hallucinations. Hallucination refers to moments when LLMs generate content that:
1. Deviates from the user's input.
2. Conflicts with previously generated content.
3. Is inconsistent with accepted world knowledge.
These hallucinations challenge the dependability of LLMs in practical applications.
Some insights into LLM hallucination include:
- Training Data Distribution: The likelihood of LLM hallucination correlates with the training data's distribution. If biases or errors are present in the training data, the model may reproduce or even exaggerate these errors.
- Issues in Human-Generated Content: Hallucinations can also originate from the human-generated data used for training, which may contain outdated information, biases, or even fabricated statements. Since LLMs learn from these datasets, they can absorb and replicate such errors.
- Knowledge Memorization Abilities: This refers to an LLM's ability to remember, retrieve, and reason with the vast amount of information it's trained on. Shortcomings in these abilities can result in hallucinations.
- Overestimation of Abilities: LLMs sometimes overestimate their own capabilities. They can judge the accuracy of their answers to a degree, but they do not reliably recognize the limits of their knowledge, so they may state incorrect answers with high confidence.
Strategies to Reduce Hallucinations in LLMs
Mitigation during Pre-training:
- Knowledge Acquisition from Credible Sources: Gathering accurate and current knowledge from trustworthy sources can "hot patch" LLMs, updating their knowledge and correcting any outdated or false information.
- External Knowledge Bases: Data can be sourced from external knowledge repositories, such as vast unstructured datasets, structured databases, specific sites like Wikipedia, or the entire web.
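To make the "credible sources" idea concrete, here is a minimal curation sketch (my own illustration, not code from the paper) that keeps only pre-training documents whose URLs come from an allowlist of trusted domains. The TRUSTED_DOMAINS set, the document format, and the curate_corpus helper are all hypothetical placeholders.

```python
from urllib.parse import urlparse

# Hypothetical allowlist of domains treated as credible sources.
TRUSTED_DOMAINS = {"en.wikipedia.org", "arxiv.org", "www.nature.com"}

def curate_corpus(documents):
    """Keep only documents whose source URL belongs to a trusted domain.

    `documents` is assumed to be an iterable of dicts with "url" and "text" keys.
    """
    kept = []
    for doc in documents:
        domain = urlparse(doc["url"]).netloc
        if domain in TRUSTED_DOMAINS and doc["text"].strip():
            kept.append(doc)
    return kept

if __name__ == "__main__":
    raw = [
        {"url": "https://en.wikipedia.org/wiki/Language_model", "text": "A language model is ..."},
        {"url": "https://random-blog.example.com/post", "text": "LLMs were invented in 1950 ..."},
    ]
    print(f"kept {len(curate_corpus(raw))} of {len(raw)} documents")
```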
Optimizing Model Architecture:
- Multi-branch Decoder (MBD): This technique uses a multi-branch decoder to potentially diminish hallucinations. What is an MBD? Imagine you're trying to solve a puzzle that has multiple possible approaches. Instead of following just one, you explore several paths at the same time to see which works best. An MBD does something similar for language models: it lets the model generate multiple candidate outputs simultaneously and then choose the best or most accurate one. Exploring these different "branches" can help reduce mistakes, such as hallucinations, in the model's outputs (a minimal sketch follows this list).
- Uncertainty-aware Decoder: This method exposes the model's prediction uncertainty during decoding so that low-confidence outputs can be recognized and handled, counteracting hallucinations (a toy sketch also follows this list).
- Bidirectional Autoregressive Architecture: This strategy models language both forwards and backwards. Leveraging bidirectional context in this way might help decrease hallucinations.
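A note on the Multi-branch Decoder bullet above: a real MBD changes the decoder architecture itself, which can't be reproduced in a few lines. The sketch below only mimics the behaviour at the output level. The generate, agreement, and multi_branch_answer functions are placeholders of my own; generate stands in for one decoding branch, and the most mutually consistent candidate is kept.

```python
import random

def generate(prompt: str, seed: int) -> str:
    """Placeholder for one decoding branch; a real system would call an LLM here."""
    random.seed(seed)
    candidates = [
        "Paris is the capital of France.",
        "Paris is the capital of France, founded in 1850.",  # hallucinated detail
        "The capital of France is Paris.",
    ]
    return random.choice(candidates)

def agreement(a: str, b: str) -> float:
    """Crude token-overlap (Jaccard) score standing in for a learned consistency metric."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb)

def multi_branch_answer(prompt: str, branches: int = 5) -> str:
    """Decode along several 'branches' and keep the answer that agrees most with the others."""
    outputs = [generate(prompt, seed=i) for i in range(branches)]
    return max(outputs, key=lambda o: sum(agreement(o, other) for other in outputs))

print(multi_branch_answer("What is the capital of France?"))
```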
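For the Uncertainty-aware Decoder bullet, here is a toy sketch, assuming access to the model's next-token probability distributions. The token_entropy and flag_uncertain_tokens helpers are hypothetical, and the tokens and distributions below are made up: it computes the entropy of each distribution and flags high-entropy tokens as spans the model is unsure about and may be hallucinating.

```python
import math

def token_entropy(probs):
    """Shannon entropy of one next-token distribution (higher = more uncertain)."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def flag_uncertain_tokens(tokens, distributions, threshold=1.0):
    """Return tokens paired with a flag when the model's predictive entropy is high."""
    return [(tok, token_entropy(dist) > threshold) for tok, dist in zip(tokens, distributions)]

# Toy example: the model is confident about "Paris" but unsure about the invented year.
tokens = ["Paris", "was", "founded", "in", "1850"]
distributions = [
    [0.97, 0.02, 0.01],        # low entropy -> confident
    [0.90, 0.05, 0.05],
    [0.80, 0.10, 0.10],
    [0.85, 0.10, 0.05],
    [0.25, 0.25, 0.25, 0.25],  # high entropy -> flag as possibly hallucinated
]
for tok, uncertain in flag_uncertain_tokens(tokens, distributions):
    print(f"{tok}: {'UNCERTAIN' if uncertain else 'ok'}")
```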
Mitigation during Inference:
- Generation-time Supplement: This method appends retrieved knowledge or feedback to the user query before prompting the LLM. It capitalizes on the LLM's in-context learning ability, allowing the model to glean and use crucial information from the context to correct false claims (a sketch follows this list).
- Post-hoc Correction: This strategy employs an auxiliary "fixer" to amend hallucinations during post-processing. After the LLM produces an initial answer, the fixer inspects and rectifies any errors or hallucinations (a sketch of this pattern also follows).
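Here is a minimal sketch of the Generation-time Supplement idea (my illustration, not the paper's code): retrieve relevant text from an external knowledge source and prepend it to the user query before prompting the LLM. The tiny in-memory KNOWLEDGE_BASE, the keyword-overlap retrieve function, and build_prompt are stand-ins for a real retriever such as a vector index over Wikipedia.

```python
# A tiny in-memory stand-in for an external knowledge base such as Wikipedia.
KNOWLEDGE_BASE = [
    "GPT-4 was released by OpenAI in March 2023.",
    "The Eiffel Tower is located in Paris and was completed in 1889.",
]

def retrieve(query: str, k: int = 1):
    """Naive keyword-overlap retrieval; a real system would use a vector index."""
    q = set(query.lower().split())
    scored = sorted(KNOWLEDGE_BASE, key=lambda doc: -len(q & set(doc.lower().split())))
    return scored[:k]

def build_prompt(query: str) -> str:
    """Prepend retrieved evidence so the LLM can ground its answer in it."""
    evidence = "\n".join(retrieve(query))
    return f"Use only the context below to answer.\nContext:\n{evidence}\n\nQuestion: {query}\nAnswer:"

print(build_prompt("When was the Eiffel Tower completed?"))
```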
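And a sketch of the Post-hoc Correction pattern, assuming a generic llm helper that wraps whatever model API you use (the responses here are canned purely for illustration, and answer_with_fixer is my own hypothetical name): the first call drafts an answer, and a second "fixer" call reviews and rewrites it.

```python
def llm(prompt: str) -> str:
    """Placeholder for a call to any LLM API; responses here are canned for illustration."""
    if "Review the draft" in prompt:
        return "The Great Wall of China is not visible from space with the naked eye."
    return "The Great Wall of China is visible from space."  # hallucinated first draft

def answer_with_fixer(question: str) -> str:
    """Generate a draft, then ask a second 'fixer' pass to verify and correct it."""
    draft = llm(f"Question: {question}\nAnswer:")
    correction_prompt = (
        f"Review the draft answer below for factual errors and rewrite it if needed.\n"
        f"Question: {question}\nDraft: {draft}\nCorrected answer:"
    )
    return llm(correction_prompt)

print(answer_with_fixer("Is the Great Wall of China visible from space?"))
```

In practice the fixer often consults retrieved evidence or external tools rather than relying on the same model alone.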
How Prompting Can Minimize or Eradicate Hallucinations:
Chain-of-Thought Prompt: This technique asks the LLM to produce intermediate reasoning steps before giving the final response. By compelling the model to work through its answer sequentially, it may lower the likelihood of hallucination. However, the reasoning steps themselves can also be hallucinated. An example prompt template is sketched below.
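As an example of what such a prompt can look like, here is a small helper (hypothetical, not from the paper) that wraps a question in a chain-of-thought instruction:

```python
def chain_of_thought_prompt(question: str) -> str:
    """Ask the model to reason step by step before committing to a final answer."""
    return (
        "Answer the question. First lay out your reasoning step by step, "
        "then state the final answer on its own line prefixed with 'Answer:'.\n"
        f"Question: {question}\nReasoning:"
    )

print(chain_of_thought_prompt("A train leaves at 9:40 and the trip takes 95 minutes. When does it arrive?"))
```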
The key insight is that training on accurate, factually correct data can greatly diminish hallucinations in LLMs. Although the paper offers qualitative observations and cites multiple studies, it doesn't present specific numbers that directly correlate the accuracy of the training data with a decrease in hallucinations. Nonetheless, the cited studies support the idea that accurate data is crucial for mitigating hallucination.
#llm #llms #hallucination #chatgpt