Why does ChatGPT provide inaccurate citations?
DALL·E 2: AI Biographer

I wanted to see how well ChatGPT understands public figures and how accurately it can report biographical information.

For my article on breathwork, I asked the AI to summarize the biography of Wim Hof and to include his Guinness World Records. I was happy to see an accurate description.


I was glad to see it also cited some of the media outlets that have covered Hof, so I wanted to find some of those sources.

It seems ChatGPT has fairly poor short-term memory and didn't fully grasp who the "him" in my request referred to. It shared a number of Guinness World Records related to cold exposure that didn't match the records it had provided above.


Those weren't quite the news articles I was looking for, so I refined my prompt to be more specific. The response appeared to be successful, returning articles from The Telegraph, The Guardian, the BBC, and CNN.


However, as I clicked on the links...none of them worked!


Interestingly, each of those publications has actually covered Wim Hof's achievements, and the titles of the articles are factually accurate. But, as in my prior experience using ChatGPT as a research assistant, none of the articles it cited is real!

When I searched for the article titles the AI provided, I found that none of those headlines has ever run in those publications. Rather, ChatGPT generated article names and links that plausibly could exist...but each one was a complete fabrication.

Why does ChatGPT provide inaccurate citations?

This led me to explore this limitation of ChatGPT further. Headlines such as "ChatGPT Is Dumber Than You Think" in The Atlantic and "AI Platforms like ChatGPT Are Easy to Use but Also Potentially Dangerous" in Scientific American are interesting but exaggerated and sensational, and they didn't really reveal the source of the problem. It wasn't until I dove into a Y Combinator Hacker News thread that I started to understand the issue better.

The key idea is that ChatGPT is a language model, not a knowledge model. One user explains, "I've seen it referred to as 'stochastic parroting' elsewhere, and that probably gives more insight into what is happening. These large language models are trained to predict the next word for a given input. And they don't have a choice about this; they must predict the next word, even if it means that they have to make something up."
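
To make that next-word-prediction point concrete, here is a toy sketch in plain Python. This is not ChatGPT's actual code; the vocabulary, scores, and URL strings are all invented for illustration. What it shows is that decoding always selects some token from a probability distribution, with no built-in "I don't know" option, so a fluent-looking citation comes out whether or not it exists.

import math
import random

# Hypothetical continuations after a prompt ending in "...as reported at https://"
vocab = [
    "www.theguardian.com/some-real-2015-article",  # happens to exist
    "www.theguardian.com/wim-hof-iceman",          # plausible but invented
    "www.bbc.com/news/wim-hof-records",            # plausible but invented
]
logits = [1.1, 1.4, 1.2]  # raw scores; a fluent-looking fake can score highest

def softmax(scores):
    # Convert raw scores into a probability distribution over the vocabulary.
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Sampling must return a token, so a confident-looking URL is emitted
# whether or not it exists anywhere on the internet.
next_token = random.choices(vocab, weights=softmax(logits), k=1)[0]
print("next token:", next_token)

Real models do this over tens of thousands of subword tokens, one token at a time, which is why the fabricated URLs read so plausibly.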

Another user expands, "These models are extremely good when they have creative freedom and are used to produce some kind of art. Poems, text in various styles, images without important details, and so on. But they fail miserably when you give them tasks that require world knowledge or precision."

So it's not that ChatGPT is dumb or dangerous; rather, our expectations are misguided about how ChatGPT works and what it can currently be used for.

Key Takeaways

  1. ChatGPT is a great creative writer and can output a good biography. As a biographer, ChatGPT did a decent job researching key facts about a public figure and reporting back a synthesized paragraph citing their achievements. In fact, I fed ChatGPT my resume and the key points I use in my bio, and I was really happy with the short bio the AI generated. It was concise and well written. I could definitely see tools being developed to write effective bios and resumes.
  2. ChatGPT is a language model, and not a knowledge model. The most surprising finding so far is the AI's inability to provide citation links. This seems like such an easy task: simply report back an article title and link that exists on the internet. Yet it completes a much more complex and fraudulent task: it makes up article titles and URLs that seem like they should exist, but they don't. This undermines the credibility of ChatGPT for the public, and even understanding its limitations, I am still wary of trusting the AI's accuracy in other ways. The implication of this shortcoming is that we must manually fact-check any output of ChatGPT before using the generative copy it develops (a minimal link-check sketch follows this list).
  3. GPT-4 may put this issue in the past. The current GPT-3 model has 175 billion parameters. Comparatively, the upcoming GPT-4 model is rumored to have as many as 100 trillion parameters! Some speculate that the training set for GPT-4 is equivalent to nearly 25% of the entire public internet. With this larger base for learning, we may move toward something that feels more like a knowledge model as it's trained to provide more comprehensive and accurate output.
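
As promised above, here is a minimal sketch of that manual fact-checking step, using only the Python standard library. The URLs below are placeholders standing in for whatever links ChatGPT returns, and the check has limits: a successful response only proves a link resolves, not that the headline actually ran in that publication.

import urllib.error
import urllib.request

def url_resolves(url: str, timeout: float = 10.0) -> bool:
    """Return True if the URL answers with a success or redirect status."""
    # Some sites reject HEAD requests; a GET fallback could be added.
    req = urllib.request.Request(
        url, method="HEAD", headers={"User-Agent": "citation-check/0.1"}
    )
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (OSError, ValueError):  # covers URLError, HTTPError, timeouts, bad URLs
        return False

# Placeholder citations; paste the links ChatGPT returned here.
citations = [
    "https://www.theguardian.com/",              # real domain, should resolve
    "https://www.theguardian.com/made-up-path",  # fabricated path, should fail
]
for url in citations:
    verdict = "OK" if url_resolves(url) else "BROKEN (possibly fabricated)"
    print(f"{verdict:28} {url}")

Verifying that a headline itself was ever published still requires manually searching the publication's archive or a search engine.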

Marquis Rosen

US Marine Veteran Turned Entrepreneur: Leading unique Web Design, Event Hosting, and Non-Profit Transformations with Vibrant Visions Consulting and DDR Sanctuary.

1y

Thanks for sharing


Now complete the circle. Assign a machine reader to assess your machine-written piece.

Les Shute

AI Innovation Leader | Chief Innovation Officer @ Infinity Nurse Case Management | Generative AI & Digital Transformation | Microsoft Certified

1y

Great insights, Josh! I totally agree that this train has left the station and will continually improve over a relatively short time. What some thought would be impossible until decades from now is actually happening.

Almira Osmanovic Thunström

Developer, Innovator and AI Researcher

1y

Most peer-reviewed work and documentation is behind paywalls, and LLMs are trained mainly on open crawl, so they have little of value to learn from. GPT-3.5 is a partial exception, since arXiv was included in its training data, but those sources are all outdated. That is my theory :)

Valentino Megale, PhD

Startup founder innovating #healthcare and pain management with #XR tech at Softcare Studios | Digital Safety & Privacy at XRSI | Startup mentor & Lecturer on Emerging Technologies | PhD in Neuropharmacology

1y

In the meantime, https://www.perplexity.ai/ looks like a useful tool for filling the gap around references.

要查看或添加评论,请登录

Josh Sackman的更多文章

社区洞察

其他会员也浏览了