登录查看更多内容

LLMs aren’t good enough at diagnostics yet. What will it take to get them there?

COTA

Together, we can bring clarity to cancer care.

发布日期: 2024年1月24日

Large language models (LLMs) are the foundation of highly hyped generative AI tools like Chat-GPT and Google Bard.? These models use enormous amounts of data scraped from the internet and other sources to produce eerily natural-seeming responses to everything from poetry prompts to homework help, all with simple, intuitive interfaces that anyone can learn to use.

It hasn’t taken long for healthcare and life sciences organizations to start exploring how to use these capabilities to assist with clinical care, particularly in the realm of diagnostics.? LLMs have the ability to synthesize vast quantities of structured and unstructured data in novel and unexpected ways, making them potentially ideal AI doctors to supplement the flesh-and-blood workforce.

However, these models aren’t perfect.? In fact, research is increasingly showing that they are far from it.? A newly published study in JAMA Pediatrics found that Chat-GPT misdiagnosed 72 of 100 pediatric medical cases selected.? For another 11 queries, it delivered a diagnosis too broad to be considered correct.

The tool wasn’t able to identify certain key relationships between diagnoses and underlying factors, and only managed to guess the right organ system involved in the diagnosis less than half the time.

The study builds on previous evidence showing that LLMs aren’t quite prepared for prime time diagnostics, especially when asked to take on complex specialties like oncology.? For example, a team from Switzerland recently found that Chat-CPT’s answers for more than 20% of open-ended questions about radiation oncology were considered “bad” or “very bad” by a team of human clinical reviewers.

And researchers from Mass General, Boston Children’s and Memorial Sloan Kettering found that LLMs recommended one or more non-NCCN-concordant treatment suggestions in more than a third of queries?(34.3%). The model also offered “hallucinations,” or suggestions that were not part of any recommended treatment, in 12.5% of cases.? In addition, the model’s output differed significantly based on how the question was written, adding confusion to the equation.

This doesn’t mean that LLMs are hopeless and should be abandoned for diagnostic decision support.? Instead, it’s a sign that we need to pour more resources into training and tuning large language models to unlock their full potential.??

Bertalan Meskó, MD, PhD 5 个月前

The Extraordinary Ways Elsevier Uses Artificial…

Bernard Marr 5 年前

The Evolution of Healthcare Industry & Role of AI

Muhammad Irfan 8 个月前

Chat-GPT and its publicly available competitors are generalized models that have an acceptable level of competency across many, many areas.? To make a difference in healthcare and life sciences, we need to go deep into narrower datasets focused on the insights that matter most.?

That means developing a larger number of clean, complete, accurate, and representative datasets that combine medical records and clinical trial information with real-world data (RWD) and real-world evidence (RWE) about clinical care and drug efficacy.

This data must be drawn from multiple sources and represent longitudinal patient journeys while being carefully curated with strong adherence to shared data governance principles.

Model trainers will also need to be cautious about introducing unintentional biases into the data, which can happen with poor quality medical literature, outdated care protocols, or studies that are not inclusive and representative of all communities. Training should also involve a variety of human clinical annotators with broad experience treating different patient populations to avoid creating an undesirable loop of confirmation bias.

Creating oncology specific LLMs that are up to the task of real-world clinical diagnosis will be a long-term project for technology developers, data companies, and clinical partners.? Achieving the best results will start with zeroing in on meaningfully curated real-world data that can support bias-free training at scale.

With high-quality fuel for the fire, LLMs could soon become a valuable addition to the clinical and life sciences toolkits and support improved outcomes for patients with cancer and other conditions.

Ajay Rathod [ITIL Master, LSS Black Belt]

10 个月

This article and thoughts are right on the money..! Narrowing down to each subject area will benefit both healthcare professionals and patients. The key is to get an active involvement of HCPs apart from the huge amount of articles available on the web today

1 次回应

Rob Bradley

RWD & AI | Healthcare Data Collaboration & Partnerships @ OM1

10 个月

Accessing LARGE volumes of high quality data is the only way for LLMs to be successful. LLMs won't be successful if we can't address the 'large' and 'high quality' data they require. #dataquality #datasharing Karlsgate

1 次回应

查看更多评论

要查看或添加评论，请登录

LLMs aren’t good enough at diagnostics yet. What will it take to get them there?

COTA

Together, we can bring clarity to cancer care.

领英推荐

更多精彩文章

社区洞察

其他会员也浏览了

Leveraging Computer Vision For Monitoring Alzheimer's Disease Progression

Interesting reads ... July 2024

How Artificial Intelligence and Machine Learning Might Revolutionize Parkinson's Disease Trials

Transforming Healthcare through Generative AI: Revolutionizing Patient Care and Medical Innovation

Healthcare Present & Future Journey in 2023

Revolutionizing Healthcare: How AI Is Transforming Patient Care in Hospitals

The Role of AI in Revolutionizing Early Diagnosis: How Machine Learning is Transforming Healthcare

Latest AI Breakthroughs in Heart Attack and Cardiac Arrest Prevention

Revolutionizing Treatment: The Convergence of Precision Medicine and AI

Machine Learning’s Transformative Role in Tackling Alzheimer’s Disease

领英推荐

The value of high quality RWD for accurate healthcare analytics

2024年9月24日

Who, what, when…and especially where: Why care settings matter for real-world datasets

2024年6月26日

Introducing CAILIN: The Latest Development from COTA’s AI Lab

2024年6月20日

COTA and Hackensack Meridian Health at ASCO 2024

2024年5月31日

Real-world data is helping to fine-tune care decisions for multiple myeloma patients

2024年5月24日

Exploring standards of care in real-world settings with RWD

2024年4月3日

Registries, claims, and EHR data: What’s the difference between these data sources for life sciences research?

2024年3月12日

3 traits to look for in an oncology real-world data partner

2024年1月30日

For the hematology community, AI and RWD are top of mind

2024年1月26日

What’s in store for 2024? Generative AI, digital health funding, and real-world data

2024年1月16日