GPT-4 Significantly Exceeds Doctors In Assessing Eye Problems
Margaretta Colangelo
Leading AI Analyst | Speaker | Writer | AI Newsletter 57,000+ subscribers
"We could realistically deploy AI in triaging patients with eye issues to decide which patients need to be seen by a specialist immediately."
Dr. Arun Thirunavukarasu, Lead Author, Academic Foundation Doctor at?Oxford University Hospitals NHS?Foundation Trust
This week researchers at University of Cambridge School of Clinical Medicine and Oxford University Clinical Academic Graduate School published a paper demonstrating that AI is much better than non-specialist doctors at assessing eye problems. Lead author Arun Thirunavukarasu, PhD, says that large language models (LLMs) have the potential to improve healthcare by triaging patients with eye problems
Study Highlights
Results
Examination performance in the 87-question mock exam: The study tested GPT-4, GPT-3.5, PaLM2, LLaMA, expert ophthalmologists (E1-E5), ophthalmology trainees (T1-T3), and unspecialized junior doctors (J1-J2) with the same set of questions. GPT-4 gave more accurate responses than all of the LLMs.
2) Significantly greater performance for GPT-4 is highlighted green, significantly inferior performance for GPT-4 is highlighted orange. GPT-4 was superior to all other LLMs and unspecialized junior doctors, and equivalent to most expert ophthalmologists and all ophthalmology trainees.
3) Question subject and type distributions presented alongside scores attained by LLMs, expert ophthalmologists (E1-E5), ophthalmology trainees (T1-T3), and unspecialized junior doctors (J1-J2).
领英推荐
4) Agreement correlates strongly with overall performance and stratification analysis found no particular question type or subject was associated with better performance of LLMs or doctors, indicating that LLM knowledge and reasoning ability is general across ophthalmology rather than restricted to particular subspecialties or question types.
5) Accuracy and relevance of GPT-3.5 and GPT-4 in response to ophthalmological questions.
Reference
Thirunavukarasu, A J et al: ‘Large language models approach expert-level clinical knowledge and reasoning in ophthalmology: A head-to-head cross-sectional study.’ PLOS Digital Health, April 2024. DOI: 10.1371/journal.pdig.0000341
Subscribe, Comment, Join Group
I'm interested in your feedback - please leave your comments.
To subscribe to the AI in Healthcare Milestones newsletter click here.
To join the AI in Healthcare Milestones Group click here.
Copyright ? 2024 Margaretta Colangelo. All Rights Reserved.
This article was written by Margaretta Colangelo. Margaretta is a leading AI analyst who tracks significant milestones in AI in healthcare. She consults with AI healthcare companies and writes about some of the companies she consults with. Margaretta serves on the advisory board of the AI Precision Health Institute at the University of Hawai?i?Cancer Center @realmargaretta
Eh Me and and the Crew Now Too (International Space Station)
10 个月Can Someone Help Me Clarify Why This is code region is Missing from the NCBI since this is suppose to be the HUMAN GENOME PROJECT of 1989?
Territory Director at Acera Surgical, Inc
11 个月It won’t stop here, either. Generative AI will completely transform the ability to diagnose and even make entire treatment protocols obsolete. Concerning for both medical professionals and big business but ultimately great for the patient.
Trauma Healing Coach, Therapist, Social Work Professor, Business Consultant, Writer. I help people and organizations thrive.
11 个月Wow thanks for sharing!
Vice Chair, Digital & Integrative Pathology; Vice Chair, Clinical Outreach; Director, Anatomic Pathology; Professor at The University of Texas Medical Branch
11 个月Excellent summary of a very nice study. All of medicine is being disrupted at an exponential rate in front of our eyes.
spe salvi
11 个月Fabio Scacciavillani