Top 5 Open Medical AI highlights
Bart De Witte
Founder HIPPO AI Foundation | Keynote Speaker, Lecturer | Social Entrepreneur | Desirable Future-ist | LinkedIn Top Healthcare Voice | Digital Health | Medical AI | Open Source
May2024
Summary:
1. Open Pathology AI
Microsoft has released an open-foundation model for whole-slide pathology. The training data hasn’t been opened and the license model used way too restricted, but this is a big leap forward and confirms one of the hypotheses I made a few years ago; the future of medical AI is open.
Strengths:
Weaknesses:
Comment:
When Meta released their first LLaMa model, it was released under a non-commercial, for research only license. Meta's licensing decisions have sparked debates within the open-source community, with some arguing that Llama's licenses do not fully align with open-source principles due to the commercial restrictions and lack of transparency around training data and code. GigaPath also isn’t open source, they restricted the license and did not share the training data, still, by publishing this model, it will grow the community and inspire others to take openness one step further. One interesting sentence in the license that strictly prohibits the use of any personal data included in the Materials for any purpose other than the authorized research, requires maintaining strict confidentiality of such data, and mandates its immediate destruction upon completion of the research to prevent re-identification. Although no personal data hasn’t been shared, it seems they want to protect them from data leakage or users crafting various techniques to reveal information about its training data. From the Data Repo, it seems the data donators came from Karolinska and Raboud, both European well knows university medical centres.
2. OpenBioLLM-70B leads on
OpenBioLLM-70B delivers SOTA performance, while the OpenBioLLM-8B model even surpasses GPT-3.5 and Meditron-70B! The models underwent a rigorous two-phase fine-tuning process using the LLama-3 70B & 8B models as the base and leveraging Direct Preference Optimization (DPO) for optimal performance.
You can download the models directly from Huggingface today.
- 70B : https://huggingface.co/aaditya/OpenBioLLM-Llama3-70B?- 8B : https://huggingface.co/aaditya/OpenBioLLM-Llama3-8B
Summarize Clinical Notes :
领英推荐
OpenBioLLM can efficiently analyze and summarize complex clinical notes, EHR data, and discharge summaries, extracting key information and generating concise, structured summaries
Use Cases: De-Identification, Biomarkers Extraction, Medical Classification, Clinical Entity Recognition, Answer Medical Questions, Summarize Clinical Notes
3. Omi Phi-3-mini-4k-instruct superior to GPT-4
Sum Small is a small language models (SLMs) with groundbreaking performance at low cost and low latency, specifically designed to generate SOAP summaries from medical dialogues. It is a fine-tuned version of the Microsoft/Phi-3-mini-4k-instruct using the Omi Health/medical-dialogue-to-soap-summary dataset. This model demonstrates superior performance compared to larger models like GPT-4.
This model is intended for research and development in AI-powered medical documentation. It is not ready for direct clinical use without further validation and should be integrated with additional safety guardrails before deployment in a medical setting. The model was trained on the Omi Health's synthetic medical-dialogue-to-soap-summary dataset, which consists of 10,000 synthetically generated dialogues and corresponding SOAP summaries.
The Sum Small model is released under the MIT License, which permits broad use with fewer restrictions, making it accessible for both commercial and non-commercial use.
4. Risks and Opportunities of Open-Source Generative AI
An excellent paper from a Champions Team league of Phd Researchers (Oxford, Berkeley, University of Luxembourg,..) argues that the benefits of open-sourcing generative AI outweigh the marginal increase in risks compared to closed-source models. They want that over-regulation could be catastrophic to open-source Gen AI.
Benefits of Open-Source Generative AI:
Risks of Open-Source Generative AI:
In the long-term, the paper suggests that open-sourcing AGI (if achieved) could help increase the likelihood of technical alignment, maintain a balance of power, and enable better decentralized coordination mechanisms - all of which can help mitigate existential risks. It also discusses the potential benefits and non-existential risks of open-sourced AGI.
5. Measuring the Openness of AI Foundation Models: Competition and Policy Implications
Another recent paper presents a comprehensive methodology for measuring the openness of AI foundation models, considering technical, economic, legal, and social factors. The analysis covers 11 prominent AI foundation models, providing a detailed ranking based on 18 variables across three key areas: knowledge problem, implicit contracting problem, and collective action governance problem. The findings challenge the common perception of a clear divide between "open" and "closed" AI models, showing a more nuanced spectrum of openness. Although closed-source AI models pose greater anti-competitive risks, making them a clear target for antitrust scrutiny, as open models are more transparent, can be forked, and do not allow the same leveraging of market power. The implications for antitrust enforcers highlight how the degree of openness can serve as a guide for identifying potential anti-competitive risks.
Strengths of open-source AI:?
Weaknesses of open-source AI:?
Talk AI with me
4 个月Wow this is wild - I’m a bit concerned with this whole branding of open source ? It seems to me the temptations to crowdsource novel ideals and models have already shown too profitable to keep that ethos alive when there are billions ?? on the table. Example : example so far that we are publically aware of is openAI and mistral selling out Microsoft?