Q* and its impact on the pharmaceutical industry
Shaun Tyler
Director Global Software Integration & AI Thought Leader at Koerber Pharma Software
Introduction
Over the last couple of days buzzwords like Q*, AGI, Super AGI etc. dominated AI-news and kicked off the next media hype 5 minutes after the OpenAI chaos-days. As a computer-scientist and a leader of a large and talented software development team, I always try to look beyond the hype. What is the potential of a new technology; how can it be used to empower the company I work for; How can it be used to maybe reduce the constant effort we have in the pharmaceutical industry to uphold GxP regulations and requirements, without reducing our quality but still be compliant? In today's article I would like to demystify Q*, explain why it's not AGI and why it will still impact especially our industry in the upcoming years.
Before we start our journey understanding Q, I would like to get a few things out of the way and that is: Why is Q* not an AGI. For that we need to understand what AGI actually is, or what we understand when talking about it.
Definition of AGI (Artificial General Intelligence):
Capabilities of Q* (as reported):
While Q* shows promise in certain aspects like problem-solving and potentially learning and adapting within its domain, it does not exhibit the full range of capabilities associated with AGI, such as broad adaptability, understanding, and reasoning across diverse domains, and human-like cognition.
The Foundations of Q*: A Deep Dive into Its Core
Introducing Reinforcement Learning: Central to Q* is Q-learning, a form of reinforcement learning. Reinforcement Learning (RL) is a critical area of machine learning where an agent learns to make decisions by performing actions in an environment and receiving feedback in the form of rewards or penalties. This feedback helps the agent learn which actions are most beneficial over time. The distinctive feature of RL is its focus on learning through interaction, rather than from a predefined dataset.
How Reinforcement Learning Works:
Reinforcement Learning's Connection to Q-Learning:
Q-Learning as an Evolution of Reinforcement Learning:
领英推荐
The Q-Learning Algorithm:
Q's Enhanced Learning Mechanism: Q* builds upon traditional Q-learning by incorporating advanced techniques like deep learning and neural networks. This enables Q* to handle more complex, higher-dimensional environments, making it suitable for applications like mathematical problem-solving.
Q and Determinism: Redefining Predictability in AI and what does it mean for Pharma?
The Essence of Determinism in AI:
In the context of artificial intelligence, determinism refers to the ability of an AI system to consistently produce the same output or result from a given input or set of conditions. This predictability is crucial in applications where consistency and reliability are of the essence like in the pharmaceutical industry.
Challenges in Traditional AI: Many AI models, especially those based on probabilistic algorithms or with a high degree of variability in learning, struggle with determinism. They might produce different outcomes under the same conditions due to inherent uncertainties in their learning processes often refered to as hallucinations in the context of chatGPT.
Q's Contribution to Enhanced Determinism:
Impact of Determinism in Q:
Q's Deterministic Approach: A Step Towards More Reliable AI:
Q's Enhanced Determinism and Its Impact on the Pharmaceutical Industry a possible outlook and conclusion
While Q* doesn't bring back Turing-Like determinism that is the foundation of so many GxP regulations for computerized systems it will have a huge impact on the way and especially the speed how fast we can introduce generative AI in the pharmaceutical sector.
Currently everything we do in regard to genAI integration in the pharmaceutical sector involves HITL, human in the loop, sometimes to the point that the benefit of LLM generated context will not be visible, will not even reduce time and effort because you can't trust a system in the pharma context that hallucinates, no matter how much you reduce the temperature depending on the task. Imagine a large recipe with a few hundred or even a thousand generated steps and you manually have to check each value, the benefit wouldn't be there. A while ago I intensively analyzed the GuardRails project that defines measurable guards and thus enforcing expected outcomes, while this one component, relying on a computerized system to not hallucinate is necessary, even is imperative for our industry.
I see in Q-Stars expected evolution not a fear of AGI and media hype. What I see is the potential to release the full impact of genAI for our industry, enabling us to create the next foundation of computerized systems that will most likely be very different, but will increase productivity exponentially.
Until next time
Thanks a lot for clarifying overview, Shaun. This introduction helps me a lot in current discussions.
MES Lead Engineer @ Emerson Automation Solutions
1 年Thank you Shaun. Right questions.