How to leverage Generative AI for external-facing applications using Retrieval-Augmented Generation (RAG)
In my role as an architect at one of the most forward-looking companies in the world, I often discuss leveraging Generative AI for enterprise use cases with peers both inside and outside my company. While almost no one is opposed to the idea, I sense a palpable hesitancy when the conversation turns to external (customer- or end-user-facing) applications.
I agree with that sentiment.
Despite the rapid maturation of Large Language Models (LLMs) over the last couple of years, their knowledge is limited to the data they were pre-trained on. Even when they are fine-tuned with the latest ML techniques like Reinforcement Learning from Human Feedback (RLHF) and reward modeling, a model's knowledge remains a snapshot in time. More importantly, models lack exposure to data and information internal to an enterprise, which leads to generic and sometimes fabricated output, the latter commonly referred to as "hallucinations".
As you can imagine, a model's inability to output factual, in-context, and relevant information inhibits an enterprise from confidently exposing the generated content to external users like customers, partners, and distributors. This is especially critical for companies in highly regulated industries such as Healthcare, Life Sciences, Pharmaceuticals, Finance, Insurance, and Telecom.
This is where Retrieval-Augmented Generation (RAG) can help.
In this blog, we will discuss the underlying mechanism behind RAG and how it can be used in an enterprise setting to increase the quality and accuracy of content generated by an LLM so it can be used with a higher degree of confidence.
So what is RAG?
RAG, or Retrieval-Augmented Generation, is a cutting-edge AI framework that enhances large language models (LLMs) by incorporating external knowledge sources. LLMs are notorious for occasional inconsistencies because their grasp of language is statistical rather than grounded in verified facts.
I talked about the key considerations when deploying enterprise-scale Generative-AI at your organization in one of my previous blogs.
RAG strategically grounds an LLM in contemporary information it was not trained on, giving it an "open-book" approach to answering questions. This elevates the quality of responses, keeping them current and accurate because they are based on facts retrieved from reliable sources.
How does RAG work?
RAG starts with organizational data, from structured databases to unstructured documents, standardized into a knowledge repository for seamless integration with a Gen-AI system. This typically involves using an embedding model to convert the content into numerical representations (vectors) that are stored in a vector database, streamlining the retrieval process.
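To make this concrete, here is a minimal indexing sketch in Python, assuming the open-source sentence-transformers library. The model name, documents, and chunking strategy are all illustrative, and a production system would write to a purpose-built vector database rather than a saved NumPy array.

```python
# A minimal indexing sketch: chunk documents, embed each chunk, and
# persist the vectors. Model name, documents, and chunk size are all
# illustrative; a production system would use a real vector database.
from sentence_transformers import SentenceTransformer
import numpy as np

def chunk(text: str, size: int = 500) -> list[str]:
    """Naively split a document into fixed-size character chunks.
    Production pipelines usually split on sentence/section boundaries."""
    return [text[i:i + size] for i in range(0, len(text), size)]

# Hypothetical content pulled from internal enterprise repositories.
documents = [
    "Our return policy allows refunds within 30 days of purchase...",
    "Troubleshooting guide: if the device fails to start, check...",
]

model = SentenceTransformer("all-MiniLM-L6-v2")  # any embedding model works
chunks = [c for doc in documents for c in chunk(doc)]

# Normalizing the vectors lets a plain dot product act as cosine
# similarity at query time.
embeddings = model.encode(chunks, normalize_embeddings=True)
np.save("kb_vectors.npy", embeddings)  # stand-in for a vector database
```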
The actual process happens in two steps, retrieval and augmented content generation. First, algorithms extract pertinent information from the indexed, curated set of enterprise sources based on the user's prompt; that retrieved context is then appended to the prompt so the model can craft a grounded, bespoke response.
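The two steps might look like the following self-contained sketch. The chunk list is a stand-in for the indexed knowledge base built above, and call_llm is a placeholder for whichever hosted or in-house model endpoint your enterprise uses.

```python
# Retrieval + augmented generation in one self-contained sketch. The
# chunk list stands in for your indexed knowledge base, and call_llm is
# a placeholder for whatever model endpoint your enterprise uses.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
chunks = [
    "Our return policy allows refunds within 30 days of purchase.",
    "Troubleshooting: if the device fails to start, check the fuse.",
]
embeddings = model.encode(chunks, normalize_embeddings=True)

def retrieve(query: str, k: int = 2) -> list[str]:
    """Step 1 (retrieval): rank chunks by cosine similarity to the query."""
    q = model.encode([query], normalize_embeddings=True)[0]
    top = np.argsort(embeddings @ q)[::-1][:k]
    return [chunks[i] for i in top]

def call_llm(prompt: str) -> str:
    """Placeholder: swap in your hosted or in-house LLM's API call."""
    return f"[model response to a {len(prompt)}-character prompt]"

def answer(query: str) -> str:
    """Step 2 (augmentation): ground the prompt in the retrieved facts."""
    context = "\n\n".join(retrieve(query))
    prompt = ("Answer using only the context below; say 'I don't know' "
              f"if it is not covered.\n\nContext:\n{context}\n\n"
              f"Question: {query}")
    return call_llm(prompt)

print(answer("How long do customers have to return a product?"))
```

Instructing the model to answer only from the supplied context is what turns retrieval into grounding: the response stays tied to verifiable enterprise facts instead of the model's pre-training snapshot.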
What data sources to use for RAG?
Given that the idea behind RAG is to have your LLM access content that can be verified and trusted, you must identify the sources of data and information within your enterprise that it can use.
Some examples could include your ERP databases housing financial transactions, supply chain data, and other operational insights, or your CRM systems with proprietary customer databases detailing customer interactions, preferences, transaction history, and more. They could also be internal document repositories storing corporate policies, project specifications, technical documentation, troubleshooting guides, best practices, and team collaborations.
For a use case such as an AI "co-pilot" for a customer service team, you could use data from your ticketing system: the history of customer interactions, chat transcripts, call logs, and documented resolutions. This helps your human agents provide accurate, consistent, and up-to-date information more efficiently, leading to higher customer satisfaction.
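As a hypothetical illustration, records from such a ticketing system could be flattened into plain text before chunking and embedding. Every field name below is an assumption, not any particular vendor's schema.

```python
# A hypothetical normalization step: flatten heterogeneous records
# (tickets, CRM rows, policy docs) into one text shape for the knowledge
# base. Every field name here is an assumption, not a vendor schema.
from dataclasses import dataclass

@dataclass
class KnowledgeRecord:
    source: str   # e.g. "ticketing", "crm", "policy-repo"
    doc_id: str
    text: str     # the content that will later be chunked and embedded

def from_ticket(ticket: dict) -> KnowledgeRecord:
    """Flatten a support ticket and its resolution into plain text."""
    text = (f"Issue: {ticket['subject']}\n"
            f"Transcript: {ticket['transcript']}\n"
            f"Resolution: {ticket['resolution']}")
    return KnowledgeRecord("ticketing", ticket["id"], text)

ticket = {"id": "T-1042", "subject": "Login failure",
          "transcript": "Customer could not log in after a password reset.",
          "resolution": "Cleared stale session tokens; advised cache reset."}
print(from_ticket(ticket).text)
```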
How is RAG applied?
While there are many use cases where RAG can be a game-changing technique, let me touch on a couple that I can see from my industry background.
What to watch out for before using RAG?
Before embarking on RAG for your enterprise needs, make sure you have proper data governance in place and have identified sufficient in-house sources for your knowledge base. Rigorous validation mechanisms must be in place to ensure the accuracy and reliability of both internal and externally sourced data that RAG will reference. Any deployment of RAG with an externally hosted LLM requires several security measures: encryption, access control mechanisms, and robust audit trails are technical components critical for safeguarding sensitive information.
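One of those safeguards, access control, can be enforced at retrieval time so that a user's prompt is only ever augmented with content their role is entitled to see. The roles and sensitivity labels in this sketch are purely illustrative.

```python
# Sketch of one safeguard named above: access control enforced at
# retrieval time, so a prompt is only augmented with content the user's
# role may see. Roles and sensitivity labels are purely illustrative.
ROLE_CLEARANCE = {
    "agent": {"public", "internal"},
    "manager": {"public", "internal", "restricted"},
}

knowledge_base = [  # each chunk carries a sensitivity label as metadata
    {"text": "Published pricing sheet...", "label": "public"},
    {"text": "Internal escalation playbook...", "label": "internal"},
    {"text": "Pending litigation notes...", "label": "restricted"},
]

def authorized_chunks(role: str) -> list[str]:
    """Filter the knowledge base down to what this role may retrieve."""
    allowed = ROLE_CLEARANCE.get(role, {"public"})
    return [c["text"] for c in knowledge_base if c["label"] in allowed]

print(authorized_chunks("agent"))  # the restricted chunk is excluded
```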
Also, for any in-house hosted models, ensure technical compatibility with existing enterprise systems through industry-standard methods such as APIs and webhooks for seamless integration. Well-defined APIs facilitate interoperability, allowing RAG to align with the AI applications, collaboration tools, and databases of your enterprise infrastructure while minimizing disruption during implementation.
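As a sketch of such an integration, the RAG pipeline could be wrapped in a standard HTTP endpoint. FastAPI is used here only as an example framework, and the answer function stands in for the retrieval and generation pipeline sketched earlier.

```python
# Sketch: wrapping the RAG pipeline in a standard HTTP endpoint so other
# enterprise applications can call it. FastAPI is only an example
# framework; `answer` stands in for the pipeline sketched earlier.
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class Query(BaseModel):
    question: str

def answer(question: str) -> str:
    """Placeholder wired to your retrieval + generation pipeline."""
    return f"[grounded answer for: {question}]"

@app.post("/ask")
def ask(query: Query) -> dict:
    """Chatbots, CRM plugins, and other tools POST questions here."""
    return {"answer": answer(query.question)}

# Assuming this file is saved as rag_api.py, run with:
#   uvicorn rag_api:app --reload
```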
Conclusion
Despite being a relatively new technique within Natural Language Processing (NLP), RAG is an effective way for enterprises to adopt Generative AI for external users.
While there are many tools in an ML toolkit to mold LLMs to your enterprise needs, RAG should be one of the first options you consider: it grounds LLMs in the latest verifiable information, while most other options are either time-consuming or cost-prohibitive. RAG's versatility makes it an ideal solution for external-facing use cases like chatbots, email, text messaging, and other Generative AI based applications.
I hope you find this blog useful. Please share your feedback in the comments, along with other topics you would like me to discuss in the future.