From Vector Databases to Hybrid RAG for Enterprise Gen AI
Vector databases have become synonymous with Retrieval Augmented Generation (RAG) for LLM applications in recent discourse. The embeddings approach that underpins these databases makes a lot of sense for a general-purpose Gen AI solution like ChatGPT; vector matching provides a broad, content-agnostic solution for identifying relevant information about any conceivable topic.
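To make the vector-matching idea concrete, here is a minimal sketch of embedding-based retrieval. The three-dimensional vectors and the `retrieve` helper are illustrative assumptions only; a real system would compute embeddings with a model (e.g. a sentence-transformer) and store them in a vector database.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

# Toy corpus of pre-computed embeddings; 3-d vectors stand in for the
# high-dimensional embeddings a real model would produce.
corpus = {
    "shipping policy": [0.9, 0.1, 0.0],
    "refund process": [0.7, 0.6, 0.1],
    "engineering roadmap": [0.0, 0.2, 0.9],
}

def retrieve(query_vec, k=2):
    """Return the k documents whose embeddings are closest to the query."""
    ranked = sorted(corpus.items(),
                    key=lambda kv: cosine(query_vec, kv[1]),
                    reverse=True)
    return [doc for doc, _ in ranked[:k]]

print(retrieve([0.8, 0.3, 0.0]))  # → ['shipping policy', 'refund process']
```

The content-agnostic nature of the approach is visible here: nothing in `retrieve` knows anything about shipping or refunds; relevance falls out of geometric proximity in embedding space.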
The use case for internal AI agents within an Enterprise is very different, in three important ways:
As foundation models become commoditized and prompt optimization gets systematized (e.g. with DSPy), the value differentiation of an AI system will shift from its inference engine to its “data power”: how well it understands the user request and locates relevant data across disparate data sources to supply as context. Equally important, the system must seamlessly absorb new data as it is created, so that it continues to “learn” over time.
On the technology side, data within an Enterprise is typically distributed across many different data sources: transactional SQL systems, NoSQL/object databases, graph databases, file systems, APIs, etc. In addition, domain-specific taxonomies and rich metadata can add semantic depth to the search.
At ADS, we’re building a hybrid RAG system designed to fetch data from this rich variety of sources using multiple technologies (SQL queries, knowledge graphs, keyword search, vector search, etc.) and then to stitch the results together appropriately to feed into the LLM prompt.
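One way such stitching can work is to run each retriever independently and merge the rankings, for example with reciprocal rank fusion. The sketch below is a hypothetical illustration, not the ADS design: the four stub retrievers and their hard-coded document IDs are assumptions standing in for real SQL, knowledge-graph, keyword, and vector back ends.

```python
from collections import defaultdict

# Hypothetical per-source retrievers; each returns a ranked list of document
# IDs. In a real deployment these would wrap a SQL query, a knowledge-graph
# traversal, a keyword (BM25) index, and a vector store respectively.
def sql_retrieve(query):     return ["orders_2024", "customer_123"]
def graph_retrieve(query):   return ["customer_123", "account_mgr_7"]
def keyword_retrieve(query): return ["faq_shipping", "orders_2024"]
def vector_retrieve(query):  return ["orders_2024", "faq_shipping"]

def reciprocal_rank_fusion(ranked_lists, k=60):
    """Merge several rankings into one: each document scores 1/(k + rank)
    per list it appears in, so documents ranked well by multiple sources
    float to the top."""
    scores = defaultdict(float)
    for ranking in ranked_lists:
        for rank, doc in enumerate(ranking):
            scores[doc] += 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

def hybrid_retrieve(query):
    rankings = [r(query) for r in
                (sql_retrieve, graph_retrieve, keyword_retrieve, vector_retrieve)]
    return reciprocal_rank_fusion(rankings)

print(hybrid_retrieve("where is my order?"))
```

Rank fusion is attractive here because it needs no score calibration across heterogeneous sources: a SQL hit and a vector hit are compared only by their positions in their own rankings, never by incomparable raw scores.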
This sophisticated RAG system then becomes the backbone of the AI system, the growing knowledge base that enables AI agents to provide correct answers.