Why Large Context Windows in LLMs Don't Replace the Need for Search in Enterprise Knowledge
Ahmad Haj Mosa, PhD
Director @ PwC | Head of AI CoE | Co-founder of Digicust | PhD, Machine Learning, GenAI, Driving Innovation | TEDAI Panelist
In the realm of artificial intelligence, the capabilities of large language models (LLMs) like Gemini 1.5, which can process up to 1 million tokens, have sparked discussion about the future of information retrieval. At first glance, the notion of feeding the equivalent of roughly 4,000 pages of text into an LLM and retrieving accurate information sounds like it could render traditional search mechanisms obsolete. However, this perspective overlooks a crucial aspect of how neural networks, including transformers, handle information: compression.
Compression in neural networks is not about squeezing text into a smaller box; it is about distilling the essence of the data. This process abstracts the most prominent patterns from the input, where prominence is often synonymous with frequency. For instance, a neural network trained to recognize faces learns to focus on facial features while disregarding background details. Focusing on the "most prominent" offers no guarantee that nuances, or rare but critical details buried within the 1 million tokens, will be preserved or accurately represented.
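To make that intuition concrete, here is a deliberately simplistic Python sketch. It is an analogy, not a model of transformer internals: the sample text and the top-5 cutoff are invented for illustration. It shows how a purely frequency-driven "summary" silently drops a rare but critical fact.

```python
# Toy analogy only (real transformers are far more sophisticated):
# a "summary" that keeps the most frequent terms drops rare but
# critical details, which is the risk described above.
import re
from collections import Counter

text = ("Revenue growth report on revenue targets and market outlook; "
        "revenue drives growth; market report reviews targets. "
        "Footnote: the indemnity cap is 2 million euros.")

words = re.findall(r"[a-z]+", text.lower())

# "Compress" the document to its 5 most prominent (frequent) terms.
top_terms = {term for term, _ in Counter(words).most_common(5)}
print(sorted(top_terms))         # the retained "essence"
print("indemnity" in top_terms)  # False: the critical clause is lost
```

No matter how large the input, any representation that privileges prominent patterns carries this risk for the long tail of rare facts.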
Therefore, while LLMs with extensive context windows offer remarkable capabilities for abstracting knowledge from large datasets, they are not infallible. Their strength in abstracting knowledge does not equate to delivering it with pinpoint accuracy. This distinction underscores why efficient retrieval-augmented generation (RAG) systems and search functions remain indispensable. These technologies ensure that specific, detailed, or less frequent pieces of information can be retrieved when needed, thereby complementing the broad knowledge abstraction capabilities of LLMs.
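As an illustration of the retrieval step, here is a minimal sketch in Python. It uses TF-IDF from scikit-learn as a stand-in for the embedding models and vector databases typical of production RAG pipelines; the knowledge-base chunks and the query are hypothetical.

```python
# Minimal retrieval sketch: TF-IDF similarity as a stand-in for the
# embedding-based vector search used in production RAG systems.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical knowledge-base chunks; in practice these come from
# chunking enterprise documents.
chunks = [
    "Our standard SaaS contract renews annually on 1 January.",
    "The indemnity cap for enterprise customers is 2 million euros.",
    "Support tickets are answered within one business day.",
]

vectorizer = TfidfVectorizer()
chunk_vectors = vectorizer.fit_transform(chunks)

def retrieve(query: str, k: int = 1) -> list[str]:
    """Return the k chunks most similar to the query."""
    query_vector = vectorizer.transform([query])
    scores = cosine_similarity(query_vector, chunk_vectors)[0]
    ranked = scores.argsort()[::-1][:k]
    return [chunks[i] for i in ranked]

context = retrieve("What is the indemnity cap?")
print(context)  # the exact source chunk, ready to ground the LLM's answer
```

In a complete pipeline, the retrieved chunks are prepended to the prompt, so the model answers from the exact source text rather than from a compressed abstraction of it.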
In summary, the evolution of LLMs and their expanding input capacities present exciting opportunities for managing and leveraging vast amounts of data. Yet, this technological advancement does not diminish the value of search functions. Instead, it highlights the need for a synergistic approach that combines the abstracting power of LLMs with the precision and specificity of traditional search and retrieval systems. As we continue to navigate the complexities of information processing, it's clear that a multi-faceted approach is essential for addressing the nuanced demands of real-world applications.
#AI #MachineLearning #InformationRetrieval #LLM #NeuralNetworks #KnowledgeManagement