The End of Lazy LLMs

In this issue:

  1. Active Learning for passive LLMs
  2. 2 heads, 1 task
  3. CRAG yourself before you wreck yourself


Want to market your brand? I’ve been personally using Passionfroot for months now, and as much as 40% of my partnerships can be attributed to their platform. They make it easy for companies to find fitting creators for their brand, and I’ve found their streamlined collaboration process to be more efficient and more enjoyable for both sides.

Become a Sponsor


1. Efficient Exploration for LLMs

Watching: Double TS (paper)

What problem does it solve? Enhancing large language models (LLMs) requires copious amounts of feedback—a costly and time-consuming process. To streamline this, researchers continuously strive to optimize the way such feedback is collected. The crux of the problem lies in efficiently generating queries that elicit the most informative feedback to improve models with the least possible number of queries, a challenge that's critical for refining these models while conserving resources.

How does it solve the problem? Double Thompson sampling, in conjunction with an epistemic neural network—an artificial neural network capable of uncertainty estimation—offers a solution. This technique operates by generating queries based on a model of uncertainty within the learning agent itself. By sampling from this distribution of uncertainty, the agent can craft more informative queries. The combination of an epistemic neural network for uncertainty estimation and a strategic exploration scheme enables efficient collection of high-quality human feedback, thereby reducing the number of needed queries while maintaining or even enhancing the performance of LLMs.
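To make the mechanism concrete, here is a minimal sketch of double Thompson sampling for preference queries, with a plain ensemble of scoring functions standing in for the epistemic neural network. All names, the ensemble stand-in, and the retry heuristic are illustrative assumptions, not the paper's implementation.

```python
import random

def double_thompson_query(candidates, ensemble, rng=random):
    """Pick a pair of responses to show a human rater.

    Double Thompson sampling: draw two independent hypotheses from the
    epistemic model (here: an ensemble with at least two members) and let
    each pick its favorite candidate. If both picks coincide, the query
    carries little information, so resample the second pick a bounded
    number of times to try to force a disagreement.
    """
    h1, h2 = rng.sample(ensemble, 2)       # two sampled reward hypotheses
    first = max(candidates, key=h1)        # h1's preferred response
    second = max(candidates, key=h2)       # h2's preferred response
    tries = 0
    while second == first and tries < 10:  # seek an informative pair
        second = max(candidates, key=rng.choice(ensemble))
        tries += 1
    return first, second
```

With two deliberately opposed hypotheses (e.g. one scoring by length, one by negative length), the query pairs the two responses the model is most uncertain between, which is exactly the comparison a human label resolves most cheaply.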

What's next? This research sets the stage for further development of efficient data collection methods and more sophisticated exploration schemes, ensuring faster and cheaper refinements to LLMs. The exciting question now is how the incorporation of such exploration schemes will generalize across different model architectures, tasks, and domains. The ability to enhance model performance with fewer queries opens up possibilities for more intelligent, adaptive models that can self-improve with minimal human intervention.


2. Two Heads Are Better Than One: Integrating Knowledge from Knowledge Graphs and Large Language Models for Entity Alignment

Watching: LLMEA (paper)

What problem does it solve? Entity alignment is crucial for building extensive and interconnected Knowledge Graphs (KGs), which in turn are foundational for tasks in semantic web and artificial intelligence. The challenge in entity alignment lies in identifying correspondences across different KGs, considering the heterogeneous information they harbor—including structural, relational, and attributive data. Currently, embeddings generated to represent entities' multi-faceted information are difficult to match due to their dissimilarity and the lack of effective mechanisms to leverage these embeddings. Moreover, the potential of leveraging the nuanced semantic understanding of Large Language Models (LLMs) for this purpose has remained untapped.

How does it solve the problem? The proposed Large Language Model-enhanced Entity Alignment framework (LLMEA) addresses the shortcomings of previous entity alignment methods by harnessing the semantic knowledge inherent in LLMs. LLMEA operates by: 1) identifying candidate alignments based on embedding similarities and edit distances, and 2) using LLMs' inference abilities to iteratively process multi-choice questions that pinpoint the final aligned entity. This approach marries structural knowledge from KGs with the rich semantic understanding from LLMs, thereby creating a more nuanced and accurate alignment. By interpreting candidate entities through an LLM and using its robust inference capabilities, the framework can dynamically utilize the context and implicit knowledge that LLMs have been trained on.
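The two stages above can be sketched as follows. The 50/50 fusion of embedding and edit-distance similarity, the helper names, and the prompt wording are assumptions for illustration; LLMEA's actual fusion and prompting may differ.

```python
import difflib
import math

def cosine(u, v):
    """Cosine similarity between two dense entity embeddings."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) *
                  math.sqrt(sum(b * b for b in v)))

def candidate_alignments(entity, entity_emb, targets, target_embs, k=3):
    """Stage 1: rank target-KG entities for one source entity by combining
    embedding similarity with name-level edit-distance similarity."""
    scored = []
    for name, emb in zip(targets, target_embs):
        sim = cosine(entity_emb, emb)
        edit = difflib.SequenceMatcher(None, entity, name).ratio()
        scored.append((0.5 * sim + 0.5 * edit, name))  # assumed weighting
    return [name for _, name in sorted(scored, reverse=True)[:k]]

def multi_choice_prompt(entity, candidates):
    """Stage 2: format the top candidates as a multiple-choice question
    that an LLM answers to pinpoint the final aligned entity."""
    options = "\n".join(f"{chr(65 + i)}. {c}" for i, c in enumerate(candidates))
    return (f"Which entity refers to the same real-world object as "
            f"'{entity}'?\n{options}\nAnswer with a single letter.")
```

The key design point is that the cheap fused score only has to get the right answer into the top-k; the LLM's semantic knowledge then does the final disambiguation.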

What’s next? The results from the public datasets where LLMEA outperforms leading baseline models are promising, but it will be interesting to see how this framework performs at a larger scale, or with KGs that have sparse or noisy information. Future developments might include refinements in candidate selection processes or additional integration of LLMs for ongoing knowledge discovery and alignment tasks.


3. Corrective Retrieval Augmented Generation

Watching: CRAG (paper)

What problem does it solve? Large language models (LLMs) sometimes hallucinate: the generated text does not correspond to reality or lacks accuracy, and the model's inherent knowledge alone does not always prevent this. While retrieval-augmented generation (RAG) helps by pulling contextually relevant content from external databases, its effectiveness is compromised when the quality of those external sources is poor. The primary concern this paper addresses is how to make the information generated by LLMs more reliable, especially when the RAG approach brings in suboptimal or irrelevant information.

How does it solve the problem? The proposed method, Corrective Retrieval Augmented Generation (CRAG), incorporates a novel retrieval evaluator that gauges the quality and relevance of documents fetched by the RAG system. Based on the confidence score provided by the evaluator, CRAG can adapt its retrieval strategies accordingly—thereby improving the quality of the generation. Furthermore, by incorporating large-scale web searches, CRAG expands beyond static databases to enhance its source material further. It also employs a decompose-then-recompose algorithm to sift through the fetched documents, emphasizing crucial details and discarding irrelevant content. Its "plug-and-play" nature also ensures that CRAG can be used alongside a multitude of existing RAG-based systems.
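The evaluator-driven control flow can be sketched like this. The thresholds, helper names, and the crude "keep the longer sentences" refinement are illustrative stand-ins, not CRAG's actual values or algorithm.

```python
def route_retrieval(docs, evaluator, upper=0.7, lower=0.3):
    """Map the retrieval evaluator's confidence to one of three actions:

    - correct:   keep the retrieved docs, but refine them first
    - incorrect: discard them and fall back to web search
    - ambiguous: combine refined docs with web-search results
    """
    score = max(evaluator(d) for d in docs)  # best document decides
    if score >= upper:
        return "correct", refine(docs)
    if score <= lower:
        return "incorrect", web_search_stub()
    return "ambiguous", refine(docs) + web_search_stub()

def refine(docs):
    """Decompose-then-recompose, caricatured: split documents into
    sentences and keep only the upper half by length as a placeholder
    for real relevance filtering."""
    sents = [s.strip() for d in docs for s in d.split(".") if s.strip()]
    cutoff = sorted(len(s) for s in sents)[len(sents) // 2]
    return [s for s in sents if len(s) >= cutoff]

def web_search_stub():
    # Stand-in for CRAG's large-scale web search call.
    return ["<web search results would go here>"]
```

The plug-and-play claim follows from this shape: the router sits between any retriever and any generator, touching neither.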

What's next? Moving forward, it would be fascinating to see how CRAG's introduction to various contexts impacts the adaptation of LLMs in more complex and nuanced environments, such as legal or technical domains where accuracy is paramount. Testing CRAG's effectiveness in these areas could set new standards for information retrieval in LLMs and possibly pave the way for even more advanced systems that combine the judgment and robust reasoning of AI with the ever-expanding knowledge of the internet. Evaluation in real-world scenarios will be crucial to understanding the limits and possibilities of CRAG-enhanced language models.


Papers of the Week:

Marc Policani (PMP, SAFe)

Sr Management Consultant | Unleashing Organizational Potential through Integrated PMO, Portfolio, and Program Excellence with a Touch of AI Innovation

10 months

Between ChatGPT, Bard, and Claude, Bard is the laziest by far. 3/4 of the time, it will give me tips and tricks on how to perform the tasks I asked it to execute, rather than performing the tasks I asked it to execute.

Hakim Elakhrass

post-deployment data science | OSS | co-founder @ nannyML

10 months

i hope not, i enjoy threatening them to get a proper response


More articles by Pascal Biese

  • Actually Open AI: A Free o1 Alternative
  • The Future of Designing AI Agents
  • HTML > Plain Text for RAG
  • All You Need to Know About Small Language Models
  • Is AI Capable of Reflection?
  • GraphRAG Evolves into StructRAG
  • Fixing AI's Energy Consumption
  • Chasing o1: Closing the Reasoning Gap
  • LLMs Are Improving Themselves
  • A New Neural Architecture (Again)
