Do LLMs, SLMs and Large Vision Models in AI know when to stop?

LLMs, LVMs and SLMs in AI are excellent at predicting the next word or image in a sequence. However, the criteria for when to stop are programmed by humans. Those criteria are ultimately arbitrary, so a model can over-engineer or under-engineer a scene, a poem, or your term paper.

  1. LLMs and Vector Representations: LLMs are not literally a "library of vectors of meaning," but they do use vector representations internally. LLMs process and generate text by working with high-dimensional vector representations of words and phrases, often called embeddings (a toy embedding sketch follows this list).
  2. Role of Transformers: Transformers are indeed a key architecture used in many modern LLMs. They help interpret context by using mechanisms like self-attention to understand relationships between different parts of the input text (see the minimal self-attention sketch after this list).
  3. Vector Databases and LLMs: Vector databases are often used in conjunction with LLMs, but they are separate components that store and retrieve embeddings rather than parts of the model itself.
  4. Context Interpretation: LLMs, particularly those based on transformer architectures, are designed to interpret context within the text they process. This context interpretation happens through the model's learned parameters and attention mechanisms, not through a static "library of vectors."
  5. Embeddings and Meaning: While LLMs do work with vector representations that capture aspects of meaning, calling them a "library of vectors of meaning" is an oversimplification. The model's understanding of meaning comes from its training on vast amounts of text data and its ability to process this information through its neural network architecture
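
To make points 1, 3 and 5 concrete, here is a minimal sketch in Python of mapping words to vectors and comparing them by cosine similarity, which is the same operation a vector database performs at scale. The three-dimensional vectors are hand-made toys invented for this illustration; real embeddings come from a trained model and have hundreds or thousands of dimensions.

    import numpy as np

    # Toy, hand-made "embeddings" for illustration only. Real embeddings
    # come from a trained model and are far higher-dimensional.
    embeddings = {
        "king":  np.array([0.9, 0.80, 0.1]),
        "queen": np.array([0.9, 0.75, 0.2]),
        "apple": np.array([0.1, 0.20, 0.9]),
    }

    def cosine_similarity(a, b):
        """Cosine of the angle between two vectors; 1.0 means same direction."""
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

    def nearest(query):
        """Rank the stored vocabulary by similarity to the query word,
        which is essentially what a vector database does at far larger scale."""
        q = embeddings[query]
        scores = [(w, cosine_similarity(q, v))
                  for w, v in embeddings.items() if w != query]
        return sorted(scores, key=lambda s: s[1], reverse=True)

    print(nearest("king"))  # "queen" ranks well above "apple"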
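
For point 2, here is an equally stripped-down sketch of the scaled dot-product self-attention at the heart of transformers. Real transformers apply learned projection matrices to produce queries, keys and values; this toy version uses the token vectors directly to keep the sketch minimal.

    import numpy as np

    def self_attention(X):
        """Scaled dot-product self-attention over a (seq_len, d) array of
        token vectors. Learned Q/K/V projections are omitted for brevity."""
        d = X.shape[-1]
        scores = X @ X.T / np.sqrt(d)   # how strongly each token attends to each other token
        weights = np.exp(scores)
        weights /= weights.sum(axis=-1, keepdims=True)  # softmax over each row
        return weights @ X              # each output vector is a context-aware mixture

    tokens = np.random.default_rng(0).normal(size=(4, 8))  # 4 tokens, 8 dimensions each
    print(self_attention(tokens).shape)  # (4, 8): same shape, now context-enriched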


Knowing when to stop in LLMs:

  1. LLM Behavior: LLMs don't inherently "know" when to stop generating text in the same way humans do. They are designed to continue generating text based on the input and their training patterns.
  2. Stop Tokens: LLMs can be programmed to stop generating when they encounter specific "stop tokens" or sequences. These are typically defined by the developers or users of the model (see the sketch after this list).
  3. Context Length: LLMs have a maximum context length (e.g., 2048 tokens for the original GPT-3). They will naturally stop generating once this limit is reached.
  4. Probabilistic Nature: LLMs generate text based on probabilities learned during training. They don't have a built-in concept of narrative completion or conversational turn-taking.
  5. The "Art" of Stopping: The phrase "when to stop is an art" likely refers to the challenge of determining appropriate stopping points in various applications of LLMs. This could involve: a) Designing prompts that naturally lead to concise responses. b) Implementing post-processing techniques to trim generated text. c) Fine-tuning models to better recognize natural ending points.
  6. Human Intervention: In many applications, human oversight is still necessary to determine when an LLM's output is sufficient or complete.
  7. Application-Specific Strategies: Different applications may require different strategies for managing output length and completeness. For example, chatbots might use turn-taking cues, while content generation might use topic exhaustion signals.
  8. Ongoing Research: Improving LLMs' ability to self-regulate output length and completeness is an active area of research in AI and NLP.
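
Points 2 and 3 can be seen directly in code. Below is a minimal sketch using the Hugging Face transformers library (the gpt2 model name is chosen purely for illustration): generation halts when the model emits its end-of-sequence token, when the max_new_tokens budget runs out, or when a custom criterion spots a chosen stop phrase, whichever comes first.

    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              StoppingCriteria, StoppingCriteriaList)

    tokenizer = AutoTokenizer.from_pretrained("gpt2")  # illustrative model choice
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    class StopOnPhrase(StoppingCriteria):
        """Halt generation as soon as the decoded text contains a stop phrase."""
        def __init__(self, phrase):
            self.phrase = phrase
        def __call__(self, input_ids, scores, **kwargs):
            return self.phrase in tokenizer.decode(input_ids[0],
                                                   skip_special_tokens=True)

    inputs = tokenizer("Once upon a time", return_tensors="pt")
    output = model.generate(
        **inputs,
        max_new_tokens=50,                    # hard ceiling (point 3)
        eos_token_id=tokenizer.eos_token_id,  # the model's own stop token (point 2)
        stopping_criteria=StoppingCriteriaList([StopOnPhrase("The end")]),
        pad_token_id=tokenizer.eos_token_id,  # GPT-2 has no pad token; reuse EOS
    )
    print(tokenizer.decode(output[0], skip_special_tokens=True))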


Coaching LLMs and LVMs to Stop:

  1. Agentic Frameworks: Agentic frameworks can potentially be used to implement more sophisticated stopping mechanisms for LLMs based on specific use cases. This could involve creating specialized agents responsible for monitoring and controlling the output's length and relevance.
  2. Use Case-Specific Approaches: Different use cases may require different stopping criteria, and agentic frameworks allow for customization based on those specific needs.
  3. Feedback Loops and Self-Reflection: Agentic frameworks often incorporate feedback loops and self-reflection mechanisms that let a system judge whether its own output is complete.
  4. Multi-Agent Collaboration: A system could be designed where one agent generates content while another evaluates it and decides when to stop (see the sketch after this list).
  5. Tool Integration: Agentic frameworks allow for integration with external tools and APIs, which can supply additional signals about completeness.
  6. Continuous Learning and Adaptation: Agentic AI systems can learn and adapt their stopping behavior over time.
  7. Safety and Governance: Agentic frameworks often include safety features and governance tools, such as hard caps on the number of generation rounds.
  8. Challenges: Implementing effective stopping mechanisms requires careful design and testing. There is a need to balance providing complete information against avoiding unnecessary verbosity. Different use cases may require significantly different stopping strategies, necessitating flexible and adaptable systems.
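
As a concrete illustration of points 1 and 4, here is a minimal sketch of a generator/evaluator stopping loop. The generate_draft and is_complete functions are hypothetical stand-ins for calls to an LLM of your choice; the point is the control flow, where the evaluator agent and a safety cap, not the generator, decide when to stop.

    MAX_ROUNDS = 5  # governance-style safety cap so the loop always terminates

    def generate_draft(prompt, draft):
        """Hypothetical generator agent: extend the draft by one chunk.
        In a real system this would be an LLM API call."""
        return draft + " ...next passage..."  # placeholder continuation

    def is_complete(prompt, draft):
        """Hypothetical evaluator agent: judge whether the draft already
        satisfies the prompt. In a real system this would be a second LLM
        call, e.g. 'Reply YES if the draft fully answers the prompt.'"""
        return len(draft) > 60  # placeholder heuristic

    def agentic_generate(prompt):
        draft = ""
        for _ in range(MAX_ROUNDS):
            draft = generate_draft(prompt, draft)
            if is_complete(prompt, draft):  # the evaluator calls the stop
                break
        return draft.strip()

    print(agentic_generate("Write a short scene."))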

Conclusion:

While LLMs don't inherently "know" when to stop in a human-like way, managing their output effectively is indeed an art that involves a combination of technical strategies, careful prompt engineering, and often human oversight. The challenge lies in balancing the model's generative capabilities with the need for coherent, appropriately sized outputs across various applications. LLMs are sophisticated neural networks that process and generate text based on learned patterns and relationships in language. Vector databases, while often used alongside LLMs, are separate tools that can enhance LLM capabilities by efficiently storing and retrieving relevant vector embeddings. Using agentic frameworks to coach LLMs on when to stop, based on the use case, is a promising approach: it allows for more nuanced, context-aware, and adaptable stopping mechanisms than simple token limits or static stop sequences.

Andy Burns

Principal Technical Project Manager, Lean-Agile Portfolio Coach, PMP, PMI-ACP, DAC, SPC, RTE

4 months

No. The models have 1,000 eyes. There must be a Human in the Loop (HITL) to know where to look.
