LLMs: Is bigger always better? Small LLMs are punching above their weight.

Large Language Models (LLMs) have taken the world by storm, with names like GPT-4 by OpenAI, Llama 2 by Meta, Jurassic-1 Jumbo by AI21 Labs, or Gemini (previously known as Bard) by Google DeepMind dominating headlines. But are these behemoths always the best choice? In an enterprise environment there are more factors to weigh than versatility, and there are smaller, more focused alternatives, such as Granite 13B by IBM, Mistral 7B by Mistral AI, or Flan-T5 3B by Google.


LLMs: Powerhouses with potential pitfalls

  • Strengths: LLMs boast impressive versatility, addressing diverse tasks from creative writing to code generation. They excel at complex reasoning and learning from massive datasets.
  • Weaknesses: Their complexity comes at a cost. LLMs require significant computational resources and are often black boxes, making it difficult to understand their reasoning or identify biases. Additionally, their training on vast datasets can raise ethical concerns.

They usually deliver better results, but the cost of running them is surprisingly high for teams moving from pilots and tests to production inference. The back-of-envelope sketch below illustrates the gap.
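As a rough illustration, here is a minimal Python sketch. Every rate and volume in it is a hypothetical placeholder of my own, not vendor pricing; plug in your own numbers.

```python
# Back-of-envelope monthly inference cost. All figures below are
# hypothetical assumptions for illustration, not real price quotes.

def monthly_cost(requests_per_day: int, tokens_per_request: int,
                 price_per_1k_tokens: float) -> float:
    """Estimate the monthly token bill at a given per-1K-token price."""
    tokens_per_month = requests_per_day * tokens_per_request * 30
    return tokens_per_month / 1000 * price_per_1k_tokens

# Assumed production workload: 50,000 requests/day, ~1,000 tokens each.
workload = dict(requests_per_day=50_000, tokens_per_request=1_000)

large = monthly_cost(**workload, price_per_1k_tokens=0.03)    # assumed rate
small = monthly_cost(**workload, price_per_1k_tokens=0.0005)  # assumed rate

print(f"Large model: ${large:,.0f}/month")  # $45,000/month at these assumptions
print(f"Small model: ${small:,.0f}/month")  # $750/month at these assumptions
```

The absolute numbers matter less than the ratio: at production volumes, an order-of-magnitude gap in per-token price dominates the total bill.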


Smaller Models: Faster, cheaper, more explainable

  • Strengths: Smaller models offer several advantages. They are generally more lightweight and require less computational power, making them easier to deploy and potentially more cost-effective. Additionally, their smaller size often makes them more transparent and explainable, and easier to manage in the context of risk, compliance, and overall AI governance. (The sketch after this list shows how little code a local deployment can take.)
  • Weaknesses: While capable, smaller models may not match the sheer breadth and depth of very large LLMs. But when used in a specific expert niche (for example finance, legal, or manufacturing), they can deliver very good results.
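To make the deployment point concrete, here is a minimal sketch of running a small open-weights model locally with the Hugging Face transformers library. The model ID, prompt, and generation settings are illustrative choices on my part, and you need enough GPU or CPU memory for a ~7B model (plus the accelerate package for device_map).

```python
# Minimal local inference with a small open-weights model via
# Hugging Face transformers. Model choice and prompt are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-Instruct-v0.2"  # one published ~7B model

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",   # spread layers across available GPU(s)/CPU
    torch_dtype="auto",  # use the precision the checkpoint was saved in
)

prompt = "Summarize the key obligations in this supplier contract: ..."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The same few lines run on a single commodity GPU, which is what makes small models attractive for on-premises or compliance-sensitive deployments.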

Smaller models also provide better transparency and trustworthiness; some vendors, such as IBM with its Granite models, even offer indemnification, based on the confidence they have in the quality of the data used to train the family of foundation models.

Some vendors mix several small expert models to match the benchmarks of very large models while maintaining the speed and cost efficiency of smaller ones; Mixtral 8x7B is an example. Verification of which approach is best is still to come.
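For intuition, here is a toy Python sketch of the mixture-of-experts routing idea behind models like Mixtral 8x7B: a small gating network picks the top-k experts per token, so only a fraction of the total parameters runs on any forward pass. This is a conceptual illustration with made-up dimensions, not Mixtral's actual implementation.

```python
# Toy mixture-of-experts (MoE) routing: route each token to its top-k
# experts and blend their outputs by the gate's softmax weights.
import numpy as np

rng = np.random.default_rng(0)

n_experts, top_k, d_model = 8, 2, 16             # Mixtral-like: 8 experts, top-2
gate_w = rng.normal(size=(d_model, n_experts))   # gating network weights
experts = [rng.normal(size=(d_model, d_model)) for _ in range(n_experts)]

def moe_layer(x: np.ndarray) -> np.ndarray:
    """Apply only the top-k experts to token vector x."""
    logits = x @ gate_w                      # one score per expert
    chosen = np.argsort(logits)[-top_k:]     # indices of the top-k experts
    weights = np.exp(logits[chosen])
    weights /= weights.sum()                 # softmax over the chosen experts
    return sum(w * (x @ experts[i]) for w, i in zip(weights, chosen))

token = rng.normal(size=d_model)
print(moe_layer(token).shape)                # (16,), using only 2 of 8 experts
```

With 8 experts and top-2 routing, only a quarter of the expert parameters are active per token, which is how such models keep inference cost closer to that of a small model while benchmarking closer to a large one.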


So, which model is right for you?

The answer depends on your specific needs:

  • LLMs are ideal for: Tasks requiring vast knowledge, complex reasoning, or highly creative outputs. However, be prepared for computational demands and potential interpretability challenges. They are also more prone to hallucinations, which matters if you take responsibility for outputs in a commercial environment.
  • Smaller models shine in: Situations where efficiency, explainability, and cost are critical. In enterprise environments, specialization takes priority over creativity. They're also great starting points for experimentation or for fine-tuning on domain-specific data.

Choosing the right LLM is all about understanding your needs and priorities. Don't get caught up in the hype: explore both options and find the model that empowers you to achieve your goals. Pay attention to the total cost of ownership (model, infrastructure, skills) and to compliance and risk when using generative AI in business.

What are your experiences with LLMs and smaller models? Share your thoughts in the comments.

#LLMs #AI #NLP #MachineLearning #DataScience #Startups #Tech #ibm #mistral #gemini #GenAI #watsonx #governance

Agnieszka Szufarska

Head of Marketing Operations, Mobile Networks

1y

Thank you for sharing - I especially agree with the claim that smaller models may be better for specialized tasks. I can easily imagine areas of business in which I would explicitly want my model to NOT be trained on some data categories that may often be used for the big ones.
