Small Language Models: An Efficient and Sustainable Alternative to LLMs?

Nearly two years after the launch of ChatGPT, the hype around large language models (LLMs) shows no signs of abating. But while generative AI (GenAI) holds great promise, many IT leaders see a risk of overspending on the necessary infrastructure. And although investment in GenAI is forecast to skyrocket in the coming years, CIOs and other senior execs remain concerned about the ROI of such implementations. That’s where small language models (SLMs) come in – offering a more efficient, sustainable, and lower-cost alternative to LLMs.

AI Is on the Rise – But Doubts Remain Around ROI

According to market intelligence firm IDC, GenAI spend in 2024 is up 100% year-on-year and is expected to total $151 billion in 2027. These are impressive figures, but the ROI delivered by AI implementations is still a worry. Recent Accenture research shows that 35% of C-level execs now report greater cost-reduction benefits from implementing AI – up from 24% only six months earlier. But while the trend is upward, there’s still no clear consensus on the returns.

This may be set to change with the adoption of more efficient, sustainable, and lower-cost small language models (SLMs). Gartner predicts that by 2027 more than 50% of the GenAI models used by large businesses will be tailored to specific industries or business functions, up from around just 1% in 2023.

Smaller, More Efficient GenAI: SLMs in a Nutshell

But what are SLMs? Sometimes also called specialized language models, SLMs are machine learning (ML) models trained on much smaller and more specific data sets than those used for LLMs. Often, this data is enterprise-specific.

SLMs combine advanced AI capabilities with reduced computational demand. While LLMs generally have billions, or even hundreds of billions, of parameters, that scale is often unnecessary in enterprise settings. Consequently, customized SLMs are usually the more efficient solution for handling specific, well-defined tasks.
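
To make this concrete, here is a minimal sketch of what adapting a small open model to enterprise data can look like, using the Hugging Face transformers and datasets libraries. The model name, the two sample ticket texts, and the training settings are illustrative placeholders only; a real project would use a representative domain corpus and tuned hyperparameters.

```python
# Minimal sketch: adapting a small open model to domain-specific text.
# Model name, sample texts, and settings are placeholders for illustration.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_name = "distilgpt2"  # a compact (~82M-parameter) model; any small model works
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # GPT-2-style models have no pad token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Hypothetical enterprise snippets standing in for a real domain corpus.
domain_texts = [
    "Ticket 4711: VPN client fails after the 2.3.1 update; rollback resolves it.",
    "Order OC-2291 was delayed because the warehouse scanner firmware was outdated.",
]
dataset = Dataset.from_dict({"text": domain_texts})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="slm-finetuned",
                           num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()  # fine-tunes the small model on the domain snippets
```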

Many and Varied Use Cases

The applications for SLMs are similar to those for LLMs, only with a much narrower focus. For example, the technology is well suited to specialized image- and text-generation tasks. SLMs can handle many other tasks, too, including translation, market-trend analysis, and customer service. Companies can also use them to manage IT tickets or as chatbots and virtual business assistants. In addition, they can run on edge devices such as smartphones, tablets, and laptops, as the sketch below illustrates.
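
As a rough illustration of the edge scenario, the sketch below loads a compact, openly available instruction-tuned model via the Hugging Face pipeline API and asks it to triage a hypothetical IT ticket on a laptop CPU. The model name, prompt wording, and ticket text are placeholders rather than recommendations.

```python
# Minimal sketch: running a compact model on a CPU-only machine to triage a ticket.
# The model name and prompt are illustrative placeholders.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="Qwen/Qwen2.5-0.5B-Instruct",  # any small instruction-tuned model will do
    device=-1,                           # -1 = run on CPU, e.g. a laptop
)

ticket = "My laptop cannot reach the corporate VPN since this morning."
prompt = (
    "Classify the following IT ticket as 'network', 'hardware', or 'software' "
    "and suggest a first troubleshooting step.\n"
    f"Ticket: {ticket}\nAnswer:"
)

result = generator(prompt, max_new_tokens=60, do_sample=False)
print(result[0]["generated_text"])
```

Because everything runs locally, no ticket data has to leave the device.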

Where SLMs Shine

To gauge the pros and cons of LLMs versus SLMs, we need to compare the two in greater detail. First, let’s consider some upsides of SLMs. Because they are trained on relatively small, domain-specific data sets, small language models tend to perform their tasks more efficiently and accurately. What’s more, they can do so almost twice as fast as LLMs and with 27% less memory overhead.
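
Figures like these depend heavily on hardware, model choice, and workload, so it is worth measuring on your own setup. The sketch below simply times text generation for a small and a larger openly available model; the model names and prompt are placeholders, and a meaningful benchmark would average many runs over representative inputs.

```python
# Minimal sketch: timing text generation for a small versus a larger open model.
# Model names and the prompt are placeholders; average many runs for real benchmarks.
import time

from transformers import AutoModelForCausalLM, AutoTokenizer

def generation_seconds(model_name: str,
                       prompt: str = "Summarize the open IT tickets for today.") -> float:
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    inputs = tokenizer(prompt, return_tensors="pt")
    start = time.perf_counter()
    model.generate(**inputs, max_new_tokens=50)  # generate 50 new tokens
    return time.perf_counter() - start

for name in ["distilgpt2", "gpt2-large"]:  # ~82M vs ~774M parameters
    print(f"{name}: {generation_seconds(name):.1f} s for 50 new tokens")
```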

SLMs also have the advantage of greater cost efficiency compared to LLMs, and they’re more sustainable. And because they can run as customized, locally deployed models rather than in the cloud, they carry a lower risk of privacy violations.

Where SLMs Do Less Well

While these benefits are appealing, it’s important not to lose sight of the weaknesses of SLMs. For one thing, they tend to struggle with nuance, context, and creativity. That’s hardly surprising, given the far greater volumes and variety of training data that LLMs work with.

What’s more, SLMs can pose challenges for IT organizations – not least because they require high-quality training data to avoid errors. Integrating small language models into the business can also be complex and costly. And wherever client data is involved, legal and ethical considerations must be taken into account.

Cost Comparison: LLMs Versus SLMs

Large and small language models differ not only in terms of infrastructure, implementation complexity, and sustainability, but also in the costs and returns involved. The key differences are summarized in the overview below.

Better for Business, Better for the Environment

When deciding how best to deploy AI, companies should assess whether they have the technical infrastructure, architecture, operating model, and governance structure needed to meet the high compute demands of LLMs and GenAI. But they must also keep a close eye on cost and on sustainable energy consumption.

One distinct advantage of SLMs in enterprise architectures is their sustainability. AI is notoriously energy intensive, which drives up the cost of an already expensive technology. By deploying SLMs where appropriate, organizations can shrink their AI energy footprint and move closer to their sustainability goals.

Pros and Cons of LLMs versus SLMs: An Overview

To sum up: SLMs have the edge when it comes to efficiency, in terms of cost, energy use, and fit to specific use cases. However, they lag behind LLMs in accuracy, in handling complex tasks, and in flexibility.

LLMs, on the other hand, win out in terms of capability, performance, and versatility. But operating them involves considerable resources and infrastructure – and, therefore, higher costs.

If you’re thinking of embarking on an SLM journey, start by assessing the costs and benefits of the technology compared with other AI or analytical approaches. Any evaluation of this kind must consider various factors, including the need to retrofit data centers and introduce new chipset architectures, hardware innovations, and efficient algorithms.
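
As a starting point for such an assessment, even a back-of-the-envelope comparison can be useful. The sketch below contrasts a hypothetical pay-per-token LLM API with a hypothetical self-hosted SLM; every figure is a placeholder to be replaced with your own workload volumes, vendor pricing, and hosting and energy costs.

```python
# Back-of-the-envelope sketch: all figures are hypothetical placeholders.
requests_per_month = 500_000      # assumed monthly request volume
tokens_per_request = 1_000        # assumed average prompt + completion length

# Hypothetical LLM API pricing (blended rate per 1,000 tokens).
llm_price_per_1k_tokens = 0.01
llm_monthly_cost = (requests_per_month * tokens_per_request / 1_000
                    * llm_price_per_1k_tokens)

# Hypothetical self-hosted SLM: fixed hosting plus energy overhead.
slm_hosting_cost = 1_200          # assumed monthly server/GPU rental
slm_energy_cost = 150             # assumed monthly energy cost
slm_monthly_cost = slm_hosting_cost + slm_energy_cost

print(f"LLM API (assumed pricing):      ${llm_monthly_cost:,.0f} per month")
print(f"Self-hosted SLM (assumed cost): ${slm_monthly_cost:,.0f} per month")
```

In practice, the break-even point depends on utilization: a lightly used self-hosted deployment may not beat pay-per-token pricing.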

Questions? Comments?

Want to learn more about SLMs and discover whether they’re right for your business? Then feel free to contact me. And if you have your own ideas about the relative merits of LLMs and SLMs, please share them in the comments below.

Peter Bardenhagen

Digital Transformation Architect | Enterprise AI, Data & Cloud Solutions | Strategy & Advisory

1 month ago

Thank you for the detailed explanation, Dominik. Considering that some large language models (LLMs) are 40GB, I would prefer to concentrate my efforts on a 300MB small language model (SLM) that is suitable for my needs and can be further fine-tuned. For someone who downloads models onto their own machine, this is crucial. Additionally, the time and cost savings, whether online or offline, are significant.
