The Business Case for Small AI Models: Efficiency Without Compromise
Maribel Lopez
Helping companies connect technology to business value. I'm a technology industry analyst, speaker, author, and business advisor
In the rapidly evolving landscape of artificial intelligence, business leaders face a critical decision: which AI models will deliver the most value to their organization? While much attention has focused on massive models with billions of parameters, there's a compelling case for considering smaller, more specialized AI models. Lopez Research interviewed Kate Soule, Director of Technical Product Management for IBM's Granite products, to learn more about these smaller models.
IBM Granite represents a family of open AI models specifically designed for transparency, data governance, and practical business applications. IBM's third-generation of Granite models (3.2) offer reasoning capabilities and multimodal models that offers vision and has been optimized for document understanding.
Why Size Matters (and Smaller Can Be Better)
Soule says, "Everything gets more difficult as the model gets larger." This challenge manifests in several business-critical ways, such as:
The Efficiency Advantage
IBM's approach with their Granite models focuses on efficiency through purpose-built smaller models. At just 2 to 8 billion parameters (compared to the largest industry models, such as Llama 3.1 from Meta, exceeding 400 billion parameters), these models deliver several key advantages, including:
The "Fit-for-Purpose" Approach
Rather than seeking a single "model to rule them all," business and IT leaders are working together to assemble a portfolio of AI solutions that incorporate foundational LLMs with specialized AI models to support different business needs. As Soule notes, "To get value and to be able to deploy AI cost-effectively... you need to consider having fit-for-purpose models."
The fit-for-purpose approach allows businesses to optimize performance and cost based on the specific requirements of each use case. A larger model may be the best solution for high-value, complex tasks. For routine operations, a smaller specialized model often delivers comparable results at a fraction of the cost.
Soule shared that IBM's Granite models embody this philosophy with their modular design. Instead of trying to create a single massive model for all tasks, IBM has developed distinct models for different enterprise functions. For example, IBM’s Granite solutions offer the models for use cases such as coding, time-series forecasting, security and language models for designed for agentic workflows, RAG etc.
领英推荐
Despite their efficient design, IBM shared that Granite models don't sacrifice performance for cost. According to IBM’s benchmarks, Granite outperforms comparable models across various enterprise tasks, achieving high scores on Hugging Face's RAGBench Leaderboard. This targeted approach ensures organizations can select the right tools for specific business challenges without unnecessary computational overhead.
Multimodal Models: Expanding AI's Capabilities
Multimodal AI models can simultaneously process and interpret multiple types of data inputs—such as text, images, audio, and video. Unlike unimodal models that work with only one data type (typically text), multimodal models can understand the relationships between different forms of information, similar to how humans process the world through multiple senses.
IBM's Granite 3.2 vision models offer a practical application of multimodal capabilities in an enterprise context. Rather than focusing on image generation (creating pictures from text prompts), these models specialize in image understanding—extracting valuable information from visual content. At just 2 billion parameters, these specialized vision models can:
Making Strategic AI Decisions: The Performance-Cost Matrix
As AI technology evolves, organizations must balance cost, accuracy, and safety for model use. Soule shared at least four guidelines for evaluating what models to use within the enterprise, that include:
For example, Soule discussed how different models may offer various levels of transparency and governance. One distinguishing feature of IBM Granite models is that IBM publishes detailed information about their training datasets and methodologies, allowing enterprises to understand what's "under the hood" of these models. The Granite ecosystem includes risk and harm detection capabilities, transparency tools, and IP protection. Model transparency allows IBM to provide indemnification for Granite models, offering businesses additional protection when deploying these AI solutions.
The Shift is Underway
The "bigger is better" paradigm is giving way to a more nuanced approach to enterprise AI. Business leaders can achieve comparable performance by strategically implementing smaller, specialized models for appropriate use cases while significantly reducing costs and complexity. IBM's Granite models are one example of this approach. We’ve also seen the large foundation model providers, such as Open.AI and Meta, support smaller models.
Looking ahead, we're moving toward more flexible AI deployment models where businesses can dynamically allocate resources based on task importance. This could mean using models of various sizes for different tasks or enabling features like "reasoning" when demands such as accuracy justify the additional cost and latency. As AI becomes further integrated into business operations, this efficiency-focused approach will be increasingly critical for sustainable AI adoption and competitive advantage.
You can subscribe to our video channel here https://www.youtube.com/@AIwithMaribelLopez and our podcast here https://bit.ly/3R0Etal.
Chief Marketing Officer, Lenovo ISG
1 周This is fantastic Maribel!
Should have Played Quidditch for England
1 周Thanks for this Maribel Lopez fascinating article
Great insights ?? thanks for sharing Maribel Lopez!
Principal @ APS Marketing, Inc. | Revenue-Driven Product Marketing, Multi-Channel Sales Engagement and Executive Leadership Coaching | 30+ years non-profit Board Experience
2 周Love the detail and timliness. Tx.