Small Language Models

Small Language Models

Open AI's GPT-4o is a 1 trillion parameter LLM, so does Google's Gemini Pro. In In comparison, Microsoft's Phi-3 is a 3.8B parameter. It is open source, can run locally on devices like cellphone and does not need internet to run. Exactly what a medical, law or many businesses that have to comply with strict data regulations require. Google's latest Pixel phone has Gemini nano, a small language model that does edge computing locally. The small language models can run on 8GM RAM and can generate text at reasonable speed on regular CPU. Last week Apple released OpenELM(open sourced efficient language models) that are small enough to run on a smartphone. With better curated data-set used for pre training and more and more research to enhanced transformer architecture, small language models will only get better.

Tech companies are making smaller AI models to target a wider range of customers, with lower costs and less computing power required. These small models are easier to run on devices and can keep data private. While large language models are still being developed, these smaller ones are seen as a way to get more businesses to adopt AI technology.

Benefits of small models:

  • Lower cost and power consumption: Businesses don't need expensive hardware to run AI.
  • Privacy focus: Small models can process tasks locally on devices, keeping data internal.
  • Accessibility: Enables AI features on mobile devices like smartphones.

Large models still have a place: OpenAI remains committed to developing large models with advanced capabilities like reasoning and planning.

The future of AI: Both large and small models will co-exist, catering to different needs and purposes.


Lalit Bansal

Associate Vice President and Digital Practice Head (CMT) - Digital Transformation, Portfolio Management, SDLC, Business Strategy, GCC Set-Ups, Global IT Service Delivery, Customer Success & Revenue Growth.

9 个月

OpenELM offer a promising solution for running AI tasks directly on devices. Hope it will not heat the mobile device and drain battery fast.

要查看或添加评论,请登录

Jwalant Mehta的更多文章

  • DEI

    DEI

    Google, Meta, Accenture, Amazon, Walmart, BT, Target, GM, Pepsi, Disney, Intel, Ford and many more global corporations…

    3 条评论
  • Digital Immortality Vision

    Digital Immortality Vision

    The concept of preserving and accessing the wisdom of individuals long after they are gone is a captivating vision for…

    5 条评论
  • Regulate AI's deception

    Regulate AI's deception

    Social Media and Gen AI are driven by profit focused incentives and endanger society and need stricter regulations…

    1 条评论
  • Productivity Improvement from Generative AI

    Productivity Improvement from Generative AI

    A new study by Stanford found that generative AI assistants can significantly boost agent productivity in call centers.…

    1 条评论
  • Meta's TestGen-LLM: A Leap Forward in Software Testing

    Meta's TestGen-LLM: A Leap Forward in Software Testing

    Meta has made a significant breakthrough with TestGen-LLM, a tool that automatically improves existing human authored…

    1 条评论

社区洞察

其他会员也浏览了