登录查看更多内容

Small Language Models

Jwalant Mehta

AVP, Head-DevOps, AI, SDET Quality Engineering, Global Financials portfolio

发布日期: 2024年5月27日

Open AI's GPT-4o is a 1 trillion parameter LLM, so does Google's Gemini Pro. In In comparison, Microsoft's Phi-3 is a 3.8B parameter. It is open source, can run locally on devices like cellphone and does not need internet to run. Exactly what a medical, law or many businesses that have to comply with strict data regulations require. Google's latest Pixel phone has Gemini nano, a small language model that does edge computing locally. The small language models can run on 8GM RAM and can generate text at reasonable speed on regular CPU. Last week Apple released OpenELM(open sourced efficient language models) that are small enough to run on a smartphone. With better curated data-set used for pre training and more and more research to enhanced transformer architecture, small language models will only get better.

Tech companies are making smaller AI models to target a wider range of customers, with lower costs and less computing power required. These small models are easier to run on devices and can keep data private. While large language models are still being developed, these smaller ones are seen as a way to get more businesses to adopt AI technology.

Benefits of small models:

Lower cost and power consumption: Businesses don't need expensive hardware to run AI.
Privacy focus: Small models can process tasks locally on devices, keeping data internal.
Accessibility: Enables AI features on mobile devices like smartphones.

Large models still have a place: OpenAI remains committed to developing large models with advanced capabilities like reasoning and planning.

The future of AI: Both large and small models will co-exist, catering to different needs and purposes.

Lalit Bansal

Associate Vice President and Digital Practice Head (CMT) - Digital Transformation, Portfolio Management, SDLC, Business Strategy, GCC Set-Ups, Global IT Service Delivery, Customer Success & Revenue Growth.

9 个月

OpenELM offer a promising solution for running AI tasks directly on devices. Hope it will not heat the mobile device and drain battery fast.

1 次回应

要查看或添加评论，请登录

Jwalant Mehta的更多文章

DEI

2025年2月9日

DEI

Google, Meta, Accenture, Amazon, Walmart, BT, Target, GM, Pepsi, Disney, Intel, Ford and many more global corporations…

3 条评论
Digital Immortality Vision

2024年8月26日

Digital Immortality Vision

The concept of preserving and accessing the wisdom of individuals long after they are gone is a captivating vision for…

5 条评论
Regulate AI's deception

2024年6月9日

Regulate AI's deception

Social Media and Gen AI are driven by profit focused incentives and endanger society and need stricter regulations…

1 条评论
Productivity Improvement from Generative AI

2024年4月6日

Productivity Improvement from Generative AI

A new study by Stanford found that generative AI assistants can significantly boost agent productivity in call centers.…

1 条评论
Meta's TestGen-LLM: A Leap Forward in Software Testing

2024年3月8日

Meta's TestGen-LLM: A Leap Forward in Software Testing

Meta has made a significant breakthrough with TestGen-LLM, a tool that automatically improves existing human authored…

1 条评论

See all articles

Small Language Models

Jwalant Mehta

AVP, Head-DevOps, AI, SDET Quality Engineering, Global Financials portfolio

Jwalant Mehta的更多文章

社区洞察

其他会员也浏览了

Gen AI for Business #3

Large Language Models (LLMs) and Inference: The Role of Data Centers and Colocation in AI

Microsoft Introduces DroidSpeak: A New Language for Faster AI Communication

Artificial Intelligence Trends in 2024 - Zoptal

GPT4 passed the Turing Test?

What is MAI-1? A Deep-Dive into Microsoft's GPT-4 Rival

Artificial Intelligence #186

Artificial Intelligence #186

AI January, IA-ismo Shines in its First Year: An Innovative Journey in the World of AI ??

Trends in LLMs - QLORA: Efficient Finetuning of Quantized LLMs

Jwalant Mehta的更多文章

DEI

Digital Immortality Vision

Regulate AI's deception

Productivity Improvement from Generative AI

Meta's TestGen-LLM: A Leap Forward in Software Testing

社区洞察

其他会员也浏览了

Gen AI for Business #3

Large Language Models (LLMs) and Inference: The Role of Data Centers and Colocation in AI

Microsoft Introduces DroidSpeak: A New Language for Faster AI Communication

Artificial Intelligence Trends in 2024 - Zoptal

GPT4 passed the Turing Test?

What is MAI-1? A Deep-Dive into Microsoft's GPT-4 Rival

Artificial Intelligence #186

Artificial Intelligence #186

AI January, IA-ismo Shines in its First Year: An Innovative Journey in the World of AI ??

Trends in LLMs - QLORA: Efficient Finetuning of Quantized LLMs