April 7x7

When light-weighting AI models for on-device deployment and production-grade use, ensuring model accuracy post-compression is vital. However, strong compression results do not necessarily translate into comparable performance in production, because model accuracy can change when the model is compiled for hardware deployment.

When light-weighting AI models to production-grade quality, what should be considered so that post-compression model performance stays consistent in a production environment?
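One practical answer is to validate the deployed artifact against the original model on the same inputs, not just to trust the pre-compilation accuracy numbers. The sketch below is a hypothetical, framework-agnostic consistency check; the function names, tolerance, and sample logits are illustrative assumptions, not part of any specific toolchain.

```python
# Hypothetical sanity check: compare reference-model outputs against the
# compiled/deployed model's outputs on identical validation inputs.

def max_abs_diff(reference_outputs, deployed_outputs):
    """Largest element-wise deviation between two output vectors."""
    return max(abs(r - d) for r, d in zip(reference_outputs, deployed_outputs))

def outputs_consistent(reference_outputs, deployed_outputs, tol=0.05):
    """True if the deployed model stays within `tol` of the reference."""
    return max_abs_diff(reference_outputs, deployed_outputs) <= tol

# Illustrative logits: small numeric drift after compilation is acceptable,
# but a large divergence signals that accuracy may not survive deployment.
reference = [0.9, 0.1]
deployed = [0.88, 0.12]
ok = outputs_consistent(reference, deployed)
```

In practice this check would run over a held-out validation set, per layer or per output head, before the compiled model ships.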


Updates

CLIKA has been accepted into the Google for Startups AI First Accelerator!

Read more about it here!


“Recently, there has been a shift toward greater openness, particularly regarding the carbon costs of training AI models. However, disclosure of the environmental costs associated with inference—a potentially more significant concern—remains insufficient” - AI Index Report 2024, Stanford University

AI Trends

High-performing models with relatively fewer parameters are here! But can they run on your local device within a compute budget?

This month, Meta released a new version of the Llama model: Llama 3. Available in two sizes (8B and 70B), it comes with not only a new extended tokenizer but also a commercially permissive license.

What’s impressive about Llama 3 is that the 70B model significantly outperforms GPT-3.5 (score: 70) on the MMLU benchmark despite having roughly 2.5 times fewer total parameters, while also consistently outperforming other state-of-the-art models in its parameter range.

To run these models on a personal device, however, you would still need to quantize the weights and activations to a lower precision to reduce memory requirements, and quantizing without sacrificing model performance is quite a challenge.
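To make the trade-off concrete, here is a minimal sketch of symmetric int8 weight quantization in plain Python. It is illustrative only (real toolchains quantize per layer or per channel, with calibration data for activations); the function names and sample weights are assumptions for the example.

```python
# Minimal sketch of symmetric int8 quantization: one scale per tensor,
# values mapped into [-127, 127], so float32 storage shrinks 4x.

def quantize_int8(weights):
    """Map float weights to int8 using a single symmetric scale."""
    scale = max(abs(w) for w in weights) / 127.0
    return [round(w / scale) for w in weights], scale

def dequantize(q_weights, scale):
    """Recover approximate float weights from the int8 values."""
    return [q * scale for q in q_weights]

weights = [0.42, -1.27, 0.08, 0.95]
q, scale = quantize_int8(weights)
recovered = dequantize(q, scale)

# The price of 4x smaller storage is rounding error, bounded by scale / 2.
error = max(abs(w - r) for w, r in zip(weights, recovered))
```

The per-weight error stays below half the scale here, but at model scale those small errors accumulate, which is why naive post-training quantization can degrade accuracy and why careful compression methods matter.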

See how CLIKA automatically compresses models for resource-constrained environments without compromising performance.


See what else is up in this space:

  1. [Unite.AI] Top 10 Takeaways from Stanford's 2024 AI Index Report
  2. [Meta] Introducing Meta Llama 3: The most capable openly available LLM to date
  3. [Mistral AI] Cheaper, Better, Faster, Stronger
  4. [Business Standard] Qualcomm unveils Snapdragon X Plus chip for PCs with on-device AI: Details
  5. [VentureBeat] Apple releases OpenELM: small, open source AI models designed to run on-device
  6. [Intel] Intel Builds World’s Largest Neuromorphic System to Enable More Sustainable AI
  7. [ML Commons] Announcing MLCommons AI Safety v0.5 Proof of Concept


Food for Thought: Responsible AI & Benchmarks

Pursuing responsible AI at all times is imperative for creating a sustainable and ethical AI ecosystem. At the core of this pursuit should be a commitment to the integrity of benchmark results: inflating or fabricating them has real-world implications for decision-making, fairness, and trust in AI systems. This should be a collective, shared responsibility across every industry creating or using AI, in both the public and private sectors.
