LLMs on CPU? The 1-bit framework is a Masterpiece
100 seconds, packed with extraordinary tech advancements.
Now you can run LLMs with up to 100B parameters locally on your CPU (no GPUs!) at 5-7 tokens per second, roughly human reading speed.
Microsoft has open-sourced BitNet, an ultra-efficient LLM framework that delivers groundbreaking performance by drastically reducing how many bits each model weight needs.
It's slightly technical, but the concept isn't hard to grasp. Let me explain.
Traditional approaches use at least 16 bits to store each trained parameter, which is a decimal number such as 6.5.
Staying with the example, a 16-bit float stores 6.5 as 1 bit for the sign (+/-), 5 bits for the exponent (order of magnitude), and 10 bits for the mantissa (the number's significant digits). 16 bits in total.
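If you're curious, here is a tiny NumPy snippet (my own illustration, not from the BitNet repo) that shows exactly how 6.5 is laid out across those 16 bits:

```python
import numpy as np

# Reinterpret the raw 16 bits of the half-precision value 6.5.
bits = int(np.float16(6.5).view(np.uint16))
print(f"{bits:016b}")  # -> 0100011010000000

# sign     = 0          (positive)
# exponent = 10001      (17 in binary, meaning 2^(17-15) = 2^2 = 4)
# mantissa = 1010000000 (with the implied leading 1: 1.625)
# value    = 1.625 * 4  = 6.5
```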
BitNet, the "1-bit" framework, is radically simpler: it limits each trained parameter to just three values: -1, 0, or +1, which takes roughly 1.58 bits (log2 of 3) per weight instead of 16. No other values are possible. There is no 6.5.
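To make that concrete, here is a minimal NumPy sketch of the idea, based on the "absmean" rounding described in the BitNet b1.58 paper (my illustration, not Microsoft's code):

```python
import numpy as np

def ternarize(W: np.ndarray) -> tuple[np.ndarray, float]:
    """Map full-precision weights to {-1, 0, +1} plus a single scale factor."""
    scale = np.mean(np.abs(W)) + 1e-8                     # "absmean" scale
    W_ternary = np.clip(np.round(W / scale), -1, 1).astype(np.int8)
    return W_ternary, scale

W = np.random.randn(4, 4).astype(np.float32)
W_t, s = ternarize(W)
print(W_t)       # only -1, 0, and 1 appear
print(W_t * s)   # rough reconstruction of the original weights
```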
Why would they do this?
At the 70B scale, BitNet runs 4.1 times faster and delivers 8.9 times the throughput of a comparable model using the 16-bit representation. Period. Basta.
Because the weights are only -1, 0, or +1, matrix multiplications collapse into simple integer additions and subtractions instead of floating-point multiplications, so BitNet is not only faster but also requires significantly less memory.
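Here's a tiny, purely illustrative sketch of why that works: with ternary weights, a dot product becomes "add the inputs where the weight is +1, subtract where it is -1, skip the zeros":

```python
import numpy as np

def ternary_matvec(W_t: np.ndarray, x: np.ndarray) -> np.ndarray:
    """Matrix-vector product with {-1, 0, +1} weights, using no multiplications."""
    out = np.zeros(W_t.shape[0], dtype=x.dtype)
    for i, row in enumerate(W_t):
        out[i] = x[row == 1].sum() - x[row == -1].sum()
    return out

W_t = np.array([[1, 0, -1], [0, 1, 1]], dtype=np.int8)
x = np.array([2.0, 3.0, 5.0], dtype=np.float32)
print(ternary_matvec(W_t, x))  # [-3.  8.]
print(W_t @ x)                 # same result via a normal matmul
```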
It is genius!
And this is huge because …
… with this, we can do more in less time, which is vital for AI's progress. Look at reasoning models such as o1 (Claude 3.5 Opus might be one as well). You want them to process thousands of ideas in under a second. Perfect!
Further, this means lower energy usage and reduced infrastructure costs. That is especially helpful for edge and mobile devices, where resources are limited, as well as for real-time applications like OpenAI's Realtime API. (I wrote about it and how you can apply it.)
How do you get LLMs running on your laptop with 1-bit BitNet? Here is the GitHub repo plus the steps to do it.
If you absolutely want me to do a video on how to implement it, let me know in the comments or reply to this email.
Be an everyday genius
Learning a little every day can have a huge impact—especially if you're learning on Brilliant. Explore thousands of bite-sized, interactive lessons on everything from math and data analysis to programming, AI, and beyond.
xAI’s Grok 2 (the AI model) now has vision capabilities that are incredibly detailed (my demo)
I have worked with Anthropic, Google, OpenAI, and other top-notch AI companies, but xAI’s vision capabilities are unparalleled. Look for yourself in my demo below.
Latest updates on humanoid robots
Clone has built Torso, an upper-body robot actuated solely by artificial muscles. With that, robot anatomy is getting closer to human biology.
Finally, a humanoid robot with a natural, human-like...
Read the full newsletter episode here:
That’s a wrap! I hope you enjoyed it.
Martin