登录查看更多内容

Stochastic Rounding

Weiming Li

Machine Learning Signal Processing | MLSP.ai

发布日期: 2024年12月12日

When comes to digital signal, NN has the same liking as our ears.

Rounding a number is a very common operation in DSP, in ML it has a specially useful purpose which is to reduce the parameter precision, thereby increase the computation throughput. To optimize for this purpose, various rounding techniques are employed and one popular one is Stochastic Rounding (SR). When applying it during lowering parameter precision, NN model’s performance could be largely maintained, so same performance for much less computation, great!

The concept of SR is an interesting one, most noticeably, individual rounding result is no longer deterministic, but probabilistic. Take rounding 0.3 to nearest integer as example, conventional round-to-nearest (RTN) method would yield 0, 100% of the time. In contrast, the result SR method produces would has 70% probability of being 0 and 30% being 1. Two key points here:

Even 0.3 has a decent chance (30%) being rounded to 1
When run SR with input 0.3 many times, the mean value of the output should be 0.3

“Interesting yet makes good sense”, was my feeling when I first came across it. As you can extrapolate, with SR 0.5 would yield a result with 50% chance being 0 and 50% chance being 1. Following are two plots illustrate RTN’s and (a sample of) SR’s output over input range [0, 1.0] .

领英推荐

The Enchantress of Numbers: Ada Lovelace and Charles…

AI for Good 6 个月前

Now for a little criticism from an LLM

Steven Michalove, CISSP, CISM 2 个月前

RESPONDING THIS THOUGHT PROVOCKING PICTURE...

JMV Josune Moneo Viloria 3 年前

On the SR plot, input 0~0.3 mostly yield 0 with a small chance of jumping to 1, and vice versa for input above 0.6. The middle range has even probability between 0 or 1. In fact, there is a even simpler operation mode of SR, it is let’s not even worry about the fractional part, just discard it and round up or down randomly :-o

Yes, even pure random can be considered as rounding.

Finally, let’s try SR on a real signal and see what it looks like, reducing number precision from float to int8. Again, side by side with RTN for comparison.

Those with music engineering background might already recognize something familiar, SR is just like dithering! A technique employed to make recording more pleasant to our ears when convert to lower resolution.

When comes to digital signal, NN has the same liking as our ears????

Tim Hunt

Retired

3 个月

Interesting - seems akin to dithering in digital audio, making better use of the available quantization levels

要查看或添加评论，请登录

Weiming Li的更多文章

free trial: integrate NN processing in MCU with 2 lines of C code

2025年3月10日

free trial: integrate NN processing in MCU with 2 lines of C code

Trying is believing. In this post, I would enable everyone to be able to try bringing my example NN processing into…
Ray Tracing for sound, the holy grail for data generation?

2025年2月25日

Ray Tracing for sound, the holy grail for data generation?

Ray Tracing (RT) should be a very familiar term in 3D gaming, but what might be less known is its application in…
from minimize error to raise quality

2025年2月18日

from minimize error to raise quality

In this post, I am going to share the finding (and audio samples) of applying perceptual quality as training target for…
Looking forward to Cortex-M55 + Ethos-U55

2025年2月10日

Looking forward to Cortex-M55 + Ethos-U55

The 50x inference speed up and 25x efficiency jump are very exciting, but what I really look forward to is how it could…
SVDF, just give Conv a bit of time

2025年1月19日

SVDF, just give Conv a bit of time

Simply add a dimension of time to standard Conv layer, it becomes the SVDF layer, the core component powering our…
Peek into the future

2025年1月13日

Peek into the future

The Devil is in the details, a often hidden small detail that we must not miss when interpreting performance figures…
Tiny model for tiny system

2025年1月6日

Tiny model for tiny system

Large model shows us the limitless perspective of what’s possible, but model doesn’t have to be big to do amazing…

6 条评论
build trust with black box

2024年12月29日

build trust with black box

Putting a black box in a product requires courage, a few ways to turn some of the courage into confidence. A NN model…
from batch to streaming

2024年12月19日

from batch to streaming

Unexpected complication I wish I were well aware of from the beginning. If you coming from a conventional DSP…
Fuzzy Memory

2024年12月16日

Fuzzy Memory

I don’t mean the kind we have after a hangover, but the kind powering some of the greatest models we know. “But do I…

See all articles

Stochastic Rounding

Weiming Li

Machine Learning Signal Processing | MLSP.ai

领英推荐

Weiming Li的更多文章

社区洞察

其他会员也浏览了

RESPONDING THIS THOUGHT PROVOCKING PICTURE...

Transduction – leading transformation – Issue #74

Introduction to AWGN (Additive White Gaussian Noise)

The Logical Framework of HU - Topology

Outperforming the Traditional Black-Scholes Model: Quantum Arithmetic in Option Pricing:

Reasoning from first principles

As You Watch The British Open ... Say Thank You To DCT

477: One Thousand New Instructions with Kwabena Agyeman

Still time to submit to ICAIF'24

Laws of nature dictate that measuring developer productivity should be hard

领英推荐

Weiming Li的更多文章

free trial: integrate NN processing in MCU with 2 lines of C code

Ray Tracing for sound, the holy grail for data generation?

from minimize error to raise quality

Looking forward to Cortex-M55 + Ethos-U55

SVDF, just give Conv a bit of time

Peek into the future

Tiny model for tiny system

build trust with black box

from batch to streaming

Fuzzy Memory

社区洞察

其他会员也浏览了

RESPONDING THIS THOUGHT PROVOCKING PICTURE...

Transduction – leading transformation – Issue #74

Introduction to AWGN (Additive White Gaussian Noise)

The Logical Framework of HU - Topology

Outperforming the Traditional Black-Scholes Model: Quantum Arithmetic in Option Pricing:

Reasoning from first principles

As You Watch The British Open ... Say Thank You To DCT

477: One Thousand New Instructions with Kwabena Agyeman

Still time to submit to ICAIF'24

Laws of nature dictate that measuring developer productivity should be hard