build trust with a black box

Putting a black box in a product requires courage. Here are a few ways to turn some of that courage into confidence.

An NN model is pretty much a black box: feed it input and it produces an output, with barely any visibility into what is going on inside. Sometimes this makes us nervous.

We could dissect a model to try to make sense of it, as my earlier post “from FFT to SVD” did. But that approach is not very practical, and it likely requires the model to be architected in interpretable stages, which can be a limitation.


There are more practical ways to build trust with a black box.

The old-fashioned testing

Still the sharpest tool in the toolbox: stress testing, corner cases, and so on. We are all familiar with this one (a quick sketch follows), so the rest of this post focuses on the other two.
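
For completeness, here is a minimal corner-case harness. The interface is hypothetical: any callable that maps one audio frame to another frame; the specific inputs and limits are just illustrative.

    import numpy as np

    def corner_cases(frame_len=512):
        """A few classic stress inputs for a frame-in/frame-out audio model."""
        silence = np.zeros(frame_len, dtype=np.float32)
        impulse = silence.copy()
        impulse[0] = 1.0
        full_scale = np.ones(frame_len, dtype=np.float32)
        noise = 0.1 * np.random.default_rng(0).standard_normal(frame_len).astype(np.float32)
        return {"silence": silence, "impulse": impulse,
                "full-scale": full_scale, "white noise": noise}

    def stress_test(model, frame_len=512):
        """model: any callable mapping one float32 frame to one frame."""
        for name, frame in corner_cases(frame_len).items():
            out = model(frame)
            assert np.isfinite(out).all(), f"{name}: NaN/Inf in output"
            assert np.abs(out).max() <= 1.0, f"{name}: output out of range"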


Understand how the model is trained

Model training is essentially an optimization process, and the optimization algorithm is incredibly efficient. If we assume the training process fully digests all the training material, then the model is only as good (or as bad) as that material. Looking deep into the data and the code is a key step toward gaining confidence, not just for correctness but also for comprehensiveness.

  • Mistakes in data or code. The high tolerance to noise and error that characterizes NNs makes mistakes harder to spot from the output. With a conventional algorithm, a mistake usually produces an unexpected result (because we know what to expect); in an NN, a mistake often just causes sub-optimal performance. In a development scenario where we don't yet know what performance level to expect, the NN output gives no hint that anything is wrong (the first sketch after this list shows such a silent bug).
  • Coverage of the conditions the model will be exposed to in practice. A good example: a keyword recognition model trained on the popular datasets works really well on recorded material, yet performs miserably in practice, simply because users don't speak right in front of the device (the second sketch below shows a simple remedy).
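
To make the first point concrete, here is a minimal sketch (the loader and names are made up for illustration) of the kind of silent data bug an NN will happily train through, plus the cheap manual audit that exposes it:

    import numpy as np

    def buggy_batch(waveforms, labels):
        """A plausible slip: waveforms and labels shifted off by one.
        Training still converges; accuracy is just quietly worse."""
        return waveforms[1:], labels[:-1]  # misaligned (input, label) pairs

    def audit(waveforms, labels, class_names, n=5):
        """Cheap audit: spot-check random (input, label) pairs by hand.
        For audio, 'look at the data' means listen and read the label."""
        rng = np.random.default_rng(0)
        for i in rng.choice(len(labels), size=n, replace=False):
            print(f"sample {i}: label = {class_names[labels[i]]}")
            # play or plot waveforms[i] here and verify it matches the label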
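
For the coverage point, a common remedy is to augment clean recordings so they resemble far-field use: convolve with a room impulse response (RIR), then mix in noise at a target SNR. A minimal sketch, assuming RIR and noise recordings are on hand:

    import numpy as np
    from scipy.signal import fftconvolve

    def far_field(clean, rir, noise, snr_db=10.0):
        """Simulate a user across the room instead of right at the microphone."""
        reverbed = fftconvolve(clean, rir)[: len(clean)]  # add room reverberation
        # Scale the noise so the mixture hits the requested SNR.
        sig_pow = np.mean(reverbed ** 2)
        noise = noise[: len(reverbed)]
        noise_pow = np.mean(noise ** 2) + 1e-12
        gain = np.sqrt(sig_pow / (noise_pow * 10.0 ** (snr_db / 10.0)))
        return reverbed + gain * noise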


Choose the right model architecture

Model architecture is not just a performance consideration; it can add confidence too. Take the following two as examples: an architecture that regenerates the output frame directly, and a mask-based one.

The direct-regeneration architecture has total freedom to create a new frame, so it has the potential to correct corrupted information; but it also has the potential to produce unwanted content, which takes more effort to verify.

The mask-based architecture instead puts all of its thinking into producing a mask. If the mask is constrained to the range [0, 1], we know for sure the model cannot add anything to the original input; it can only remove unwanted components. More peace of mind, but less room for creativity.
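
As a minimal PyTorch sketch of the two styles (layer sizes and the spectral-frame interface are made up for illustration; a real model would be more elaborate):

    import torch
    import torch.nn as nn

    class DirectModel(nn.Module):
        """Regenerates the output frame from scratch: unconstrained output."""
        def __init__(self, n_bins=257):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(n_bins, 256), nn.ReLU(), nn.Linear(256, n_bins))

        def forward(self, spec):
            return self.net(spec)  # free to invent content that was never there

    class MaskModel(nn.Module):
        """Predicts a [0, 1] mask: can only attenuate the input, never add to it."""
        def __init__(self, n_bins=257):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(n_bins, 256), nn.ReLU(), nn.Linear(256, n_bins))

        def forward(self, spec):
            mask = torch.sigmoid(self.net(spec))  # sigmoid bounds the mask to (0, 1)
            return mask * spec  # output is a per-bin scaled copy of the input

The guarantee comes from the output stage alone: whatever the network computes internally, a sigmoid-bounded mask times the input can never exceed the input's magnitude in any bin.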
