Backpropagation: The blame game

Today, I recorded a very important lecture in the “Neural Networks from Scratch” series.

It is called "Backpropagation from scratch - on a single neuron".

When most students think of backpropagation, they have only a vague picture in mind:

They know it involves gradients, partial derivatives, and the chain rule, but not much more than that.

However, more than 90% of ML students are unable to write backpropagation code from scratch.

Here is a snippet of my whiteboard as I was making this lecture:


Once you write it on the whiteboard and understand it, you realise that backpropagation is actually simple.

It is a cool trick which relies on the chain rule of calculus.


I think of backpropagation as a way to calculate how much each weight and bias should be blamed for the loss.

If you think of backpropagation as passing the blame, it becomes very intuitive to understand.
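To make the "blame" picture concrete, here is a minimal Python sketch of one training step on a single neuron. It assumes a sigmoid activation and a squared-error loss purely for illustration (the lecture may use a different setup); each line of the backward pass is one link of the chain rule, assigning blame first to the output, then to the pre-activation, and finally to the weight and bias.

```python
import math

def single_neuron_step(x, y, w, b, lr=0.1):
    """One forward + backward pass on a single sigmoid neuron.

    Squared-error loss L = (a - y)^2 is assumed here for illustration.
    """
    # Forward pass: pre-activation, activation, loss
    z = w * x + b
    a = 1.0 / (1.0 + math.exp(-z))       # sigmoid(z)
    loss = (a - y) ** 2

    # Backward pass: chain rule, one local derivative at a time
    dL_da = 2.0 * (a - y)                # how much the output is to blame
    da_dz = a * (1.0 - a)                # sigmoid'(z)
    dL_dz = dL_da * da_dz                # blame passed back to the pre-activation
    dL_dw = dL_dz * x                    # blame assigned to the weight
    dL_db = dL_dz * 1.0                  # blame assigned to the bias

    # Gradient descent update
    w -= lr * dL_dw
    b -= lr * dL_db
    return w, b, loss
```

Running this in a loop on a single (x, y) pair steadily drives the loss down, which is a quick sanity check that the blame is being assigned correctly.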

Here is a video we recorded on the topic:

https://youtu.be/iE1lccrHfok?si=GD8vnYD8YY5xkWSP

Enjoy!

William Jardim


8 months ago

I have a question about the backpropagation algorithm in neural networks: In any implementation, when calculating the gradient of a unit in the hidden layer, do I always need to consider the gradients of all units in the next layer? Specifically, does this involve summing these errors? In other words, does the gradient of a unit in the hidden layer always depend on the errors of all units in the next layer? For instance, if my neural network has just 2 layers: a hidden layer with 3 units and an output layer with 2 units, to calculate the delta of each unit in the hidden layer, do I need to sum all errors in the next layer (in this case, the errors of the output layer)?
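For readers wondering about the question above: in a standard fully-connected network, yes, the delta of a hidden unit is the sum of the deltas of all units in the next layer it feeds into, each weighted by the connecting weight, and then scaled by that hidden unit's own activation derivative. Below is a minimal NumPy sketch for the 3-hidden / 2-output case described in the question; the sigmoid activation and the random placeholder values are assumptions made only for illustration.

```python
import numpy as np

# Example from the question: a hidden layer with 3 units feeding an
# output layer with 2 units. W_out[k, j] connects hidden unit j to
# output unit k, so W_out has shape (2, 3).
rng = np.random.default_rng(0)
W_out = rng.normal(size=(2, 3))
a_hidden = rng.uniform(size=3)      # hidden activations (sigmoid assumed)
delta_out = rng.normal(size=2)      # deltas already computed for the output layer

# Each hidden delta is the weighted sum of ALL next-layer deltas,
# scaled by the local activation derivative (sigmoid': a * (1 - a)).
delta_hidden = (W_out.T @ delta_out) * a_hidden * (1.0 - a_hidden)
print(delta_hidden)                 # one delta per hidden unit, shape (3,)
```

The transpose in W_out.T @ delta_out is what routes each output unit's blame back to every hidden unit that contributed to it.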
