The Backpropagation Algorithm!
Damien Benveniste, PhD
Founder @ TheAiEdge | Follow me to learn about Machine Learning Engineering, Machine Learning System Design, MLOps, and the latest techniques and news about the field.
The backpropagation algorithm is the heart of deep learning! It is the core reason we can train advanced models like LLMs.
In a previous video, we saw that we can use the computational graph built as part of a deep learning model to compute the derivatives of the network outputs with respect to the network inputs. Now we are going to see how we can use this computational graph to get the network to learn from the data by using the backpropagation algorithm. Let's get into it!
Watch the video for the full content!
The goal of a neural network is to generate an estimate of the target we are trying to predict. We use a loss function to compare the target to its estimate. The optimization problem is about minimizing the loss function.
Typical loss functions are the log-loss and the mean squared error loss:
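One standard way to write them, assuming N training samples with targets y_i and network estimates ŷ_i (notation chosen here for illustration):

```latex
% Log-loss (binary cross-entropy)
L = -\frac{1}{N}\sum_{i=1}^{N}\Big[\, y_i \log \hat{y}_i + (1 - y_i)\log(1 - \hat{y}_i) \,\Big]

% Mean squared error
L = \frac{1}{N}\sum_{i=1}^{N}\big( y_i - \hat{y}_i \big)^2
```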
To minimize the loss function, we take the gradient of the loss function with respect to the network parameters and find the zeros of the resulting function.
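Writing all the network parameters as a single vector θ (a notational convenience, not from the post itself), this amounts to solving:

```latex
\nabla_{\theta} L(\theta) = 0
```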
But solving this equation can be very hard, so instead we use optimization techniques like the gradient descent algorithm. We update the parameters by taking a step along the negative gradient, the direction in which the loss function decreases.
Until we reach a local minimum of the loss function.
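With a learning rate η (the symbol is an assumption here), one gradient descent update reads:

```latex
\theta \leftarrow \theta - \eta \, \nabla_{\theta} L(\theta)
```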
To apply the gradient descent algorithm, we are going to use the computational graph. We first evaluate the graph in the forward pass.
Then, in the backward pass, we back-propagate the gradient of the loss function through each computational block and node in the graph by using the chain rule.
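Here is a minimal sketch of these two passes on a toy graph, y_hat = w * x + b with a squared-error loss; all the names and values are illustrative, not from the post:

```python
# Training example and initial parameters (illustrative values)
x, y = 2.0, 3.0      # input and target
w, b = 0.5, 0.0      # parameters to learn

# ---- Forward pass: evaluate each node of the graph ----
z = w * x                 # linear node
y_hat = z + b             # output node (estimate of the target)
loss = (y_hat - y) ** 2   # squared error for a single sample

# ---- Backward pass: apply the chain rule node by node ----
dloss_dyhat = 2 * (y_hat - y)   # d loss / d y_hat
dyhat_dz = 1.0                   # d y_hat / d z
dyhat_db = 1.0                   # d y_hat / d b
dz_dw = x                        # d z / d w

dloss_dw = dloss_dyhat * dyhat_dz * dz_dw   # chain rule through z
dloss_db = dloss_dyhat * dyhat_db           # chain rule for the bias

print(f"loss = {loss:.4f}, dL/dw = {dloss_dw:.4f}, dL/db = {dloss_db:.4f}")
```

Deep learning frameworks automate exactly this node-by-node bookkeeping, just over much larger graphs.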
Now that we have the gradients of the loss function with respect to all the parameters in the graph, we can apply one step of the gradient descent algorithm.
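A minimal, self-contained sketch of the full loop, repeating the forward pass, the backward pass, and one gradient descent update per iteration (the learning rate and initial values are arbitrary):

```python
x, y = 2.0, 3.0   # training example
w, b = 0.5, 0.0   # initial parameters
lr = 0.05         # learning rate (illustrative choice)

for step in range(20):
    # Forward pass
    y_hat = w * x + b
    loss = (y_hat - y) ** 2

    # Backward pass (chain rule)
    dloss_dw = 2 * (y_hat - y) * x
    dloss_db = 2 * (y_hat - y)

    # One step of gradient descent
    w -= lr * dloss_dw
    b -= lr * dloss_db

print(f"final loss = {loss:.6f}, w = {w:.3f}, b = {b:.3f}")
```

For these particular values, the prediction error shrinks by half at every step, so the loss quickly approaches its minimum.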