#20) Section 4 of 5: How a Bunch of Numbers Can Actually Learn From Trial-and-Error: Gradient Descent
Mighty Friends, one simple question has always fascinated me: I understand how a child can learn by trial-and-error. But how does the box of silicon that we call a computer learn from trial-and-error? Today, we begin to answer that question.
As always, I humbly beseech thee to please open this link in a separate window so you can toggle back-and-forth between the code and my explanations of it (use Alt + Tab on Windows, or Command + Tab on Macs).
4.1) The Big Picture of Gradient Descent
What is the purpose of gradient descent? It is to find the best set of adjustments to our network of weights so that it gives a better prediction in the next iteration. In other words, certain values in the synapse matrices of our network need to be increased or decreased in order to give a better prediction next time. To adjust each of these values, we must answer two key questions:
1) In what direction do I adjust the number? Do I increase the value, or decrease it? Positive direction, or negative? and
2) By how much do I increase or decrease the number? A little, or a lot?
We will examine these two basic questions in great detail below. But if you want to visualize what gradient descent does, simply remember that "gradient" is just a fancy word for "slope." If you remember our curvy red bowl from Section 1, The Big Picture, gradient descent simply means calculating the optimal slope of the surface of that bowl to get that little white ball down to the bottom of the bowl as quickly and efficiently as possible. So keep that curvy red bowl in your mind.
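To make the "slope of the bowl" idea concrete before we dive into the network code, here is a minimal one-dimensional sketch (not part of our network's script, and the function `f(w) = (w - 3)**2` is just an invented stand-in for the bowl): the sign of the slope answers question 1 (which direction?), and its size answers question 2 (by how much?).

```python
# A toy "bowl": f(w) = (w - 3)**2, whose slope at any point w is 2 * (w - 3).
# The bottom of the bowl sits at w = 3.
def gradient_descent(start, learning_rate=0.1, steps=100):
    w = start
    for _ in range(steps):
        slope = 2 * (w - 3)            # sign tells us the direction to move
        w = w - learning_rate * slope  # magnitude tells us how big a step
    return w

print(round(gradient_descent(start=0.0), 4))  # rolls down to 3.0
```

Starting the ball at w = 0, each step nudges it toward the bottom at w = 3, taking big steps where the bowl is steep and tiny steps near the flat bottom.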
Our first step in gradient descent is to calculate how much our current prediction missed the actual truth, namely a 1/yes or a 0/no in y.
4.2) How Far Off is our Prediction when Compared to Survey Question #4?
Line 66
l2_error = y - l2
So, by how much did our first prediction miss the target of "Yes/1," the actual truth from survey question four that Customer One did indeed buy Litter Rip? Well, with Customer One (Row One of l0), we want to compare our l2 prediction to the y value of 1, since Customer One did indeed buy Litter Rip! When I say, "compare our l2 prediction," I mean we subtract the l2 probability from the y value, and the difference is our l2_error, or "how much we missed the target value y by."
So, big picture here again: our network took the input of each customer's response to the first three survey questions, and manipulated that data to come up with a prediction of whether that customer bought Litter Rip! or not. Because we have four customers, our network made four predictions. And you may recall that the 4x1 y vector contains the answers of four customers to question four, "Have you purchased Litter Rip!?" It contains four "0" or "1" values to which we want to compare the four predictions our network came up with.
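To see the subtraction in Line 66 with actual numbers, here is a hedged sketch: the four l2 values below are made up for illustration (your network's real predictions will differ), but the shapes match our script, a 4x1 y vector minus a 4x1 prediction vector.

```python
import numpy as np

# Actual answers to survey question four ("Have you purchased Litter Rip!?"):
y = np.array([[1], [1], [0], [0]])

# Four HYPOTHETICAL predictions from the network, one per customer:
l2 = np.array([[0.5], [0.9], [0.05], [0.7]])

# Line 66 of our script: how far each prediction missed the truth.
l2_error = y - l2
print(l2_error)
# [[ 0.5 ]
#  [ 0.1 ]
#  [-0.05]
#  [-0.7 ]]
```

Notice the signs: a positive error (Customer One, 0.5) means the prediction was too low and must be increased; a negative error (Customer Four, -0.7) means it was too high and must be decreased. That is question 1, direction, answered for free by simple subtraction.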
Once we know our l2_error (which of course is also a vector of four errors, one for each prediction), next we want to print that error, so we can eyeball our process in real time:
Print Error: Lines 72-73
Line 72 is a clever parlor trick to have the computer print out our l2_error every 10,000 iterations. It's helpful for us to envision the learning the network is doing if it "shows us its homework" every 10,000 iterations, so we can see its progress. The line, if (j % 10000) == 0: means, "If your iterator is at a number of iterations that, when divided by 10,000, leaves no remainder, then..." So, j % 10000 would have a remainder of 0 only six times: at 0 iterations, 10,000, 20,000, and so on up to 50,000 (j never reaches 60,000 inside the loop). So this print-out gives us a nice report on the progress of our network's learning.
The code + str(np.mean(np.abs(l2_error))) simplifies our print-out by taking the absolute value of each of the 4 errors, then averaging all 4 into one mean number and printing that.
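Put together, Lines 72-73 look like the sketch below. The l2_error values are hypothetical stand-ins (in the real script they are recomputed every iteration), but the print logic is the same:

```python
import numpy as np

# Hypothetical 4x1 error vector, one entry per customer:
l2_error = np.array([[0.5], [0.1], [-0.05], [-0.7]])

for j in range(60000):
    if (j % 10000) == 0:  # True at j = 0, 10000, ..., 50000: six print-outs
        # abs() of each of the four errors, then their mean, as one number:
        print("Error: " + str(np.mean(np.abs(l2_error))))
```

With these made-up errors, each report reads "Error: 0.3375", the mean of 0.5, 0.1, 0.05, and 0.7. In the real network this number shrinks from one report to the next, which is exactly the progress we want to eyeball.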
OK, so we now know how much our predictions about four customers (l2) missed the Actual Truth about who purchased Litter Rip! (y). And we've printed that.
But of course, any distance between us and The Oracle's castle is too much for our hearts to bear, so how can we reduce the current, unsatisfactory prediction error of 0.5 to finally attain enlightenment?
One step at a time. First, let's get clear on what part of our network needs to change in order to improve our network's next prediction. After that, we'll discuss how to adjust our network. Tune in again tomorrow, and bask in this beauty tonight: my favorite nightclub in Mexico City, Dixon's: