Linear Regression - gradient descent optimization
Debi Prasad Rath
@AmazeDataAI- Technical Architect | Machine Learning | Deep Learning | NLP | Gen AI | Azure | AWS | Databricks
Hi connections. Trust you are doing well. In this post we will continue from where we left off, which is to discuss the "gradient descent" algorithm. As the name suggests, gradient descent is an optimization algorithm that finds the optimal values of the beta terms that minimize the cost function. It is an iterative process: at each pass we estimate the error and update the beta terms so that the error keeps shrinking until it becomes nearly zero or stops improving. It is one of the most powerful tools in all of machine learning.
Let me break it down for you guys. In "gradient descent", the gradient refers to a vector that gives the direction of the slope of the cost function at the current point, which we use to find the minimum of the cost function. By definition, it is the rate of change of the cost function for a tiny/very very small change in its parameters. In simple terms, you can write it as d(cost_function)/d(parameter), i.e. dy/dX parametrized by the beta terms. For instance, while riding a bike, you apply the brakes just enough that the bike slows down before crashing into an obstacle. Well, that is roughly gradient descent for you.
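To make the "rate of change" idea concrete, here is a minimal Python sketch (the toy cost function and the values are my own illustration, not taken from any library) that approximates a gradient with a tiny finite difference:

# Toy cost with its minimum at beta = 3; purely illustrative.
def cost(beta):
    return (beta - 3) ** 2

# Rate of change of the cost for a very small change in beta.
def numerical_gradient(f, beta, eps=1e-6):
    return (f(beta + eps) - f(beta - eps)) / (2 * eps)

print(numerical_gradient(cost, 5.0))  # roughly 4.0, the slope at beta = 5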
Technically, gradient descent is a first-order optimization method, meaning it uses only first derivatives. Precisely, it performs two tasks: first the gradient is computed, and then we make a step (move) in the direction opposite to the gradient. The size of this step comes from multiplying the current gradient by the learning rate. Here, the learning rate controls how big a move we make at each update. It should be neither a very big step nor a tiny little baby step; we want to strike a balance so that the cost function converges to its minimum.
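As a tiny sketch of that two-step idea, compute the gradient and then move against it scaled by the learning rate (all numbers below are illustrative assumptions, continuing the toy cost from above):

learning_rate = 0.1
beta = 5.0                              # current parameter value
gradient = 2 * (beta - 3)               # gradient of the toy cost (beta - 3)**2
beta = beta - learning_rate * gradient  # step opposite to the gradient
print(beta)                             # 4.6, a small move toward the minimum at 3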
Let us understand it in pieces. Try to recollect the equation of linear regression with one independent variable, y = Beta0 + Beta1 * X1 + error, where Beta0 represents the intercept term and Beta1 represents the slope/gradient/coefficient/weight, i.e. the rate of change of the target with respect to a small change in X1. The intercept is the constant value where the fitted line cuts the y-axis. The slope term (in this case Beta1) tells us how much the target changes for a unit change in X1.
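A quick sketch of that equation with made-up numbers:

beta0 = 2.0   # intercept: where the fitted line cuts the y-axis
beta1 = 0.5   # slope: change in the target for a unit change in X1
x1 = 10.0
y_hat = beta0 + beta1 * x1   # prediction from the fitted line
print(y_hat)                 # 7.0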
Well, you can find the derivative/slope/gradient by drawing a tangent line and observing its steepness. In this way, the slope tells us how the Beta terms need to be updated along the way. More fundamentally, the slope on the initial attempt will be steep; with each new parameter update it gets reduced. This parameter update happens iteratively until the cost attains its minimum value, a point alternatively known as the "convergence point". Intuitively, the convergence point is where the cost function is minimum, as you can see from the image below. This also indicates that at this point the model should stop learning. Keep in mind that both "loss" and "cost" are used in this context: loss means the error for one training example, while cost is the error averaged over the entire training dataset.
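To make the loss-versus-cost distinction concrete, here is a minimal sketch with made-up data, using squared error as the per-example loss:

y_true = [3.0, 5.0, 7.0]   # made-up targets
y_pred = [2.5, 5.5, 6.0]   # made-up predictions

def loss(y, y_hat):
    # squared error for a single training example
    return (y - y_hat) ** 2

# cost = loss averaged over the entire (tiny) dataset
cost = sum(loss(y, y_hat) for y, y_hat in zip(y_true, y_pred)) / len(y_true)
print(cost)   # mean squared error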
gradient descent formula
-------------------------
Beta0 = Beta0 - learning_rate * d(cost_function)/dBeta0
Beta1 = Beta1 - learning_rate * d(cost_function)/dBeta1
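
Putting the formula to work, below is a minimal sketch of gradient descent for simple linear regression with a mean squared error cost. The data, learning rate, and iteration count are illustrative assumptions, not tuned values.

gradient descent in python (illustrative sketch)
-------------------------------------------------
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [3.1, 4.9, 7.2, 9.0, 10.8]          # roughly y = 1 + 2 * x with noise

beta0, beta1 = 0.0, 0.0                 # start from arbitrary values
learning_rate = 0.01
n = len(x)

for _ in range(2000):
    y_hat = [beta0 + beta1 * xi for xi in x]
    errors = [yi - yh for yi, yh in zip(y, y_hat)]
    # partial derivatives of the MSE cost with respect to each beta
    d_beta0 = (-2.0 / n) * sum(errors)
    d_beta1 = (-2.0 / n) * sum(e * xi for e, xi in zip(errors, x))
    # update step: move opposite to the gradient, scaled by the learning rate
    beta0 = beta0 - learning_rate * d_beta0
    beta1 = beta1 - learning_rate * d_beta1

print(beta0, beta1)                     # should land close to (1, 2)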