BxD Primer Series: Ridge Regression Models and L2 Regularization in general
Hey there!
Welcome to the BxD Primer Series, where we cover topics such as machine learning models, neural nets, GPT, ensemble models, and hyper-automation in a 'one-post-one-topic' format. Today's post is on Ridge Regression models and the use of L2 regularization in other models. Let's get started:
The What:
The term 'Ridge Regression' refers specifically to linear regression with L2 regularization, while 'L2 regularization' is used more generally for adding an L2 penalty term to the objective function of any model whose parameters are learned by optimization.
The Ridge Regression model is used to handle multicollinearity in a dataset. Multicollinearity is a situation where two or more independent variables in a regression model are highly correlated with each other. This can lead to unstable and unreliable coefficient estimates, making the results difficult to interpret.
Ridge Regression addresses this issue by introducing a regularization term to the least squares objective function of linear regression. This regularization term is a penalty that shrinks the coefficient estimates towards zero, reducing their variance and improving their stability.
The How:
The regularization term in Ridge Regression is known as the L2 penalty. It is defined as the sum of the squares of the coefficients (weights), multiplied by a factor alpha:
L2 penalty = alpha * (sum of squares of weights)
Here, alpha is a hyperparameter that controls the strength of the penalty. A higher value of alpha will result in more shrinkage of the coefficients towards zero, reducing their variance but increasing their bias. A lower value of alpha will result in less shrinkage, increasing the variance but reducing the bias.
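As a quick illustration, here is a minimal sketch of the penalty computation in Python with NumPy; the weight values and alpha below are made up purely for illustration:

```python
import numpy as np

# Illustrative (made-up) weights and regularization strength
w = np.array([0.5, -1.2, 3.0])
alpha = 0.1

# L2 penalty = alpha * (sum of squares of weights)
l2_penalty = alpha * np.sum(w ** 2)
print(l2_penalty)  # 0.1 * (0.25 + 1.44 + 9.0) = 1.069
```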
The cost function with the L2 penalty can be written as:
Cost = Original cost function + alpha * (sum of squares of weights)
In the case of Ridge Regression, the original cost function is usually the Mean Squared Error (MSE). Hence the cost function will be:
Cost = (1/n) * (sum of squared errors between actual and predicted values) + alpha * (sum of squares of weights)
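For concreteness, here is a minimal Python sketch of this cost function. It assumes X already contains any intercept column and, for simplicity, penalizes all weights (in practice the intercept is usually left unpenalized):

```python
import numpy as np

def ridge_cost(w, X, y, alpha):
    """Mean squared error plus the L2 penalty on the weights."""
    residuals = y - X @ w
    mse = np.mean(residuals ** 2)
    l2_penalty = alpha * np.sum(w ** 2)
    return mse + l2_penalty
```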
The optimal values of the weights are found by minimizing this cost function, while alpha is chosen separately, typically via cross-validation. The minimization can be done using various methods, including a closed-form solution and iterative methods such as gradient descent.
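When the loss is written as a sum of squared errors, the closed-form solution is w = (X^T X + alpha * I)^(-1) X^T y; with MSE the same formula holds with alpha effectively rescaled by the number of samples. A minimal NumPy sketch of that solution (intercept handling left aside for simplicity):

```python
import numpy as np

def ridge_closed_form(X, y, alpha):
    """Solve (X^T X + alpha * I) w = X^T y for the ridge weights."""
    n_features = X.shape[1]
    A = X.T @ X + alpha * np.eye(n_features)
    return np.linalg.solve(A, X.T @ y)
```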
The penalty term acts as a constraint on the magnitude of the coefficients. Because the penalty grows as the coefficients grow, the model is encouraged to prefer smaller, more stable coefficients; larger coefficients result in a higher penalty and a higher overall cost.
By minimizing the combined cost function, the model finds coefficient values that balance the tradeoff between fitting the data well and keeping the coefficients small and stable. The result is a more robust model that generalizes better and is less likely to overfit the data.
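To see this tradeoff in action, here is a small, self-contained sketch using scikit-learn's Ridge on synthetic data with a nearly collinear feature; all data and alpha values below are made up for illustration. The coefficient magnitudes shrink as alpha grows:

```python
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
# Append a nearly collinear copy of the first feature to mimic multicollinearity
X = np.hstack([X, X[:, [0]] + 0.01 * rng.normal(size=(100, 1))])
y = X[:, 0] + 2 * X[:, 1] + rng.normal(size=100)

for alpha in (0.01, 1.0, 100.0):
    model = Ridge(alpha=alpha).fit(X, y)
    print(alpha, np.round(model.coef_, 3))  # coefficients shrink as alpha grows
```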
The Why:
Some common reasons for using Ridge Regression or L2 regularization are:
- To handle multicollinearity among independent variables, which otherwise makes coefficient estimates unstable and hard to interpret.
- To reduce overfitting by shrinking coefficients and lowering the variance of the model.
- To improve the stability of coefficient estimates and the model's generalization to unseen data.
The Why Not:
Some reasons to avoid or be careful with Ridge Regression or L2 regularization are:
- It does not perform feature selection: coefficients are shrunk towards zero but rarely become exactly zero, so all features stay in the model.
- The penalty introduces bias into the coefficient estimates, and the penalty strength alpha has to be tuned, typically via cross-validation.
- It is sensitive to the scale of the features (see the note below).
Note on scaling of features: Ridge Regression is sensitive to the scaling of the features. If the features are not on the same scale, the regularization penalty will affect them differently. It is important to scale the features appropriately to avoid this issue.
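A common way to handle this is to standardize the features before fitting, for example with a scikit-learn pipeline; X_train and y_train below are placeholders for your own data:

```python
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Standardize features so the L2 penalty treats them on a comparable scale
model = make_pipeline(StandardScaler(), Ridge(alpha=1.0))
# model.fit(X_train, y_train)  # X_train / y_train are placeholders for your data
```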
Time for you to help in return:
In the coming posts, we will cover two more types of regression models, Lasso and Elastic Net, in a similar format. After that, we will move on to decision-tree-based models.
Let us know your feedback!
Until then,
Enjoy life in full!
Founding Partner - BUSINESS x DATA
If you prefer email updates, visit here: https://anothermayank.substack.com #substack