R - Advanced Regression Models
Anandh Shanmugaraj
Group CEO & MD at Gladwin International & Company, India's leading Interim Leadership Consulting, Executive Search and Leadership Advisory Firm.
----------------------------------------------------------------------------------
Each of the regression analyses below contains working code examples along with a brief explanation of the use case for each regression type. Many of these code snippets are generic enough to serve as a base template that you can build on for your own analyses.
Please note that the information presented here should not be construed as a full and complete analysis, but rather as a template and quick guide to the available modeling options. You are advised to carry out independent and thorough research before drawing conclusions.
Robust Regression
Robust regression can be used in any situation where OLS regression can be applied. Because it down-weights influential observations rather than letting them dominate the fit, it typically produces more reliable estimates than OLS when outliers are present. It is particularly useful when there is no compelling reason to exclude the outliers from your data.
Robust regression can be implemented with the rlm() function in the MASS package. Influential observations can be weighted down using the psi.huber, psi.hampel or psi.bisquare weighting functions, specified via the psi argument.
How To Specify A Robust Regression Model
library(MASS)
rlm_mod <- rlm(stack.loss ~ ., stackloss, psi = psi.bisquare) # robust reg model
summary(rlm_mod)
#> Call: rlm(formula = stack.loss ~ ., data = stackloss)
#> Residuals:
#> Min 1Q Median 3Q Max
#> -8.91753 -1.73127 0.06187 1.54306 6.50163
#>
#> Coefficients:
#> Value Std. Error t value
#> (Intercept) -41.0265 9.8073 -4.1832
#> Air.Flow 0.8294 0.1112 7.4597
#> Water.Temp 0.9261 0.3034 3.0524
#> Acid.Conc. -0.1278 0.1289 -0.9922
#>
#> Residual standard error: 2.441 on 17 degrees of freedom
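The weighting mechanism described above can be inspected directly: the fitted rlm object stores the final IWLS weights in its w component, so observations that were down-weighted as influential are easy to spot. The snippet below is a minimal sketch of that inspection, and it also refits the model with the psi.huber and psi.hampel weighting functions for comparison; the object names rlm_huber and rlm_hampel are illustrative.
# Inspect the final IWLS weights; values well below 1 mark down-weighted observations
round(rlm_mod$w, 2)
which(rlm_mod$w < 0.5)  # indices of the most heavily down-weighted rows
# Refit using the alternative weighting functions mentioned above
rlm_huber <- rlm(stack.loss ~ ., data = stackloss, psi = psi.huber)   # psi.huber is the default
rlm_hampel <- rlm(stack.loss ~ ., data = stackloss, psi = psi.hampel)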
Compare Performance of rlm() with lm()
Let's build the equivalent lm() model so we can compare the errors against the respective fitted values.
lm_mod <- lm(stack.loss ~ ., stackloss) # lm reg model
Calculate the Errors
# Errors from lm() model
DMwR::regr.eval(stackloss$stack.loss, lm_mod$fitted.values)
#> mae mse rmse mape
#> 2.3666202 8.5157125 2.9181694 0.1458878
# Errors from rlm() model
DMwR::regr.eval(stackloss$stack.loss, rlm_mod$fitted.values)
#> mae mse rmse mape
#> 2.1952232 9.0735283 3.0122298 0.1317191
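If the DMwR package is not available on your system, the same four metrics can be computed directly in base R. The helper below, regr_metrics(), is a hypothetical stand-in for DMwR::regr.eval and is shown only as a sketch.
# Base-R equivalent of the four metrics reported by DMwR::regr.eval
regr_metrics <- function(actual, predicted) {
  err <- actual - predicted
  c(mae  = mean(abs(err)),            # mean absolute error
    mse  = mean(err^2),               # mean squared error
    rmse = sqrt(mean(err^2)),         # root mean squared error
    mape = mean(abs(err / actual)))   # mean absolute percentage error
}
regr_metrics(stackloss$stack.loss, lm_mod$fitted.values)
regr_metrics(stackloss$stack.loss, rlm_mod$fitted.values)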
The robust regression model gives a lower MAE and MAPE, while the lm() fit has a slightly lower MSE and RMSE. This is expected: OLS minimizes the sum of squared residuals by construction, whereas the robust fit trades some squared-error performance for reduced sensitivity to the influential observations.
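To see where the two fits actually differ, it can also help to put the coefficient estimates side by side; the snippet below is a small illustrative comparison.
# Compare the OLS and robust coefficient estimates side by side
cbind(OLS = coef(lm_mod), Robust = coef(rlm_mod))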