SVMs versus Logistic Regression
Like logistic regression (LR), support vector machines (SVMs) can be generalised to categorical output variables that take more than two values. Conversely, the kernel trick can also be employed for LR (this is called kernel LR). While LR, like linear regression, makes use of all data points, points far from the margin have much less influence because of the logit transform, and so, even though the mathematics is different, LR often ends up giving results similar to SVMs.
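As an illustrative sketch (not from the chapter): kernel LR can be approximated by passing the inputs through an explicit RBF feature map and fitting an ordinary LR on top. The snippet below assumes scikit-learn; the dataset, the Nystroem kernel approximation, and all parameter values are illustrative choices, not the author's method.

    # Kernel LR via an explicit RBF feature map (Nystroem approximation),
    # compared against a kernel SVM on the same data.
    # Assumes scikit-learn; dataset and parameters are illustrative only.
    from sklearn.datasets import make_moons
    from sklearn.kernel_approximation import Nystroem
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split
    from sklearn.pipeline import make_pipeline
    from sklearn.svm import SVC

    X, y = make_moons(n_samples=500, noise=0.25, random_state=0)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # "Kernel LR": map inputs through an approximate RBF kernel, then fit LR.
    kernel_lr = make_pipeline(
        Nystroem(gamma=1.0, n_components=100, random_state=0),
        LogisticRegression(max_iter=1000),
    )
    kernel_lr.fit(X_train, y_train)

    # Kernel SVM with the same RBF kernel, for comparison.
    svm = SVC(kernel="rbf", gamma=1.0)
    svm.fit(X_train, y_train)

    print("kernel LR accuracy: ", kernel_lr.score(X_test, y_test))
    print("kernel SVM accuracy:", svm.score(X_test, y_test))

On a non-linearly-separable problem such as this, the two models typically land on similar decision boundaries, consistent with the point above.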
As to the choice of SVMs versus LR, it often makes sense to try both. SVMs sometimes give a better fit and are computationally more efficient: LR uses all data points and only afterwards discounts the values far from the margin, whereas an SVM uses only the support-vector data points to begin with. However, an SVM is a bit of a “black box” in terms of interpretability. In LR, by contrast, the contribution of individual variables to the final fit can be better understood, and when fitting back to the data, the outputs can be directly interpreted as probabilities.
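To make the contrast concrete, here is a hedged sketch, again assuming scikit-learn (the dataset and parameters are illustrative, not from the chapter): a linear SVM exposes only which points act as support vectors, while LR exposes per-variable coefficients and outputs that can be read as probabilities.

    # Fit a linear SVM and an LR on the same data and contrast what each exposes.
    # Assumes scikit-learn; dataset choice is illustrative only.
    from sklearn.datasets import load_breast_cancer
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.svm import SVC

    data = load_breast_cancer()
    X, y = data.data, data.target

    # The SVM's fit is determined only by its support vectors.
    svm = make_pipeline(StandardScaler(), SVC(kernel="linear"))
    svm.fit(X, y)
    print("support vectors used:", svm.named_steps["svc"].n_support_.sum(),
          "of", len(X), "points")

    # LR's coefficients show each variable's contribution to the fit...
    lr = make_pipeline(StandardScaler(), LogisticRegression(max_iter=5000))
    lr.fit(X, y)
    coefs = lr.named_steps["logisticregression"].coef_[0]
    top5 = sorted(zip(data.feature_names, coefs), key=lambda t: -abs(t[1]))[:5]
    for name, c in top5:
        print(f"{name}: {c:+.2f}")

    # ...and its outputs are directly interpretable as probabilities
    # (class 1 is 'benign' in this dataset).
    print("P(benign) for first sample:", lr.predict_proba(X[:1])[0, 1])

The support-vector count illustrates the efficiency argument above, and the coefficient printout illustrates why LR is often preferred when interpretability matters.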
Continue reading https://doi.org/10.1016/B978-0-12-803130-8.00004-X
Your last sentence seems to point to a way of using LR to estimate the meaning attached to inputs via an SVM. If you can establish the conditions under which those outputs indeed act as probability distributions, then you're done. If I remember vaguely, it would have to have a central moment, and its integral from -∞ to +∞ would have to be finite for a given input-space domain. Interesting!