Introduction to Support Vector Machines (SVM)
(Figure: the optimal separating hyperplane, from https://docs.opencv.org/_images/optimal-hyperplane.png)

Support Vector Machines are supervised learning models with associated algorithms that analyze data and recognize patterns. They are used for both classification and regression.

Support Vector Machines (SVM) Introductory Overview

Support Vector Machines are based on the concept of decision hyperplanes that define decision boundaries. A decision plane is one that separates a set of objects belonging to different classes. An illustrative example is given below.

Let's take a look at a simple schematic example where every object belongs to either class G or class R. The separating line defines a decision boundary: every object to its right belongs to class G, and every object to its left belongs to class R. Any new object falling to the right of the line is classified as G, and any falling to the left is classified as R.

For a two-class, separable training data set, there are many possible linear separators; intuitively, a decision boundary drawn down the middle of the gap between the two classes seems best. While some learning methods such as the perceptron algorithm find just any linear separator, others, like Naive Bayes, search for the best linear separator according to some criterion.

The SVM in particular defines the criterion as looking for a decision surface that is maximally far away from any data point. The distance from the decision surface to the closest data point determines the margin of the classifier. This method of construction necessarily means that the decision function for an SVM is fully specified by a (usually small) subset of the data, which defines the position of the separator. These points are referred to as the support vectors. The figure below shows the margin and support vectors for a sample problem.

In an SVM, the other data points play no part in determining the decision surface that is chosen.
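
To make this concrete, here is a minimal sketch using scikit-learn (an assumed dependency, not mentioned in the article) that fits a linear SVM on a tiny two-class set and prints the support vectors that fully specify the separator:

# Minimal sketch: the fitted decision surface is specified by a small
# subset of the training data, the support vectors.
import numpy as np
from sklearn.svm import SVC

# Tiny linearly separable toy set (class G = +1, class R = -1).
X = np.array([[2.0, 2.0], [2.5, 3.0], [3.0, 2.5],   # class G
              [0.0, 0.0], [0.5, 1.0], [1.0, 0.5]])  # class R
y = np.array([1, 1, 1, -1, -1, -1])

# A large C approximates the hard-margin SVM on separable data.
clf = SVC(kernel="linear", C=1e6).fit(X, y)

print("support vectors:\n", clf.support_vectors_)  # the few points that pin the separator
print("per-class counts:", clf.n_support_)
print("prediction for [2.8, 2.8]:", clf.predict([[2.8, 2.8]]))

Removing any non-support point and refitting would leave the separator unchanged, which is exactly the property described above.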

Maximizing the margin is desirable because points near the decision surface represent very uncertain classification decisions: there is almost a 50% chance of the classifier deciding either way. A classifier with a large margin makes no low-certainty classification decisions. This gives you a classification safety margin: a slight error in measurement or a slight variation in the input will not cause a misclassification.

By construction, an SVM classifier insists on a large margin around the decision boundary rather than accepting just any separating hyperplane. As the margin widens, there are fewer choices of where the separator can be placed, and as a result the classifier's ability to generalize correctly to test data is increased.
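
For readers who want the formal statement, here is a sketch of the standard hard-margin optimization problem (the symbols are the conventional ones and do not appear elsewhere in this article: w is the weight vector, b the bias, and (x_i, y_i) the training points with labels y_i in {-1, +1}):

\min_{w,\,b} \; \tfrac{1}{2}\lVert w \rVert^2
\quad \text{subject to} \quad
y_i \left( w^\top x_i + b \right) \ge 1, \qquad i = 1, \dots, n

Each constraint keeps a training point at distance at least 1/||w|| from the separator, so the full margin is 2/||w||; minimizing ||w|| is therefore the same as maximizing the margin.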

The Kernel Trick

The kernel trick is what makes SVMs really powerful. It is unlikely that a linear dividing boundary always exists in the original input space, and forcing one would result in misclassified labels. One way out is to map each input vector, via a kernel function, into a different space in which a linear dividing hyperplane is feasible.
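
As a concrete illustration, the following sketch (again assuming scikit-learn) trains SVMs on two concentric rings, a data set no straight line can separate. The linear kernel performs near chance, while the RBF kernel's implicit mapping makes a linear separator feasible in its feature space:

# Kernel trick sketch: concentric rings are not linearly separable in the
# 2-D input space, but become separable under the RBF kernel's mapping.
from sklearn.datasets import make_circles
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_circles(n_samples=400, factor=0.3, noise=0.05, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

for kernel in ("linear", "rbf"):
    clf = SVC(kernel=kernel).fit(X_train, y_train)
    print(kernel, "test accuracy:", clf.score(X_test, y_test))
# Typically the linear kernel scores near 0.5 (chance) and the
# RBF kernel scores near 1.0 on this data set.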

Kernel Functions

There are a number of kernels that can be used in Support Vector Machine models. These include linear, polynomial, radial basis function (RBF), and sigmoid kernels. We may even write customized kernels.
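
Here is a brief sketch of those choices using scikit-learn's SVC, including a customized kernel passed in as a callable; the quadratic kernel below is a hypothetical example for illustration, not one prescribed by the article:

# Comparing the built-in kernels and a custom one on a synthetic problem.
from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=4, random_state=0)

def quadratic_kernel(A, B):
    # K(a, b) = (a . b + 1)^2, evaluated for all pairs of rows of A and B.
    return (A @ B.T + 1.0) ** 2

for kernel in ("linear", "poly", "rbf", "sigmoid", quadratic_kernel):
    clf = SVC(kernel=kernel).fit(X, y)
    name = kernel if isinstance(kernel, str) else "custom quadratic"
    print(name, "training accuracy:", clf.score(X, y))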

Shailendra Singh Kathait

Co-Founder & Chief Data Scientist @ Valiance | Envisioning a Future Transformed by AI | Harnessing AI Responsibly | Prioritizing Global Impact |

9y

We have successfully used SVMs on 100M+ customer records to predict the next ad to show. The high level of accuracy motivates finding ways around the practical hurdles. @Valiance solutions

Shailendra Singh Kathait

Co-Founder & Chief Data Scientist @ Valiance | Envisioning a Future Transformed by AI | Harnessing AI Responsibly | Prioritizing Global Impact |

9y

Very true, the high memory requirement is a challenge. But with the advent of the cloud and its scalability it can be overcome, and newer, smarter implementations help you circumvent it as well. That said, it is a trade-off between accuracy and ease.

Amro A.

AI | Distributed Systems | Innovation Leadership | MIT

9y

However, from a practical perspective, I believe that one limitation of SVMs is the high algorithmic complexity and the extensive memory requirements of the quadratic programming step in large-scale tasks.

Shailendra Singh Kathait

Co-Founder & Chief Data Scientist @ Valiance | Envisioning a Future Transformed by AI | Harnessing AI Responsibly | Prioritizing Global Impact |

9y

Completely agreed. The kernel is the key, and we should also consider the VC dimension. SVMs are most powerful when understood together with kernels.

Amjad Zaim

Serial AI Entrepreneur & Advocate of Ethical AI for the Public Good

9y

Good tutorial! I agree that SVM is a powerful classifier with good generalization ability, especially compared to ANNs, which suffer from convergence to local minima; but the choice of kernel function can severely alter model performance and therefore has to be made carefully.
