
Practical Guide to Clustering Algorithms & Evaluation in R

Manish Saraswat

Senior Machine Learning Engineer

Published: January 19, 2017

Introduction

Clustering algorithms belong to the family of unsupervised machine learning algorithms. Why unsupervised? Because no target variable is present. The model is trained only on the given input variables and attempts to discover intrinsic groups (or clusters) in them.

Since the target variable is not present, we can't label those groups in advance. Then how is it done? That's the interesting part we'll look at in this article!

Clustering algorithms are widely used across industries such as retail, banking, manufacturing, and healthcare. In business terms, companies use them to separate customers who share similar characteristics from the rest, in order to design customised engagement campaigns.

For example, in healthcare, a hospital might cluster patients based on their tumor size so that patients with different tumor sizes can be treated differently.
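
The hospital example can be sketched in a few lines of R. This is only a minimal illustration, not the method from the full article: the tumor sizes are simulated (hypothetical values, not real patient data), the choice of three clusters is an assumption, and `kmeans()` from base R's `stats` package stands in for whichever algorithm a real analysis would use.

```r
# Hypothetical tumor sizes in cm, simulated for illustration only.
set.seed(42)
tumor_size <- c(rnorm(20, mean = 1.0, sd = 0.2),
                rnorm(20, mean = 3.0, sd = 0.3),
                rnorm(20, mean = 6.0, sd = 0.4))
patients <- data.frame(tumor_size = tumor_size)

# Fit k-means with k = 3 (an assumed number of groups).
# nstart = 25 reruns the random initialisation and keeps the best fit.
fit <- kmeans(patients, centers = 3, nstart = 25)

# Cluster IDs are arbitrary integers; the algorithm never sees a target variable.
patients$cluster <- fit$cluster

# Summarise each cluster so an analyst can interpret it after the fact.
cluster_means <- aggregate(tumor_size ~ cluster, data = patients, FUN = mean)
print(cluster_means)
print(table(patients$cluster))
```

Note that the cluster numbers carry no meaning on their own. It is the per-cluster summary that lets someone describe a group as, say, "small tumors", which is how unlabeled groups get interpreted in practice.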

Table of Contents

Types of Clustering Techniques
Distance Calculation for Clustering
K-means Clustering | How does it work?
How to select the best value of k in k-means?
Hierarchical Clustering | How does it work?
What are the evaluation methods used in cluster analysis?
Clustering in R - Water Treatment Plants

Complete Article - Read Here

Did this tutorial help you learn clustering better? Drop your suggestions and questions in the comments below.
