登录查看更多内容

k-mean clustering and its real usecase

Mudit Mathur

Tech Blogger & Cloud DevOps Engineer @Medium Passionate about Writing, Automation, and Cloud Technologies

发布日期: 2021年8月16日

What is K-Means Algorithm?

K-Means Clustering is an?Unsupervised Learning algorithm, which groups the unlabeled dataset into different clusters. Here K defines the number of pre-defined clusters that need to be created in the process, as if K=2, there will be two clusters, and for K=3, there will be three clusters, and so on.

It is an iterative algorithm that divides the unlabeled dataset into k different clusters in such a way that each dataset belongs only one group that has similar properties.

It allows us to cluster the data into different groups and a convenient way to discover the categories of groups in the unlabeled dataset on its own without the need for any training.

The algorithm takes the unlabeled dataset as input, divides the dataset into k-number of clusters, and repeats the process until it does not find the best clusters. The value of k should be predetermined in this algorithm.

The k-means?clustering?algorithm mainly performs two tasks:

Determines the best value for K center points or centroids by an iterative process.
Assigns each data point to its closest k-center. Those data points which are near to the particular k-center, create a cluster.

Hence each cluster has datapoints with some commonalities, and it is away from other clusters.

The below diagram explains the working of the K-means Clustering Algorithm:

Sanjay Kumar MBA,MS,PhD 10 个月前

AI Atlas #7: Clustering

Rudina Seseri 1 年前

Unsupervised Learning: Clustering and Dimensionality…

AgileWoW 5 个月前

How does the K-Means Algorithm Work?

The working of the K-Means algorithm is explained in the below steps:

Step-1:?Select the number K to decide the number of clusters.

Step-2:?Select random K points or centroids. (It can be other from the input dataset).

Step-3:?Assign each data point to their closest centroid, which will form the predefined K clusters.

Step-4:?Calculate the variance and place a new centroid of each cluster.

Step-5:?Repeat the third steps, which means reassign each datapoint to the new closest centroid of each cluster.

Step-6:?If any reassignment occurs, then go to step-4 else go to FINISH.

Step-7: The model is ready.

要查看或添加评论，请登录

查看全部

k-mean clustering and its real usecase

Mudit Mathur

Tech Blogger & Cloud DevOps Engineer @Medium Passionate about Writing, Automation, and Cloud Technologies

What is K-Means Algorithm?

领英推荐

How does the K-Means Algorithm Work?

更多精彩文章

社区洞察

其他会员也浏览了

Unsupervised Learning: Clustering and Dimensionality Reduction

Understanding K-Means and K-Nearest Neighbours: Key Differences and Confusing Similarities

Task #2 - Prediction using Unsupervised ML