Classification Measures in Machine Learning

In classification problems, it’s crucial to have effective measures to evaluate how well our model is performing. Unlike regression, where measures like R² and mean squared error help assess model performance, classification tasks rely on other metrics such as accuracy, precision, recall, and F1 score because the predictions are categorical.

When we’re just getting started with machine learning, evaluating our model’s performance might seem confusing, especially with terms like precision, recall, F1 score, and accuracy. But fear not! In this article, we’ll walk through these important concepts step by step, starting with the confusion matrix and going all the way to the F1 score and the Dice coefficient.

Confusion Matrix

Before diving into classification measures, we need to understand the confusion matrix. It’s the foundation upon which accuracy, precision, recall, and other metrics are built. Simply put, the confusion matrix is a table that helps visualize how well a classification model is performing.

  • True Positives (TP): When the model correctly predicts a positive class.
  • True Negatives (TN): When the model correctly predicts a negative class.
  • False Positives (FP): When the model incorrectly predicts a positive class (also known as a Type I error).
  • False Negatives (FN): When the model incorrectly predicts a negative class (also known as a Type II error).

Example: Let’s imagine we’re building a model to predict whether an email is spam (positive class) or not spam (negative class). If the model correctly predicts 50 emails as spam and 40 as not spam, but mistakenly labels 10 non-spam emails as spam and misses 5 actual spam emails, the confusion matrix looks like this:

                      Predicted Spam    Predicted Not Spam
  Actual Spam             TP = 50              FN = 5
  Actual Not Spam         FP = 10              TN = 40

In scikit-learn, we can compute the confusion matrix (and a full classification report) for any fitted classifier. Here’s what that looks like on the classic Iris dataset:

from sklearn import datasets
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix, classification_report

iris = datasets.load_iris()
x_train, x_test, y_train, y_test = train_test_split(iris.data, iris.target,
                                                    random_state=1)
clf = LogisticRegression(solver='liblinear')
# Fit the model on the training data
clf.fit(x_train, y_train)
# Predict on the training data
y_train_pred = clf.predict(x_train)
# Predict on the testing data
y_test_pred = clf.predict(x_test)
# Confusion matrix for the training data
print(confusion_matrix(y_train, y_train_pred))
# Confusion matrix for the testing data
print(confusion_matrix(y_test, y_test_pred))
# Full classification report for the training data
print(classification_report(y_train, y_train_pred))
# Full classification report for the testing data
print(classification_report(y_test, y_test_pred))

With the confusion matrix in hand, we can now calculate key metrics like accuracy, precision, recall, and F1 score.
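Before reaching for scikit-learn’s helpers, it’s worth computing these metrics by hand once. Below is a minimal sketch using the counts from the spam-filter example above (TP = 50, TN = 40, FP = 10, FN = 5); the formulas themselves are explained in the sections that follow.

# Counts taken from the spam-filter example above
tp, tn, fp, fn = 50, 40, 10, 5

accuracy = (tp + tn) / (tp + tn + fp + fn)            # 90 / 105 ≈ 0.857
precision = tp / (tp + fp)                            # 50 / 60  ≈ 0.833
recall = tp / (tp + fn)                               # 50 / 55  ≈ 0.909
f1 = 2 * precision * recall / (precision + recall)    # ≈ 0.870

print(f"Accuracy:  {accuracy:.3f}")
print(f"Precision: {precision:.3f}")
print(f"Recall:    {recall:.3f}")
print(f"F1 score:  {f1:.3f}")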

Accuracy

Accuracy tells us the percentage of predictions the model got right. It is calculated as:

Accuracy = (TP + TN) / (TP + TN + FP + FN)

In scikit-learn:

from sklearn.metrics import accuracy_score

y_pred = [0, 2, 1, 3]
y_true = [0, 1, 2, 3]
# 2 of the 4 predictions match the true labels, so the score is 0.5
print("Score :", accuracy_score(y_true, y_pred))

Accuracy is simple and intuitive, but it doesn’t always tell the full story, especially with imbalanced datasets (when one class significantly outweighs the other). That’s why we also look at precision and recall.
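As a quick illustration of that pitfall, here is a minimal sketch on hypothetical, deliberately imbalanced labels (95 negatives, 5 positives): a “model” that always predicts the majority class still scores 95% accuracy while catching zero positives.

from sklearn.metrics import accuracy_score, recall_score

# Hypothetical, deliberately imbalanced labels: 95 negatives, 5 positives
y_true = [0] * 95 + [1] * 5
# A degenerate "model" that always predicts the majority class
y_pred = [0] * 100

print("Accuracy:", accuracy_score(y_true, y_pred))  # 0.95 -- looks great
print("Recall  :", recall_score(y_true, y_pred))    # 0.0  -- misses every positive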

Precision

Precision focuses on how many of the predicted positive cases were actually correct. It’s important when false positives are costly (like predicting fraud when there’s none). It is calculated as:

Precision = TP / (TP + FP)

In scikit-learn (note that multiclass labels require an averaging strategy):

from sklearn.metrics import precision_score

y_true = [0, 1, 2, 0, 1, 2]
y_pred = [0, 2, 1, 0, 0, 1]
# 'macro' averages the per-class precision scores (≈ 0.22 here);
# omitting average= raises an error for multiclass labels
print(precision_score(y_true, y_pred, average='macro'))

Recall

Recall (or sensitivity) tells us how many actual positive cases were correctly identified. It’s critical when missing a positive case (false negative) is costly, such as in medical diagnoses. It is calculated as:

Recall = TP / (TP + FN)
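Reusing the toy labels from the precision example, a minimal sketch with scikit-learn’s recall_score (again macro-averaged because the labels are multiclass):

from sklearn.metrics import recall_score

y_true = [0, 1, 2, 0, 1, 2]
y_pred = [0, 2, 1, 0, 0, 1]
# 'macro' averages the per-class recall scores (≈ 0.33 here)
print(recall_score(y_true, y_pred, average='macro'))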

F1 Score

The F1 score balances both precision and recall. It’s particularly useful when we need to account for both false positives and false negatives. It is the harmonic mean of the two:

F1 = 2 × (Precision × Recall) / (Precision + Recall)

Incidentally, this is the same quantity known as the Dice coefficient in image segmentation, so once we understand the F1 score we get the Dice coefficient for free.
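And the matching scikit-learn sketch on the same toy labels:

from sklearn.metrics import f1_score

y_true = [0, 1, 2, 0, 1, 2]
y_pred = [0, 2, 1, 0, 0, 1]
# 'macro' averages the per-class F1 scores (≈ 0.27 here)
print(f1_score(y_true, y_pred, average='macro'))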

Classification Report

The classification report is a summary that includes precision, recall, F1 score, and support (the number of occurrences of each class). In Python’s sklearn, we can generate this easily:

from sklearn.metrics import classification_report
print(classification_report(y_true, y_pred))

This gives us a complete view of how our model performs on each class.


Bringing It All Together

To summarize, classification metrics help us understand how well our model is performing beyond just accuracy.

  • Accuracy: Best for balanced datasets.
  • Precision: Important when false positives are costly.
  • Recall: Crucial when false negatives are costly.
  • F1 Score: Useful when we need a balance between precision and recall.



