What evaluation approaches would you use to assess the effectiveness of a machine learning model?

Why do we need to evaluate machine learning models?

Machine learning continues to be an increasingly integral component of our lives, whether we’re applying the techniques to research or business problems. To create real value for an organization, a machine learning model must be able to make accurate predictions.

Methods for evaluating a model’s performance fall into two categories: holdout and cross-validation. Both set aside data that the model is not trained on, because evaluating on the training set is misleading: the model can simply memorize the whole training set and therefore predict the correct label for any point in it. This is known as overfitting.

Holdout

The purpose of holdout evaluation is to test a model on different data than it was trained on. This provides an unbiased estimate of learning performance.

In this method, the dataset is?randomly?divided into three subsets:

  1. Training set is a subset of the dataset used to build predictive models.
  2. Validation set is a subset of the dataset used to assess the performance of the model built in the training phase. It provides a test platform for fine-tuning a model’s parameters and selecting the best-performing model. Not all modeling algorithms need a validation set.
  3. Test set, or unseen data, is a subset of the dataset used to assess the likely future performance of a model. If a model fits the training set much better than it fits the test set, overfitting is probably the cause.

Cross-Validation

Cross-validation is a technique that involves partitioning the original observation dataset into complementary subsets: a training set used to train the model, and an independent set used to evaluate it. Types of cross-validation:

  • Leave-p-out cross-validation: …
  • Leave-one-out cross-validation: …
  • Holdout cross-validation: …
  • k-fold cross-validation: …
  • Repeated random subsampling validation: …
  • Stratified k-fold cross-validation: …
  • Time series cross-validation: …
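Of these, k-fold is the most widely used, so here is a minimal sketch of the idea using only the Python standard library. The dataset, the toy 1-nearest-neighbour "model", and the choice of k = 5 are illustrative assumptions, not anything prescribed by this article:

```python
# k-fold cross-validation sketch: split the data into k folds, then
# train on k-1 folds and test on the remaining fold, rotating the
# test fold each time, and average the k accuracy scores.

def kfold_indices(n, k):
    """Yield (train_indices, test_indices) pairs for k folds."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    indices = list(range(n))
    start = 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size

# Hypothetical toy data: x values labeled 1 when x >= 5, else 0.
xs = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
ys = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

def predict_1nn(train_idx, x):
    """Predict with the label of the nearest training point."""
    nearest = min(train_idx, key=lambda i: abs(xs[i] - x))
    return ys[nearest]

accuracies = []
for train_idx, test_idx in kfold_indices(len(xs), k=5):
    correct = sum(predict_1nn(train_idx, xs[i]) == ys[i] for i in test_idx)
    accuracies.append(correct / len(test_idx))

print(sum(accuracies) / len(accuracies))  # mean accuracy across the 5 folds
```

In practice you would shuffle (or stratify) the indices before splitting and use a real estimator, but the rotation of the held-out fold is the essential mechanism.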

Model Evaluation Metrics

Model evaluation metrics are required to quantify model performance. The choice of evaluation metrics depends on a given machine learning task (such as classification, regression, ranking, clustering, topic modeling, among others).

Classification Metrics

In this section we will review some of the metrics used in classification problems, namely:

  • Classification Accuracy
  • Confusion matrix
  • Logarithmic Loss
  • Area under curve (AUC)
  • F-Measure

Classification Accuracy

Classification predictive modeling involves predicting a class label given examples in a problem domain.

Accuracy and its complement error rate are the most frequently used metrics for estimating the performance of learning systems in classification problems.


Classification accuracy involves first using a classification model to make a prediction for each example in a test dataset. The predictions are then compared to the known labels for those examples. Accuracy is then calculated as the number of correctly predicted examples divided by the total number of predictions made on the test set.

  • Accuracy = Correct Predictions / Total Predictions

Conversely, the error rate can be calculated as the total number of incorrect predictions made on the test set divided by all predictions made on the test set.

  • Error Rate = Incorrect Predictions / Total Predictions

The accuracy and error rate are complements of each other, meaning that we can always calculate one from the other. For example:

  • Accuracy = 1 − Error Rate
  • Error Rate = 1 − Accuracy
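The formulas above can be sketched in a few lines of Python; the label lists here are made-up examples:

```python
# Accuracy and error rate on a hypothetical test set (binary labels).
actual    = [1, 0, 1, 1, 0, 1, 0, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]

correct = sum(a == p for a, p in zip(actual, predicted))
accuracy = correct / len(actual)       # correct predictions / total predictions
error_rate = 1 - accuracy              # the two metrics are complements

print(accuracy, error_rate)  # 0.75 0.25
```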

Accuracy Fails for Imbalanced Classification

When the class distribution is only slightly skewed, accuracy can still be a useful metric. When the skew is severe, accuracy becomes an unreliable measure of model performance. For example, on a dataset with 99% negative examples, a classifier that always predicts the negative class achieves 99% accuracy while learning nothing.

Confusion matrix


When performing classification predictions, there are four types of outcomes that can occur.

  • True positives are when you predict an observation belongs to a class and it actually does belong to that class.
  • True negatives are when you predict an observation does not belong to a class and it actually does not belong to that class.
  • False positives occur when you predict an observation belongs to a class when in reality it does not.
  • False negatives occur when you predict an observation does not belong to a class when in fact it does.
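Counting these four outcomes gives the confusion matrix. A short sketch using hypothetical binary labels (1 = positive class, 0 = negative class):

```python
# Tally the four outcome types for a binary classification problem.
actual    = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 1, 0, 1, 0]

tp = sum(a == 1 and p == 1 for a, p in zip(actual, predicted))  # true positives
tn = sum(a == 0 and p == 0 for a, p in zip(actual, predicted))  # true negatives
fp = sum(a == 0 and p == 1 for a, p in zip(actual, predicted))  # false positives
fn = sum(a == 1 and p == 0 for a, p in zip(actual, predicted))  # false negatives

# Confusion matrix: rows = actual class, columns = predicted class.
print([[tn, fp],
       [fn, tp]])
```

Every prediction falls into exactly one of the four cells, so the four counts always sum to the size of the test set.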

Logarithmic Loss


Logarithmic loss (logloss) measures the performance of a classification model where the prediction input is a probability value between 0 and 1. Log loss increases as the predicted probability diverges from the actual label. The goal of machine learning models is to minimize this value. As such, smaller logloss is better, with a perfect model having a log loss of 0.
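For the binary case, log loss is −(1/N) Σ [y·log(p) + (1−y)·log(1−p)]. A minimal sketch, with made-up labels and predicted probabilities:

```python
import math

def log_loss(y_true, y_prob, eps=1e-15):
    """Binary log loss; probabilities are clipped to avoid log(0)."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return -total / len(y_true)

y_true = [1, 0, 1, 0]
y_prob = [0.9, 0.1, 0.8, 0.3]
print(log_loss(y_true, y_prob))
```

Confident correct predictions (p near the true label) contribute little; confident wrong ones are penalized heavily, which is why the loss grows as predictions diverge from the labels.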

Area under Curve (AUC)

Area under the ROC Curve is a performance metric for measuring the ability of a binary classifier to discriminate between positive and negative classes.



A good classifier has an AUC close to 1 and well above the 0.5 achieved by random guessing. A perfect classifier has an ROC curve that rises straight up the Y axis (true positive rate reaches 1 at a false positive rate of 0) and then runs along the top of the plot, giving an AUC of 1.
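AUC also has a probabilistic interpretation: it is the chance that a randomly chosen positive example is scored higher than a randomly chosen negative one (ties counting half). A sketch computing it directly from that definition, with illustrative scores and labels:

```python
def auc(y_true, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

y_true = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.3, 0.6, 0.2]
print(auc(y_true, scores))  # 8 of 9 pairs ranked correctly
```

The quadratic pair loop is fine for a demo; production implementations compute the same quantity from the sorted scores or the ROC curve.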


F-Measure


F-measure (also F-score) is a measure of a test’s accuracy that considers both the precision and the recall of the test to compute the score. Precision is the number of correct positive results divided by the total predicted positive observations. Recall, on the other hand, is the number of correct positive results divided by the number of all relevant samples (total actual positives).
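The F-measure is the harmonic mean of precision and recall, F1 = 2PR / (P + R). A sketch using hypothetical confusion-matrix counts:

```python
# Hypothetical counts: true positives, false positives, false negatives.
tp, fp, fn = 8, 2, 4

precision = tp / (tp + fp)  # correct positives / predicted positives
recall = tp / (tp + fn)     # correct positives / actual positives
f1 = 2 * precision * recall / (precision + recall)

print(precision, recall, f1)
```

Because the harmonic mean is dominated by the smaller value, a model cannot earn a high F1 by excelling at only one of precision or recall.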

Regression Metrics

In this section we review two of the most common metrics for evaluating regression problems: Root Mean Squared Error and Mean Absolute Error.


The Mean Absolute Error (or MAE) is the average of the absolute differences between predictions and actual values. Root Mean Squared Error (RMSE), on the other hand, measures the average magnitude of the error by taking the square root of the average of the squared differences between predictions and actual observations.
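Both definitions translate directly into code; the prediction values below are made up for illustration:

```python
import math

# Hypothetical regression targets and model predictions.
actual    = [3.0, 5.0, 2.5, 7.0]
predicted = [2.5, 5.0, 4.0, 8.0]

errors = [p - a for a, p in zip(actual, predicted)]
mae = sum(abs(e) for e in errors) / len(errors)                 # mean absolute error
rmse = math.sqrt(sum(e * e for e in errors) / len(errors))      # root mean squared error

print(mae, rmse)
```

Because squaring weights large errors more heavily, RMSE is always at least as large as MAE, and the gap widens when a few predictions are far off.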

Conclusion

Ideally, the estimated performance of a model tells us how well it performs on unseen/new data. Making predictions on future new data is often the main problem we want to solve. It’s important to understand the context before choosing a metric because each machine learning model tries to solve a problem with a different objective using a different dataset.


“I’m Baishalini Sahu, a data scientist specializing in artificial intelligence and machine learning. This article has attempted to explain the common evaluation metrics for classification and regression machine learning problems, providing short Python snippets to show how they can be implemented and the mathematical formulas behind them.”

