Winning Ensemble Classification Strategies
Niraj Kumar, Ph.D.
AI/ML R&D Leader | Driving Innovation in Generative AI, LLMs & Explainable AI | Strategic Visionary & Patent Innovator | Bridging AI Research with Business Impact
These days, (1) the increase in the complexity of data, (2) data quality-related issues, and (3) the demand for accuracy and robustness in classification have created a lot of challenges for data science practitioners. This has resulted in a shift towards ensemble classifiers. However, selecting an ensemble classifier is not easy: there are many ensemble strategies, such as (1) Model Averaging, (2) Weighted Model Averaging, (3) Majority Voting, (4) Bagging, (5) Boosting, (6) Stacking, (7) Blending, and many others.
Ensemble strategies such as (1) Model Averaging, (2) Weighted Model Averaging, and (3) Majority Voting are basic ensemble techniques. We can use them independently or as key components of more advanced ensemble strategies (see the short sketch after this paragraph). Based on their widespread use and high success rates, I have identified three ensemble strategies that have emerged as the winners in machine-learning-based ensemble classification: (1) Stacking, (2) Bagging, and (3) Boosting. In this article, I try to capture these three techniques with the simplest possible explanations and tutorials.
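Before diving into the three winners, here is a minimal sketch of the basic strategies using scikit-learn's VotingClassifier: hard voting corresponds to majority voting, soft voting to model averaging of predicted probabilities, and the weights argument to weighted model averaging. The dataset and the weights chosen below are illustrative assumptions, not recommendations.

# Minimal sketch of the basic ensemble strategies with VotingClassifier.
# Dataset and weights are illustrative assumptions.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB

X, y = load_breast_cancer(return_X_y=True)
members = [
    ("lr", LogisticRegression(max_iter=1000)),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=42)),
    ("nb", GaussianNB()),
]

# Majority voting: each member casts one vote for a class label.
majority = VotingClassifier(estimators=members, voting="hard")
# Model averaging: average the members' predicted class probabilities.
averaging = VotingClassifier(estimators=members, voting="soft")
# Weighted model averaging: same, but with per-member weights (assumed values).
weighted = VotingClassifier(estimators=members, voting="soft", weights=[2, 3, 1])

for name, model in [("Majority voting", majority),
                    ("Model averaging", averaging),
                    ("Weighted averaging", weighted)]:
    print(f"{name}: mean CV accuracy = {cross_val_score(model, X, y, cv=5).mean():.3f}")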
Stacking (Stacked Generalization)
The term stacking is common in deep learning, but in machine learning a two-layered form of stacking is popular. At the first layer, we use multiple classifiers to learn different aspects of the data. We select these classifiers so that their predictions and errors are as uncorrelated as possible. At the second layer, we use a meta-learner to learn from the prediction results of the first-layer classifiers. In the entire process, we do not train the meta-learner on the raw data directly; instead, we use a K-fold cross-validation style strategy, training the first-layer classifiers on K-1 folds and generating their predictions on the remaining K-th fold. Common first-layer choices include (1) Decision Trees, (2) SVM, (3) Neural Networks, (4) Random Forest, (5) Logistic Regression, (6) Gradient Boosting, (7) XGBoost, and (8) Bayesian classifiers. At the second level, the meta-learner is usually a lightweight classifier such as Logistic Regression, although some research articles suggest heavyweight classifiers as well. A good selection of first-layer classifiers generally gives better accuracy than any of the individual classifiers on their own.
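Here is a minimal sketch of two-layer stacking using scikit-learn's StackingClassifier (available in scikit-learn 0.22 and later). The first-layer classifiers, dataset, and hyperparameters are illustrative assumptions; internally, cv=5 carries out the K-fold procedure described above, so the meta-learner is trained on out-of-fold predictions.

# Minimal sketch of two-layer stacking; models and parameters are assumptions.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# First layer: diverse classifiers chosen to keep their errors weakly correlated.
first_layer = [
    ("dt", DecisionTreeClassifier(max_depth=5, random_state=42)),
    ("svm", make_pipeline(StandardScaler(), SVC(random_state=42))),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=42)),
]

# Second layer: a lightweight meta-learner (logistic regression) trained on
# out-of-fold predictions produced via internal 5-fold cross-validation.
stack = StackingClassifier(estimators=first_layer,
                           final_estimator=LogisticRegression(max_iter=1000),
                           cv=5)
stack.fit(X_train, y_train)
print("Stacking accuracy:", stack.score(X_test, y_test))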
Bagging
Bootstrap aggregating, also called bagging, is a machine learning ensemble meta-algorithm designed to improve the stability and accuracy of machine learning algorithms used in statistical classification and regression. It also reduces variance and helps to avoid overfitting [source]. Random Forest is a well-known example.
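As a minimal sketch, the snippet below compares bagged decision trees (scikit-learn's BaggingClassifier) with a Random Forest, which is essentially bagging of decision trees plus random feature subsetting. The dataset and hyperparameters are illustrative assumptions.

# Minimal sketch of bagging; dataset and hyperparameters are assumptions.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import BaggingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)

# Bagging: train many decision trees on bootstrap samples and aggregate their votes.
bagging = BaggingClassifier(DecisionTreeClassifier(), n_estimators=100, random_state=42)

# Random Forest: bagging of decision trees with random feature subsetting at each split.
forest = RandomForestClassifier(n_estimators=100, random_state=42)

for name, model in [("Bagged trees", bagging), ("Random Forest", forest)]:
    scores = cross_val_score(model, X, y, cv=5)
    print(f"{name}: mean CV accuracy = {scores.mean():.3f}")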
Boosting
A boosting algorithm combines multiple simple models (also known as weak learners or base estimators) to generate the final output. Different boosting algorithms differ in how they build the trees/weak learners and in how they combine them. The different variants of (a) Gradient Boosting and (b) XGBoost are highly popular in this area.
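The following minimal sketch trains scikit-learn's GradientBoostingClassifier and, if the optional xgboost package is installed, XGBoost's XGBClassifier on the same data. The dataset and hyperparameters are illustrative assumptions.

# Minimal sketch of boosting; dataset and hyperparameters are assumptions.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Gradient boosting: shallow trees added sequentially, each one fitting the
# errors of the ensemble built so far.
gb = GradientBoostingClassifier(n_estimators=200, learning_rate=0.1,
                                max_depth=3, random_state=42)
gb.fit(X_train, y_train)
print("Gradient Boosting accuracy:", gb.score(X_test, y_test))

try:
    from xgboost import XGBClassifier  # optional dependency
    xgb = XGBClassifier(n_estimators=200, learning_rate=0.1, max_depth=3,
                        random_state=42)
    xgb.fit(X_train, y_train)
    print("XGBoost accuracy:", xgb.score(X_test, y_test))
except ImportError:
    print("xgboost not installed; skipping the XGBClassifier comparison.")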