The need for ensembling
Debi Prasad Rath
@AmazeDataAI- Technical Architect | Machine Learning | Deep Learning | NLP | Gen AI | Azure | AWS | Databricks
Hi connections. Trust you are doing well. In this article we will take a deep dive into "ensembling". Let us get started.
Ensemble modelling, as the name suggests, combines a collective set of models and averages their predictions to arrive at a better overall model. This offers the flexibility to build a robust model by comparing and combining several models, with the idea that each one captures different aspects of the data's behaviour. When the ensemble makes predictions, the errors of the individual models tend to cancel out. In simpler terms, ensembling helps us build a model that generalizes well and is therefore less prone to overfitting. In bagging-style ensembles, each model is trained on a different subset of the data (sampled with replacement), so that collectively the models learn as much about the data's behaviour as possible.
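To make the idea concrete, here is a minimal sketch of that core loop: train several trees on bootstrap samples and average their predictions. The synthetic dataset and scikit-learn estimators are assumptions chosen purely for illustration, not a prescribed setup.

```python
# Minimal sketch: fit trees on bootstrap samples (drawn with replacement)
# and average their predictions. Dataset and models are illustrative.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

X, y = make_regression(n_samples=500, n_features=10, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

rng = np.random.default_rng(0)
predictions = []
for _ in range(25):
    # Draw a bootstrap sample: same size as the training set, with replacement.
    idx = rng.integers(0, len(X_train), size=len(X_train))
    tree = DecisionTreeRegressor().fit(X_train[idx], y_train[idx])
    predictions.append(tree.predict(X_test))

# Averaging across trees smooths out each individual tree's errors.
ensemble_pred = np.mean(predictions, axis=0)
```

Each tree overfits its own bootstrap sample in a slightly different way, and averaging is what cancels those individual errors out.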
In general there are three types of ensembling: bagging, boosting and stacking. Bagging, short for bootstrap aggregating, builds models on different bootstrap subsets of the data and then averages all of their predictions to arrive at a robust final model. Bagging runs in parallel, meaning each individual model can be trained independently of the others. Conversely, boosting creates a sequential chain of models where each model tries to correct the errors of the previous one by reweighting the training examples; it is built on the idea of combining weak learners into a strong learner. Stacking trains several different base models on the same data and then combines their predictions with a second-level model, often called a meta-model.
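The three families map directly onto standard scikit-learn estimators. The sketch below is illustrative only; the specific estimators and hyperparameters are assumptions chosen for brevity.

```python
# Hedged sketch of the three ensemble families with scikit-learn.
from sklearn.datasets import make_classification
from sklearn.ensemble import (BaggingClassifier, GradientBoostingClassifier,
                              RandomForestClassifier, StackingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, random_state=0)

# Bagging: independent trees fit in parallel on bootstrap samples.
bagging = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50, n_jobs=-1)

# Boosting: trees fit sequentially, each correcting its predecessors.
boosting = GradientBoostingClassifier(n_estimators=50)

# Stacking: a meta-model (logistic regression) combines base predictions.
stacking = StackingClassifier(
    estimators=[("rf", RandomForestClassifier()), ("dt", DecisionTreeClassifier())],
    final_estimator=LogisticRegression(),
)

for name, model in [("bagging", bagging), ("boosting", boosting), ("stacking", stacking)]:
    print(name, cross_val_score(model, X, y, cv=5).mean())
```

Note the design difference: bagging can use `n_jobs=-1` because its members are independent, while boosting cannot be parallelized across estimators since each one depends on the previous model's errors.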
Ensembling, then, combines the strengths of different models to seek better predictions than any single model can deliver. Intuitively, it involves fitting many (typically tree-based) models on different samples of the same data and averaging their predictions. Tree ensembles are non-parametric, and the averaging makes them stable: small changes in the data will not drastically affect their predictions. They are also less prone to overfitting, because the final model is diversified across learners that capture different patterns in the data, which gives us more "confidence" that its estimates are reliable. Through boosting, ensembling can also build a strong learner sequentially by correcting the predictions of weak learners, and through stacking it can combine the predictions of different models trained on the same data in the best possible way.
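As a rough illustration of that reliability claim, the sketch below (again with an assumed synthetic dataset) compares a single decision tree against a random forest under cross-validation; the forest will typically show a higher mean score with lower variance across folds.

```python
# Illustrative comparison: one tree versus an ensemble of trees.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)

single_tree = DecisionTreeClassifier(random_state=42)
forest = RandomForestClassifier(n_estimators=100, random_state=42)

tree_scores = cross_val_score(single_tree, X, y, cv=5)
forest_scores = cross_val_score(forest, X, y, cv=5)

# The ensemble usually wins on both mean accuracy and fold-to-fold spread.
print(f"tree:   {tree_scores.mean():.3f} +/- {tree_scores.std():.3f}")
print(f"forest: {forest_scores.mean():.3f} +/- {forest_scores.std():.3f}")
```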
Thanks for all your time and support. Happy learning.