ç™»å½•æŸ¥çœ‹æ›´å¤šå†…å®¹

Random Forest Algorithm, An Interactive Discussion

Niraj Kumar, Ph.D.

AI/ML R&D Leader | Driving Innovation in Generative AI, LLMs & Explainable AI | Strategic Visionary & Patent Innovator | Bridging AI Research with Business Impact

å‘å¸ƒæ—¥æœŸ: 2016å¹´6æœˆ17æ—¥

The random Forest algorithm has gained a significant interest in the recent past, due to its quality performance in several areas. A lot of new research work/survey reports related to different areas also reflects this. So, I decided to present a highly interactive tutorial on Random forest.

Random Forest in Research after-2012 to till date (a few references as per my interest)

1. Computer vision

On the study of Performance of Random Forest and SVM in Face Recognition, [1] reported that - the SVM achieved accuracy of 93.20%, but when optimized with different classifiers and kernel accuracy among all was 95.89%, 96.92%,
97.94%. Random Forest achieved accuracy of 97.17%. Similarly, [2] demonstrated that Random Forest regression can be used to generate
high quality response images.

2. Text Mining (including IR, NLP)

[3] describes a machine learning approach, a Random Forest (RF) classifier, to automatically compile bilingual dictionaries of technical terms from comparable corpora. [4] used random forest classifier to achieve 0.79 ROC-AUC at 0.76 precision and 0.76 recall in the detection of clickbait, i.e., short messages that lure readers to click a link.

3. Other Areas

according to the survey [5], With the data explosion in modern biology, and the rise in the data complexity in bioinformatics, as a non-parametric model, random forest provides a unique combination of prediction accuracy and model
interpretability. [6], noted the robustness of Random Forest-based gene selection methods.

Reference:

Kremic, E., & Subasi, A. (2015). Performance of Random Forest and SVM in Face Recognition. The International Arab Journal of Information Technology.
Cootes, Tim F., et al. "Robust and accurate shape model fitting using random forest regression voting." Computer Visionâ€“ECCV 2012. Springer Berlin Heidelberg, 2012. 278-291.
Kontonatsios, G., Korkontzelos, I., Jun'ichi Tsujii, & Ananiadou, S. (2014, April). Using a Random Forest Classifier to Compile Bilingual Dictionaries of Technical Terms from Comparable Corpora. In EACL (pp. 111-116).
Potthast, M., K?psel, S., Stein, B., & Hagen, M. (2016, March). Clickbait Detection. In European Conference on Information Retrieval (pp. 810-817). Springer International Publishing.
Qi, Y. (2012). Random forest for bioinformatics. In Ensemble machine learning (pp. 307-323). Springer US.
Kursa, Miron Bartosz. "Robustness of Random Forest-based gene selection methods." BMC bioinformatics 15, no. 1 (2014): 1.

è¦æŸ¥çœ‹æˆ–æ·»åŠ è¯„è®ºï¼Œè¯·ç™»å½•

Niraj Kumar, Ph.D.çš„æ›´å¤šæ–‡ç«

Internal Covariate Shift and Batch Normalization

2023å¹´3æœˆ25æ—¥

Internal Covariate Shift and Batch Normalization

Internal Covariate Shift Internal covariate shift [1,2,3] refers to the phenomenon where the distribution of inputs toâ€¦
Forced/Guided Learning in Deep Learning

2023å¹´3æœˆ11æ—¥

Forced/Guided Learning in Deep Learning

The forced/guided type deep learning techniques have proven their ability in any model that outputs in sequences. Forâ€¦
Deep Clustering (A Self-Supervised Learning System)

2023å¹´2æœˆ18æ—¥

Deep Clustering (A Self-Supervised Learning System)

If you are interested in any of the following, How do I develop a deep learning model, that can learn to do clustering?â€¦
Time to Welcome - â€œThe Quantum Deep Learningâ€

2023å¹´1æœˆ21æ—¥

Time to Welcome - â€œThe Quantum Deep Learningâ€

The Quantum World is Approaching Us The MIT xPRO - Quantum Computer Ai, highlighted the status of quantum AI by usingâ€¦
Deep Learning for Dynamic Graph

2022å¹´4æœˆ30æ—¥

Deep Learning for Dynamic Graph

Introduction. It is well understood that adding the time dimension to each and every component of the graph helps us inâ€¦
Winning Ensemble Classification Strategies

2020å¹´6æœˆ6æ—¥

Winning Ensemble Classification Strategies

These days (1) due to the increase in the complexity of data, (2) data quality-related issues, and (2) the demand forâ€¦
Simplest Tutorials on BERT and XLNet

2020å¹´1æœˆ25æ—¥

Simplest Tutorials on BERT and XLNet

XLNet XLNet: is a generalized autoregressive pre-training method that (1) enables learning bidirectional contexts byâ€¦
Video Book on Deep Learning

2019å¹´12æœˆ13æ—¥

Video Book on Deep Learning

I am happy to present a video book on deep learning. Thanks for all the email messages and suggestions.

3 æ¡è¯„è®º
Deep Learning for NLP Part-2

2019å¹´10æœˆ12æ—¥

Deep Learning for NLP Part-2

Sequence transduction plays a very important role in natural language processing. The ability to transform andâ€¦
Loss Functions: Cross-Entropy, Categorical Cross-Entropy Loss, Binary Cross-Entropy Loss

2019å¹´1æœˆ22æ—¥

Loss Functions: Cross-Entropy, Categorical Cross-Entropy Loss, Binary Cross-Entropy Loss

The following contains tutorial videos on (1) Cross-Entropy, (2) Categorical Cross-Entropy Loss, and (3) Binaryâ€¦

See all articles

Random Forest Algorithm, An Interactive Discussion

Niraj Kumar, Ph.D.

AI/ML R&D Leader | Driving Innovation in Generative AI, LLMs & Explainable AI | Strategic Visionary & Patent Innovator | Bridging AI Research with Business Impact

Random Forest in Research after-2012 to till date (a few references as per my interest)

1. Computer vision

2. Text Mining (including IR, NLP)

3. Other Areas

Reference:

Niraj Kumar, Ph.D.çš„æ›´å¤šæ–‡ç«

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

Transformers Unleashed: A Comprehensive Guide to Applying Transformers Across Data Types

New Book on Synthetic Data: Version 3.0 Just Released

Unsupervised Learning as Signals for Pairs Trading and StatArb

Applied Machine Learning: CNNs for Image Recognition

Roadmap of skills required to create AI Agent

From Research To Reality: Deep Learning Methods on Time Series Forecasting on Financial Data

5th grade data science (NLP: Computers & Text)

EP 1: Paper 1: A Neural Probabilistic Language Model

Class 19 - REGRESSION Notes from the AI Advance course by Irfan Malik & Dr Sheraz Naseer (Xeven Solutions)

Choosing the Right Time Series Model: A Blend of Data Science, Statistics, and Financial Understanding.

Random Forest in Research after-2012 to till date (a few references as per my interest)

1. Computer vision

2. Text Mining (including IR, NLP)

3. Other Areas

Reference:

Niraj Kumar, Ph.D.çš„æ›´å¤šæ–‡ç«

Internal Covariate Shift and Batch Normalization

Forced/Guided Learning in Deep Learning

Deep Clustering (A Self-Supervised Learning System)

Time to Welcome - â€œThe Quantum Deep Learningâ€

Deep Learning for Dynamic Graph

Winning Ensemble Classification Strategies

Simplest Tutorials on BERT and XLNet

Video Book on Deep Learning

Deep Learning for NLP Part-2

Loss Functions: Cross-Entropy, Categorical Cross-Entropy Loss, Binary Cross-Entropy Loss

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

Transformers Unleashed: A Comprehensive Guide to Applying Transformers Across Data Types

New Book on Synthetic Data: Version 3.0 Just Released

Unsupervised Learning as Signals for Pairs Trading and StatArb

Applied Machine Learning: CNNs for Image Recognition

Roadmap of skills required to create AI Agent

From Research To Reality: Deep Learning Methods on Time Series Forecasting on Financial Data

5th grade data science (NLP: Computers & Text)

EP 1: Paper 1: A Neural Probabilistic Language Model

Class 19 - REGRESSION Notes from the AI Advance course by Irfan Malik & Dr Sheraz Naseer (Xeven Solutions)

Choosing the Right Time Series Model: A Blend of Data Science, Statistics, and Financial Understanding.

Time to Welcome - â€œThe Quantum Deep Learningâ€

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†