Feature ranking for multi-label classification using Markov Networks
Diego Marinho de Oliveira
Gen-AI Search, RecSys | ex-SEEK, AI Lead, Data Scientist Manager and ML Engineer Specialist
"Abstract: We propose a simple and efficient method for ranking features in multi-label classification. The method produces a ranking of features showing their relevance in predicting labels, which in turn allows to choose a final subset of features. The procedure is based on Markov Networks and allows to model the dependencies between labels and features in a direct way. In the first step we build a simple network using only labels and then we test how much adding a single feature affects the initial network. More specifically, in the first step we use the Ising model whereas the second step is based on the score statistic, which allows to test a significance of added features very quickly. The proposed approach does not require transformation of label space, gives interpretable results and allows for attractive visualization of dependency structure. We give a theoretical justification of the procedure by discussing some theoretical properties of the Ising model and the score statistic. We also discuss feature ranking procedure based on fitting Ising model using l1 regularized logistic regressions. Numerical experiments show that the proposed methods outperform the conventional approaches on the considered artificial and real datasets."
Author: Paweł Teisseyre
Read full paper at https://bit.ly/1UmDhuL
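For readers who want a feel for the two-step idea in the abstract, here is a rough sketch in plain Python. It is not the paper's implementation. It assumes the labels-only network can be approximated by node-wise logistic regressions (each label regressed on the other labels), and it uses the standard Rao score statistic to test each candidate feature without refitting. All function names, the small ridge term, and the toy data generator are hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(t):
    # Numerically stable logistic function.
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def solve(A, b):
    # Solve A x = b by Gaussian elimination with partial pivoting.
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x


def weighted_gram(Z, w):
    # Z^T diag(w) Z for a list-of-rows design matrix Z.
    d = len(Z[0])
    return [[sum(wi * row[j] * row[k] for row, wi in zip(Z, w))
             for k in range(d)] for j in range(d)]


def fit_probs(Z, y, iters=25, ridge=1e-6):
    # Newton iterations for logistic regression; returns fitted probabilities.
    # The tiny ridge term is an assumption added for numerical stability.
    d = len(Z[0])
    beta = [0.0] * d
    for _ in range(iters):
        p = [sigmoid(sum(b * z for b, z in zip(beta, row))) for row in Z]
        grad = [sum(row[j] * (yi - pi) for row, yi, pi in zip(Z, y, p))
                - ridge * beta[j] for j in range(d)]
        H = weighted_gram(Z, [pi * (1.0 - pi) for pi in p])
        for j in range(d):
            H[j][j] += ridge
        beta = [b + s for b, s in zip(beta, solve(H, grad))]
    return [sigmoid(sum(b * z for b, z in zip(beta, row))) for row in Z]


def score_stat(x, Z, y, p):
    # Rao score statistic for adding column x to the fitted null model.
    w = [pi * (1.0 - pi) for pi in p]
    u = sum(xi * (yi - pi) for xi, yi, pi in zip(x, y, p))
    zwx = [sum(wi * row[j] * xi for row, wi, xi in zip(Z, w, x))
           for j in range(len(Z[0]))]
    a = solve(weighted_gram(Z, w), zwx)
    v = (sum(wi * xi * xi for wi, xi in zip(w, x))
         - sum(aj * zj for aj, zj in zip(a, zwx)))
    return u * u / v


def rank_features(X, Y):
    # X: rows of feature values, Y: rows of 0/1 labels.
    n_feat, n_lab = len(X[0]), len(Y[0])
    scores = [0.0] * n_feat
    for k in range(n_lab):
        y = [row[k] for row in Y]
        # Step 1: labels-only model for label k (intercept + other labels).
        Z = [[1.0] + [row[m] for m in range(n_lab) if m != k] for row in Y]
        p = fit_probs(Z, y)
        # Step 2: score test for each candidate feature, no refitting.
        for j in range(n_feat):
            scores[j] += score_stat([row[j] for row in X], Z, y, p)
    ranking = sorted(range(n_feat), key=lambda j: -scores[j])
    return ranking, scores


def make_toy_data(n=300, seed=0):
    # Hypothetical toy data: feature 0 drives both labels, 1 and 2 are noise.
    rng = random.Random(seed)
    X, Y = [], []
    for _ in range(n):
        x = [rng.gauss(0.0, 1.0) for _ in range(3)]
        y1 = int(rng.random() < sigmoid(2.0 * x[0]))
        y2 = int(rng.random() < sigmoid(2.0 * x[0] + (2 * y1 - 1)))
        X.append(x)
        Y.append([y1, y2])
    return X, Y
```

Calling `rank_features(*make_toy_data())` returns the feature indices sorted by their summed score statistics, along with the raw scores. The appeal of the score test here is that the labels-only model is fitted once per label, and each feature is then screened with a cheap closed-form statistic.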
Artificial Intelligence Practitioner || Data Science Mentor || Machine Learning || Deep Learning || Advanced Analytics || NLP || Computer Vision || Gen AI ||
9 years ago: Great!!
Good post
Great paper thanks
Director of Quantitative Insights and Data Science at Allspring Global Investments | Co-Lead of the Native Peoples Business Resource Group
9 years ago: Very cool!