
Black Box Machine Learning may be harmful

Pranab Ghosh

AI Consultant || MIT Alumni || Entrepreneur || Open Source Project Owner || Blogger

Published: October 5, 2017

Some machine learning solutions are marketed as easy-to-use, point-and-click products that can be treated as a black box. The reason is understandable. A product touted as easy to use gives the vendor a wider reach, and it gives machine learning practitioners instant gratification.

If you are working on a critical Machine Learning application, taking such a shortcut for the sake of expediency is not advisable. There may be serious consequences.

Unfortunately, Machine Learning is not like most other technologies. It's a deep and complex multidisciplinary field founded on Math, Statistics, Information Theory and Neuroscience. There is no quick and easy way to become an expert.

Selecting the appropriate algorithm for a given problem and data set is not trivial. Once one or more candidate algorithms have been selected, tuning each algorithm through its various configuration parameters is a tedious and painstaking process. The same applies to feature engineering.
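
As a rough illustration of what even the simplest tuning loop involves, here is a minimal sketch that searches over a single configuration parameter using cross validation. Everything in it is an assumption made for illustration: the synthetic two-class data, the choice of k-nearest neighbors as the algorithm, the candidate values of k and the use of 5 folds. It uses only the Python standard library. A real problem would involve more parameters, more candidate algorithms and real data.

```python
import random
from collections import Counter


def knn_predict(train, point, k):
    """Predict a label by majority vote among the k nearest training rows."""
    nearest = sorted(
        train,
        key=lambda row: sum((a - b) ** 2 for a, b in zip(row[0], point)),
    )[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def cross_val_accuracy(data, k, folds=5):
    """Mean accuracy of k-NN over `folds` held-out folds."""
    scores = []
    for i in range(folds):
        test = data[i::folds]  # every folds-th row, starting at row i
        train = [row for j, row in enumerate(data) if j % folds != i]
        correct = sum(knn_predict(train, x, k) == y for x, y in test)
        scores.append(correct / len(test))
    return sum(scores) / folds


def make_data(n=200, seed=0):
    """Synthetic (hypothetical) data: two overlapping Gaussian classes."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        label = rng.randint(0, 1)
        center = 0.0 if label == 0 else 1.5
        data.append(((rng.gauss(center, 1.0), rng.gauss(center, 1.0)), label))
    return data


data = make_data()
# Odd values of k avoid tied votes between the two classes.
results = {k: cross_val_accuracy(data, k) for k in (1, 3, 5, 11, 21)}
best_k = max(results, key=results.get)

for k, acc in results.items():
    print(f"k={k:2d}  mean accuracy={acc:.3f}")
print("best k:", best_k)
```

The validation accuracy changes with k, and the best value depends on the data. A product that hides this parameter has still picked some value for it; the user just cannot see which one or why.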

Point and click and voila

A vendor may claim that you can just feed in some training data, click a button and voila, you have a predictive model. Any experienced Machine Learning practitioner will be skeptical and curious about the inner workings of the product, and will immediately ask the following questions.

What is the algorithm?
How was the algorithm selected?
What are the tuning parameters?
If the tuning parameters are not exposed to the user, what are they set to?
What kind of validation technique was used while training the model?
What kind of feature engineering was done?
What performance criterion was used to train the model?
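
The last question matters more than it may seem. The sketch below uses made-up labels, an assumed imbalanced data set with 95 negatives and 5 positives, and a degenerate "model" that always predicts the majority class. It shows that the same predictions look excellent under one criterion and useless under another.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall(y_true, y_pred, positive=1):
    """Fraction of truly positive cases that were predicted positive."""
    preds_for_positives = [p for t, p in zip(y_true, y_pred) if t == positive]
    return sum(p == positive for p in preds_for_positives) / len(preds_for_positives)


# Hypothetical imbalanced data set: 95 negatives, 5 positives.
y_true = [0] * 95 + [1] * 5
# A degenerate "model" that always predicts the majority class.
y_pred = [0] * 100

print("accuracy:", accuracy(y_true, y_pred))  # 0.95
print("recall:  ", recall(y_true, y_pred))    # 0.0
```

If the product reports only accuracy, this model appears to be 95% correct even though it never detects a single positive case. Without knowing the criterion, the reported number cannot be interpreted.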

From novice to maverick

Let's consider Machine Learning practitioners at three levels of expertise and see how they might respond to the black box Machine Learning products.

Novice: They have a shallow understanding of machine learning theory and algorithms, and are not knowledgeable about the nuances of different algorithms. They are more likely to be open to black box solutions.
Advanced: They have deep knowledge of machine learning theory and of different algorithms. They are comfortable selecting an appropriate algorithm for a problem and tuning it. They are likely to be very skeptical about black box solutions.
Maverick: These people have all the traits of advanced Machine Learning engineers and more. They have very deep knowledge and are always curious. If the implementation source code is available, they are likely to read it to understand the inner workings of the algorithms. They may have ideas for improving an algorithm and implement their own version. It's difficult to imagine these people using black box Machine Learning solutions.

Ease of use is a desirable goal for any product. However, there is a limit to how easy a product can be made before you start sacrificing value and quality.

