Beyond the Black Box in Analytics and Cognitive

There is a growing crisis in the world of analytics and cognitive technologies, and as of yet there is no obvious solution. The crisis was created by a spate of good news in the field of cognitive technology algorithms: they’re working! Specifically, deep learning neural networks (DLNNs), a relatively new and complex class of algorithms, have been able to learn from large amounts of labeled data and accomplish a variety of tasks. They can master difficult games (Go, for example), recognize images, translate speech, and perform many other tasks as well as or better than the best humans.

So other than the threat to our delicate human egos and jobs, what’s the problem? These new DLNN models are successful in that they master the assigned task, but the way they do so is quite uninterpretable, bringing new meaning to the term “black box”. For example, how does the Google algorithm know that a cat photo on the Internet is a cat? It has looked at ten million YouTube videos that are labeled as including cats, identified the features or variables that best discriminate between cats and non-cats, and combined them in a very complex model (as this short article describes). But the cat features it has identified are highly abstract and can’t be described in the terms a human would use (whiskers, pointy ears, and so on). In short, the algorithm is about as interpretable as your own neurons’ ability to recognize a cat when you see one.
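To see how little of a learned model’s “reasoning” is inspectable, here is a minimal sketch (mine, not Google’s): a small neural network trained on scikit-learn’s labeled digit images, standing in for the vastly larger cat model. The dataset, network size, and code are illustrative assumptions, not anyone’s production system.

```python
# A toy stand-in for the cat example: train a small neural network on
# labeled images (scikit-learn's 8x8 digit images) and inspect what it learned.
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# One hidden layer of 64 units; a real DLNN would have many layers and
# millions of weights, making the inspection problem far worse.
net = MLPClassifier(hidden_layer_sizes=(64,), max_iter=1000, random_state=0)
net.fit(X_train, y_train)

print("test accuracy:", round(net.score(X_test, y_test), 3))   # the task is mastered...
print("learned 'features':", net.coefs_[0].shape)              # ...but the "why" is a 64x64 array of numbers
```

The accuracy number is impressive; the weight matrix is the black box.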

Lack of interpretability is not a big problem for cat images or other low-impact or low-cost decisions. Black box machine learning is also employed, for example, in digital marketing. We can’t say with any precision why an algorithm decided that I should see the “How to Make Great Landing Pages—Free Guide” ad when I last consulted the Internet. But since the company offering the ad paid fractions of a cent to show it to me—particularly since I did not click on it—nobody much cares why it popped up.

However, when important things like human lives or big money are concerned, people and organizations start to care about interpretability. Let’s say, for example, that a DLNN algorithm identifies a lesion on your chest X-ray as likely to be cancerous, and you have to get a biopsy. Wouldn’t you be interested in how it came to that conclusion, and how good it is at doing so?

For another type of important decision, let’s say that you decide to take your sweetie to a really expensive hotel for Valentine’s Day. You’re checking in at the hotel and your credit card is denied; the card company’s machine learning model has decided that you are likely to be committing fraud. You call the credit card company and ask why your card was turned down. You ask to speak to supervisor after supervisor about this embarrassing incident, but nobody knows why—the model that turned you down is uninterpretable.

On a much larger scale, if bank credit and fraud models are black boxes, then regulators can’t review or understand them. If such algorithms don’t accurately assess risk, then the financial system could be threatened (as it was in 2008/9). Not surprisingly, many regulators are insisting that credit and risk models be interpretable, and as participants in the financial system we should probably be happy about this, even if we lose some predictive power.

So what can be done about this problem? There are a few solutions, but they don’t really address the most complex models like DLNNs. For straightforward statistical models with relatively few variables, it’s possible to determine which ones are really having an effect. One credit card executive, for example, told me that both his employees and his customers insist on transparent models. So if they have a model with 8 variables, they hold each variable in turn at its mean value and see how much predictive power is lost as a result. This works for 8 variables with real-world referents, but it wouldn’t work for a DLNN model with highly abstract variables, and many more than 8 of them to boot.
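For readers who want to see what that check looks like in practice, here is a minimal sketch. It assumes a fitted scikit-learn-style classifier with a predict_proba method, a validation DataFrame X_val with named columns, labels y_val, and AUC as the stand-in for “predictive power.” None of these names or choices come from the executive; they are illustrative.

```python
# Hold each variable at its mean and measure how much predictive power is lost.
# Assumes: `model` is a fitted classifier with predict_proba, `X_val` is a
# pandas DataFrame of validation features, and `y_val` holds the true labels.
import pandas as pd
from sklearn.metrics import roc_auc_score

def mean_substitution_importance(model, X_val, y_val):
    baseline = roc_auc_score(y_val, model.predict_proba(X_val)[:, 1])
    losses = {}
    for col in X_val.columns:
        X_frozen = X_val.copy()
        X_frozen[col] = X_val[col].mean()     # freeze this variable at its mean
        auc = roc_auc_score(y_val, model.predict_proba(X_frozen)[:, 1])
        losses[col] = baseline - auc          # predictive power lost without this variable
    return pd.Series(losses).sort_values(ascending=False)
```

With 8 variables that have real-world names, the resulting ranking reads like an explanation; run the same loop over thousands of abstract DLNN features and you get a list of numbers that explains nothing to a customer or a regulator.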

Some other types of cognitive technologies are more interpretable. I’m not an expert on the different types of natural language processing algorithms, but I am told (specifically by Venkat Srinivasan, the CEO of RAGE Frameworks) that the “computational linguistics” models they use, in which sentences are parsed and the relationships among their words are graphed, are fairly easily interpreted. His company reports, “…as a totally transparent solution, RAGE AI enables knowledge workers to move forward confidently, knowing the reasoning behind the platform’s insights is completely auditable.” The folks at IPsoft, whose avatar Amelia can handle customer interactions in places like call centers and IT help desks, use similar technology and say the same thing about transparency. But statistical natural language processing (a more complex form of natural language processing used, for example, in Google Translate) again achieves a high level of task performance but offers much less interpretability.
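To give a flavor of why parse-based approaches are easier to audit, here is a small sketch using spaCy’s dependency parser. This is a general-purpose open-source tool, not RAGE’s or IPsoft’s technology, and the sentence is invented; the point is only that every word’s grammatical role and attachment is explicitly available for inspection.

```python
# Parse a sentence and print the relationships among its words.
# Every link in the parse can be read and checked by a person, which is
# what makes this style of NLP comparatively auditable.
# Requires: pip install spacy && python -m spacy download en_core_web_sm
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("The card company's model denied the transaction at the hotel.")

for token in doc:
    print(f"{token.text:12} {token.dep_:10} -> {token.head.text}")
```

A statistical or neural translation system, by contrast, hands you its output with nothing comparable to point to as its reasoning.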

You may be glad to know that academics are working on this problem. You may also, however, find their research itself to be largely uninterpretable. For example, at the “Interpretable Machine Learning for Complex Systems” conference, held in Barcelona last December, there were many papers presented on the subject. Actually understanding and learning from these papers is well beyond my mathematical and statistical pay grade, but my favorite title is “Gaussian Process Structure Learning via Probabilistic Inverse Compilation.” I suspect that it may be a while before my fellow academics shed much light on the black box.

At the moment, the best we can do is to employ models that are relatively interpretable. If we need the analytical power of a DLNN, for example, and a data scientist presents it for consideration, ask them to do their best to explain it. Don’t expect a lot. Maybe someday the really smart technologies like DLNNs will be smart enough to explain themselves to us.

Tom Davenport (www.tomdavenport.com), the author of several best-selling management books on analytics and big data, is the President’s Distinguished Professor of Information Technology and Management at Babson College, a Fellow of the MIT Initiative on the Digital Economy, co-founder of the International Institute for Analytics, and an independent senior adviser to Deloitte Analytics. He also is a member of the Data Informed Board of Advisers.

* This article was originally published by Data Informed on February 27, 2016.

Joseph Coelho

Course Instructor at Marquette University

7 yrs

Interesting article. Un-interpretable models, apart from being "black boxes," become problematic when they raise ethical issues -- issues that directly infringe upon the rights of individuals. But machine learning is the new "glamorous thing," and the better the outcomes that can be achieved, the better the model. The race never ends! In such a situation it helps to debate issues and discuss consequences before actually implementing technology that impacts people directly.

Robert Guenther

Senior Data Scientist at Centene Corporation

7 yrs

We don't know exactly how the brain works either, but we use its output.

Marcello Calamai

Algorithm breeder

7 yrs

Very interesting article, Tom. I think that if we need transparency in AI we can use expert systems, which are based on readable rules. A big challenge would be to submit an expert system to evolutionary algorithms, so that we could make it perform like a deep learning system while maintaining readability. I'm an algo trader, and so I'm very interested in such a development in AI. If you have any ideas about this I'd love to hear from you.

Good article Tom! I just asked a question about this very point in a conference/presentation on AI yesterday.
