登录查看更多内容

The Real Reason behind all the Craze for Deep Learning

Ganes Kesari

2X Founder & CEO @ Tensor Planet | Transforming Waste Management | MIT SMR Columnist | TEDx Speaker

发布日期: 2019年8月27日

+ 关注

A simple English explanation, minus the math, stats & code

You can now read this article in Japanese (thanks to Koki Yoshimoto).

Deep learning has created a perfect dichotomy.

On the one hand, we have data science practitioners raving about it. We have the aficionados jumping in to learn and make a career out of this supposedly game-changing technology in analytics.

And then there is everyone else wondering what the buzz is all about. With a long list of cool technologies already projected as the panacea to business’ problems, one wonders what this additional ‘cool thing’ is all about.

For people on the business side of things, there are no easy avenues to get a simple and intuitive understanding. A Google search gets one entangled in the deep layers of neural networks or gets them bowled over by the math symbols. Online courses on the subject haunt one with a bevy of stats terms.

One eventually gives in and ends up taking all of the hype at face value. Here’s an attempt to demystify and democratize the understanding of deep learning (DL), in simple English and in under 5 minutes. I promise not to show you the cliched pictures of human brains, or a spider web of networks :-)

So, just what is Deep learning?

Let’s start with the basic premise of machine learning (ML). The attempt is to teach machines on how to get to the desired outcome when presented with some input. Say, when shown the past 6 month’s stock prices, predict tomorrow’s value. Or, when presented with a face, identify the person.

The machine learns how to do things like this, obviating the need for laborious instructions every time.

Deep learning is just a disciple (or, discipline) of machine learning, but with a higher IQ. It does the same thing as above, but in a much smarter way.

And, how is it different from machine learning?

Let me explain this by using a simple example of face detection.

(Pic: “Jimmy answering questions” by Beatrice Murch derivative work: Sylenius, licensed under CC BY 2.0)

Traditional face recognition using machine learning involves first manually identifying noticeable features on a human face (such as eyes, eyebrows, chin). Then, a machine is trained to associate every known face with these specific features. Now show it a new face, and the machine extracts these preset features and does a comparison to get the best match. This works moderately well.

(Pic: Machine learning is fun.. by Adam Geitgey)

Now, how does deep learning solve the same problem? The process is nearly the same. But remember, this student is smarter. So, instead of spoon-feeding standard facial features, you let the model creatively figure out what to notice. It may decide that the most striking feature in human faces is the curvature on the left cheek, or how flat a forehead is. Or, perhaps something even subtler.

(Pic: Facial features identified by few interim layers of DeepFace DL model)

The DL model silently figures out this connection between the input (face) and output (name), when shown tons of such pairs. Then, when presented with a new face, voila it gets it right magically. Compared to earlier recognition techniques, DL hits the ball way out of the park, in both accuracy and speed.

(Icons - hunotika, MGalloway(WMF), Google [CC BY 3.0] via Wikimedia Commons)

But, why do they always show pictures of the human brain?

To be fair, there is a connection.

Let's review how a child learns her first lessons. You show flashcards with the picture of an elephant, and read it out aloud. After a few such instances, when the baby looks at any semblance of an elephant, she identifies it instantly. Irrespective of the pose, color or context. We didn’t teach her about the trunk, tusk or shape of ears, but she learned it in totality. And she just gets it.

Just as we are unsure how the baby learned to identify what makes up an elephant, we really don’t know how neural networks, the technology behind deep learning figures this out. This is where all similarities to the human brain and neural connections spring up, but I’ll stop here and save you the hassle.

It suffices to know that deep learning is insanely smart at automatically identifying the most distinguishing signals (features) in any given data(face). In other words, it is a master at feature extraction. When given tons of input-output pairs, it identifies what to learn and how to learn it.

Deep learning figures out the strongest pattern in any presented entity — a face, voice or even a table of numbers.

Is this such a big deal for Machine learning?

Yes, it’s Huge.

In spite of the stellar advances in machine learning, the biggest challenge facing the discipline has been… you guessed it right, feature extraction. Data scientists spend sleepless nights discovering connections between an input (a hundred factors of customer behavior) and output (customer churn). Then the machines conveniently learn from them.

So, the difference between top accuracy and poor results is the identification of best features. Now, thanks to deep learning, if machines can do this heavy lifting as well automatically, won’t it be neat?

What use does a pattern identification machine have for business?

Plenty.

Deep learning can be applied anywhere there is a fitment for machine learning. It can comfortably thrash problems with structured data, an area where traditional algorithms reign supreme. Based on what we’ve seen, it can crash the learning cycles, and push accuracy to dizzying levels.

But the biggest bang for the buck is in those areas where ML is still stuttering without a brisk start. Take the case of images, video, audio or deeper meaning from plain old text. Deep learning has crushed problems with such data types that need machines to identify, classify or predict. Let's look at a few:

Advanced face recognition technology is seeing early applications in the real world, and the quality of image or exposure are no longer constraints.
It has made not just detection of animal species possible but lets us name every whale shark in the ocean. Say hello to Willy, the humpback whale!
Advances in speech recognition cut error rates by over 30% since DL took over. And about 2 years ago, they beat humans in this space.
DL has endowed machines with artistic abilities, and there are interesting applications of image synthesis and style transfer made possible.
Thanks to DL it is possible to extract deeper meaning from text, and there are initial attempts to solve the rankling challenge of fake news.

(Object detection using Deep learning on Tensor flow, by Diego Cavalca)

That’s all too smooth, isn’t there a catch?

Well, the biggest advantage of deep learning is also its shortcoming. The very fact that humans don’t have to identify distinguishing features means that the machine defines what it deems important. We, humans, are creatures of reason. We have trouble with anything that doesn’t fit a mold.

Trouble brews in this paradise when one tries interpreting the meaning of machine identified features or attempts to transparently explain why a machine’s decision must be implemented. After all, how comfortable is a business decision-maker to bet millions, or worse, place lives of people at the altar of cryptic, but accurate recommendations by a tool, that was invented a few years ago?

Interpretability of deep learning algorithms and visual explanation of results is a rapidly evolving field, and research is fast catching up. And yes, it needs tons of data to even get started. So yes, there are some hiccups in this area, but the stellar and stable results clearly outweigh the cons, for now.

So, that’s deep learning in a nutshell. Please let me know what you think.

PS: This is a repost of an article from my blog on Medium.

Too much to read? Here’s a 5-minute video post of this article.

Tharashasank D.

5 年

Thanks for sharing this great article . It had great insights in understanding Deep Learning in few seconds.

Arvind Prakash

Product Management & Strategy Leader | UCLA Alumnus | 16+ Years of experience | Product Management, Business strategy, Customer Experience, Product Vision, Roadmap, Go to Market | Startup Coach

5 年

A great article for the masses. Thanks Ganes Kesari for demystifying the Deep Learning.

查看更多评论

要查看或添加评论，请登录

Ganes Kesari的更多文章

AI Revolution In Diabetes Care - How Technology Is Beating This Silent Killer??

2024年1月12日

AI Revolution In Diabetes Care - How Technology Is Beating This Silent Killer??

Hello, Wishing you & yours a happy, healthy, and prosperous 2024! This newsletter will take you about 4 minutes to…

4 条评论
#69: Three Data Analytics Challenges: How Decision Intelligence Can Help You Tackle Them??

2023年11月16日

#69: Three Data Analytics Challenges: How Decision Intelligence Can Help You Tackle Them??

Hello, This newsletter will take you about 4 minutes to read. I.

2 条评论
#67: ??Unlocking Excellence: A Roadmap to Scaling Decision Intelligence

2023年10月18日

#67: ??Unlocking Excellence: A Roadmap to Scaling Decision Intelligence

Hello, This newsletter will take you about 4 minutes to read. I.

4 条评论
#66: Top 3 Applications of Data Science for Transforming Warehouse Operations???

2023年10月9日

#66: Top 3 Applications of Data Science for Transforming Warehouse Operations???

Hello, This newsletter will take you about 4 minutes to read. I.
#65: 3 Steps To Implement Decision Intelligence in Your Enterprise??

2023年9月18日

#65: 3 Steps To Implement Decision Intelligence in Your Enterprise??

Hello, What’s that latest with Generative AI, and importantly, how can leaders leverage it for decision-making? Let’s…

3 条评论
Why Decision Intelligence Is The Most Important Data Analytics Trend Of This Decade

2023年9月6日

Why Decision Intelligence Is The Most Important Data Analytics Trend Of This Decade

Hello, This newsletter will take you about 4 minutes to read. I.

4 条评论
8 Critical Steps To Get Your Team To Adopt AI??

2023年6月22日

8 Critical Steps To Get Your Team To Adopt AI??

Hello, Generative AI is clearly the flavor of the season. Most enterprise leaders I talk to are curious to learn more…
AI Trends For 2023: Industry Experts (And ChatGPT AI) Make Their Predictions??

2023年4月27日

AI Trends For 2023: Industry Experts (And ChatGPT AI) Make Their Predictions??

AI Trends For 2023: Industry Experts (And ChatGPT AI) Make Their Predictions?? Hello, This newsletter will take you…
3 Surprisingly common ways leaders fail their AI projects??

2022年12月19日

3 Surprisingly common ways leaders fail their AI projects??

Hello, This newsletter will take you about 4 minutes to read. ----- I.

1 条评论
4 Reasons Why Digital Biomarkers Are Game-Changers In Healthcare??

2022年11月22日

4 Reasons Why Digital Biomarkers Are Game-Changers In Healthcare??

Hello, This newsletter will take you about 4 minutes to read. ----- I.

See all articles

The Real Reason behind all the Craze for Deep Learning

Ganes Kesari

2X Founder & CEO @ Tensor Planet | Transforming Waste Management | MIT SMR Columnist | TEDx Speaker

A simple English explanation, minus the math, stats & code

So, just what is Deep learning?

And, how is it different from machine learning?

Is this such a big deal for Machine learning?

What use does a pattern identification machine have for business?

That’s all too smooth, isn’t there a catch?

Ganes Kesari的更多文章

社区洞察

其他会员也浏览了

Machine Learning for Beginners: The 3 Basic Strategies

Top 10 things to not do when learning GenAI

Deep Learning, an Alternative way of Thinking

Machine Learning 101: Understanding the inner workings of AI

Enhancing Deep Q Learning: A Dive into Double Deep Q Networks, Dueling Deep Q Networks, and Prioritized Experience Replay

Mastering Transfer Learning with TensorFlow Part: 1

Ensemble Learning: Combining Models for Improved Performance

An Introduction to Deep Learning

Machine Learning vs. Deep Learning: Understanding the Basics

AI Atlas #3: Transfer Learning

A simple English explanation, minus the math, stats & code

So, just what is Deep learning?

And, how is it different from machine learning?

Is this such a big deal for Machine learning?

What use does a pattern identification machine have for business?

That’s all too smooth, isn’t there a catch?

Ganes Kesari的更多文章

AI Revolution In Diabetes Care - How Technology Is Beating This Silent Killer??

#69: Three Data Analytics Challenges: How Decision Intelligence Can Help You Tackle Them??

#67: ??Unlocking Excellence: A Roadmap to Scaling Decision Intelligence

#66: Top 3 Applications of Data Science for Transforming Warehouse Operations???

#65: 3 Steps To Implement Decision Intelligence in Your Enterprise??

Why Decision Intelligence Is The Most Important Data Analytics Trend Of This Decade

8 Critical Steps To Get Your Team To Adopt AI??

AI Trends For 2023: Industry Experts (And ChatGPT AI) Make Their Predictions??

3 Surprisingly common ways leaders fail their AI projects??

4 Reasons Why Digital Biomarkers Are Game-Changers In Healthcare??

社区洞察

其他会员也浏览了

Machine Learning for Beginners: The 3 Basic Strategies

Top 10 things to not do when learning GenAI

Deep Learning, an Alternative way of Thinking

Machine Learning 101: Understanding the inner workings of AI

Enhancing Deep Q Learning: A Dive into Double Deep Q Networks, Dueling Deep Q Networks, and Prioritized Experience Replay

Mastering Transfer Learning with TensorFlow Part: 1

Ensemble Learning: Combining Models for Improved Performance

An Introduction to Deep Learning

Machine Learning vs. Deep Learning: Understanding the Basics

AI Atlas #3: Transfer Learning