A Comprehensive Overview of Deep Learning

Original article source: AIML.com

https://aiml.com/what-is-deep-learning/


Introduction:

Deep Learning is a subset of machine learning characterized by the use of neural networks with multiple layers (hence the term "deep") to perform tasks that typically require human intelligence. It is inspired by the structure and function of the human brain: each layer of neurons processes and transforms the input data to progressively extract higher-level features.

What is Artificial Intelligence, Machine Learning and Deep Learning


Deep neural networks (DNNs) consist of interconnected layers of artificial neurons called nodes. Each node receives input from the previous layer, applies a mathematical transformation to it, and passes the transformed output to the next layer. The layers closer to the input learn low-level features, while the deeper layers learn more abstract and complex representations.
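To make this concrete, here is a minimal plain-NumPy sketch of a forward pass. The layer sizes, random weights, and ReLU activation are illustrative assumptions, not taken from the article: each layer computes a weighted sum of the previous layer's outputs, adds a bias, and applies a nonlinearity.

    import numpy as np

    def relu(x):
        # Nonlinear activation: keeps positive values, zeroes out negatives
        return np.maximum(0, x)

    rng = np.random.default_rng(0)

    # Toy network: 4 inputs -> 8 hidden -> 8 hidden -> 2 outputs
    sizes = [4, 8, 8, 2]
    weights = [rng.normal(size=(m, n)) for m, n in zip(sizes[:-1], sizes[1:])]
    biases = [np.zeros(n) for n in sizes[1:]]

    x = rng.normal(size=4)  # raw input features
    for W, b in zip(weights, biases):
        # Weighted sum plus bias, then activation, layer by layer
        x = relu(x @ W + b)

    print(x)  # the final, higher-level representation (2 values)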

How neural networks progressively extract higher-level features from the raw input


This phenomenon of automatically learning meaningful and informative features (or representations) from raw data is also referred to as representation learning, which stands as one of the key strengths of DNNs.

Key characteristics and workings of a Deep Neural Network

Deep learning works by using artificial neural networks, which are composed of layers of interconnected nodes (neurons) that process and transform data as the network is trained.

The key characteristics and workings of deep learning include the following:

(1) The Perceptron,

(2) Deep architecture,

(3) Neural Networks, and

(4) Training

How a Deep Learning Algorithm works


  1. The Perceptron (Neuron): A perceptron is the structural building block of a deep learning model. It is a simple type of artificial neuron, or node, in a neural network. It operates by calculating a weighted sum of its inputs, adding a bias term, and then applying an activation function to this sum.
  2. Deep architecture: The term "deep" refers to the depth of the network, meaning it has more than one hidden layer. Deep architectures enable DNNs to learn and represent intricate features from data.
  3. Neural Networks: Deep learning is built upon the concept of artificial neural networks (ANNs), which comprise interconnected nodes, called neurons or units, organized into layers: an input layer, hidden layers, and an output layer.
  4. Training: Deep neural networks are trained using large datasets. The training process comprises five main steps: (a) sampling a mini-batch of data and initializing the weights, (b) forward propagation and loss calculation, (c) backpropagation and optimization, (d) repeating the training loop, and (e) inference. A minimal code sketch of this loop follows below.
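As a rough illustration of steps (a) through (e), here is a minimal PyTorch training-loop sketch; the dataset, model shape, and hyperparameters are placeholder assumptions, not from the article:

    import torch
    import torch.nn as nn

    # (a) Weights are initialized inside the layer constructors
    model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 2))
    loss_fn = nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

    X = torch.randn(256, 10)            # toy features
    y = torch.randint(0, 2, (256,))     # toy labels
    loader = torch.utils.data.DataLoader(
        torch.utils.data.TensorDataset(X, y), batch_size=32, shuffle=True)

    for epoch in range(5):              # (d) repeat the training loop
        for xb, yb in loader:           # (a) sample a mini-batch
            logits = model(xb)          # (b) forward propagation
            loss = loss_fn(logits, yb)  # (b) loss calculation
            optimizer.zero_grad()
            loss.backward()             # (c) backpropagation
            optimizer.step()            # (c) optimization step

    # (e) inference: apply the trained model to new data, no gradients needed
    with torch.no_grad():
        preds = model(torch.randn(3, 10)).argmax(dim=1)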


Deep Learning Models

Deep learning encompasses several key architectures, each designed for specific types of data and tasks. These architectures serve as building blocks for solving a wide range of tasks in artificial intelligence and machine learning. Here are some of the key deep learning architectures:



Applications of Deep Learning


For the complete list of applications, see: https://aiml.com/what-is-deep-learning/


Evolution of Deep Learning: A brief history and Resurgence

A brief history:

Deep Learning might appear to be a novel discovery in the field of machine learning, given its recent name and fame. However, the history of deep learning spans several decades, dating back to the 1940s, as presented below:

Evolution of Deep Learning from 1940-2010


1950s --> Alan Turing, a British mathematician, first presented the idea that computers would achieve human-level intelligence

1957 --> Frank Rosenblatt, an American psychologist, introduced the perceptron, a single-layer neural network

1965 --> Alexey Ivakhnenko, a Soviet mathematician, created a small functional neural network

1970s --> Limited progress, referred to as the AI winter

1980s --> Backpropagation, a method for training neural networks, was rediscovered by Dr. Geoffrey Hinton, a British-Canadian psychologist and computer scientist

1989 --> Yann LeCun invents a machine that can read handwritten digits

1990s --> Multi-layer perceptrons, the inception of CNNs, and LSTM

1999 --> GPUs (Graphics Processing Units) were developed

2000s --> Limited progress in the field of Deep Learning

2012 --> The deep neural network AlexNet outperformed other methods for image recognition and led to the resurgence of neural networks. Several notable neural network models and frameworks followed

2017 --> Introduction of Transformer architecture, a game-changer in the field of Deep Learning models for solving Natural Language Processing tasks

2018 onwards --> A revolution in the AI space took place with the introduction of BERT, GPT-3, and Stable Diffusion models, and systems such as ChatGPT, Bard, and Perplexity


The resurgence was catalyzed by three key factors:

  • Big data: The digital age brought about an unprecedented amount of data. Deep learning models thrive on vast datasets, and having access to such data allowed for more effective training of deep neural networks.

  • Hardware (GPU): Neural networks are commonly trained on massive datasets and often comprise millions to billions of parameters. The introduction of Graphics Processing Units (GPUs) has been instrumental in facilitating this complex computation by offering accelerated processing power and parallel computing capabilities. Unlike Central Processing Units (CPUs), which have a limited number of cores capable of handling a few software threads at a time, GPUs consist of hundreds of cores capable of simultaneously managing thousands of threads. The increased availability of high-performance GPUs at affordable prices has played a pivotal role in the popularity and success of deep learning. (A short device-placement sketch follows this list.)

  • Software: Breakthroughs in deep learning architectures, such as Transformers for language modeling and CNNs for computer vision, made it possible to handle the complexity of deep neural networks and train them effectively. The development of deep learning frameworks such as PyTorch and TensorFlow made it easier for developers to work with deep networks.
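To illustrate the hardware point above, this is how a model and a batch of data are typically moved onto a GPU in PyTorch; the model and tensor shapes here are made-up placeholders:

    import torch
    import torch.nn as nn

    # Use the GPU if one is available, otherwise fall back to the CPU
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

    model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(),
                          nn.Linear(256, 10)).to(device)
    batch = torch.randn(64, 784).to(device)  # inputs must live on the same device

    logits = model(batch)  # the matrix multiplications now run in parallel on the GPU
    print(logits.shape, logits.device)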


Video Explanation (Playlist):

This playlist contains the following videos in the recommended order:

  • "The 'MIT Introduction to Deep Learning' is an introductory lecture from MIT's Deep Learning course, taught by Alexander Amini and Ava Amini. It explores the meaning of deep learning, its significance, applications, and the fundamentals of neural network training and regularization. This video provides a well-rounded understanding of deep learning and neural networks. https://www.youtube.com/watch?v=QDX-1M5Nj7s&list=PLtBw6njQRU-rwp5__7C0oIVt26ZgjG9NI
  • The video 'But what is a neural network?' by 3Blue1Brown explains the concept of deep learning using an image recognition example. The video helps you understand how a deep learning model learns for real-world applications. https://www.youtube.com/watch?v=aircAruvnKk


--

For more such articles, visit https://aiml.com

Looking for practice quizzes? Visit https://aiml.com/quiz-category/technical/

(PS: Do sign up to take practice quizzes and bookmark your favorite questions)

#deeplearning #machineLearning








