Deep Reinforcement Learning

Ankit Rathi

Simplifying Data & AI through Visual Notes

Published: May 13, 2018

While neural networks are responsible for recent breakthroughs in problems like computer vision, machine translation and time series prediction, they can also be combined with reinforcement learning algorithms to create something astounding like AlphaGo.

What is Deep Reinforcement Learning?

To understand deep reinforcement learning, let's first look at some definitions from Wikipedia:

Reinforcement learning (RL) is an area of machine learning inspired by behaviourist psychology, concerned with how software agents ought to take actions in an environment so as to maximize some notion of cumulative reward.

Deep learning is loosely related to information processing and communication patterns in a biological nervous system, such as neural coding that attempts to define a relationship between various stimuli and associated neuronal responses in the brain.

Deep reinforcement learning (DRL) is a machine learning method that extends the reinforcement learning approach using deep learning techniques.

From these definitions we can infer that traditional reinforcement learning addresses how agents can learn to take the best actions in an environment to maximize cumulative reward over time. A major part of that process has been carefully engineering feature representations by hand. Advances in deep learning algorithms have brought a new wave of successful applications in reinforcement learning, because deep networks make it possible to work efficiently with high-dimensional input data (like images). In deep reinforcement learning, a deep neural network is trained so that the agent learns a state abstraction and a policy approximation directly from its input data.

Why is Deep Reinforcement Learning required?

In situations where you use supervised and unsupervised learning, you already have a pretty good idea of the data you have, what's going on and how to solve the problem. You're using machine learning to find interesting patterns in that data and get to a better solution faster. But what about problem spaces where you have partial data or no data, and an agent can only learn by trial and error? This is where reinforcement learning comes in handy: domain experts and organizations typically know what they want a system to do, and they want to automate or optimize a specific process. Recent advances in deep learning have also fueled reinforcement learning, because hand-engineered features are no longer needed. After enough rounds of backpropagation, a deep neural network learns which information is important for the task.

How to use Deep Reinforcement Learning?

Reinforcement learning is inspired by behavioral psychology.

Instead of providing the model with 'correct' actions, we provide it with rewards and punishments. The model receives information about the current state of the environment (e.g. the computer game screen). It then outputs an action, like a joystick movement. The environment reacts to this action and provides the next state, along with any rewards.

The model then learns to find actions that lead to maximum rewards.
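The loop described above (observe a state, output an action, receive the next state and a reward) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `CorridorEnv`, `run_episode` and the random policy are hypothetical names invented here, standing in for a real game and a real model.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment: a corridor of `length` cells.

    The agent starts in cell 0 and earns a reward of 1.0 on reaching the last cell.
    """

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 1 moves right, any other action moves left; walls clamp the position.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def run_episode(env, policy, max_steps=100):
    """Play one episode and return its (state, action, reward, next_state, done) tuples."""
    experiences = []
    state = env.reset()
    for _ in range(max_steps):
        action = policy(state)                       # the model outputs an action
        next_state, reward, done = env.step(action)  # the environment reacts
        experiences.append((state, action, reward, next_state, done))
        state = next_state
        if done:
            break
    return experiences


random.seed(0)
episode = run_episode(CorridorEnv(), lambda state: random.choice([0, 1]))
print(len(episode), sum(reward for _, _, reward, _, _ in episode))
```

The recorded tuples are the "experiences" that a learning algorithm would later train on; the random policy here does no learning at all.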

Q-learning intuition:

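Many modern RL algorithms are adaptations of Q-learning. A good way to understand Q-learning is to compare it with playing chess: a move is judged not only by what it gains immediately, but also by the strength of the position it leads to.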

Q(S,A) = R + γ * max_A' Q(S',A')

The expected future reward Q(S,A) for a given state S and action A is calculated as the immediate reward R, plus the expected future reward thereafter Q(S',A'), discounted by the factor γ. We assume the next action A' is optimal, which is why the maximum over A' is taken.
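As a small worked example, the right-hand side of the formula can be computed directly. The discount factor of 0.9, the function name `q_target` and the sample Q-values are assumptions chosen for illustration; the special case for a finished game is a standard convention rather than something stated above.

```python
GAMMA = 0.9  # discount factor (assumed value for illustration)


def q_target(reward, next_q_values, done, gamma=GAMMA):
    """Compute R + gamma * max Q(S', A') for a single step.

    `next_q_values` holds the current estimates of Q(S', A') for every action A'.
    """
    if done:
        # After a terminal state there is no future reward to add.
        return reward
    return reward + gamma * max(next_q_values)


# A quiet move (R = 0) that leads to a position whose best follow-up is valued at 0.8.
target = q_target(reward=0.0, next_q_values=[0.2, 0.8, -0.5], done=False)
print(target)
```

Here the move earns nothing immediately, yet its value is roughly 0.72 (that is, 0.9 × 0.8) because of the position it reaches.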

As a regression problem:

When playing a game, we generate lots of experiences. These experiences are our training data. We can frame the problem of estimating Q(S,A) as a regression problem. To solve this, we can use a neural network.
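One way to picture the regression framing: each experience becomes an input (state, action) paired with a numeric target R + γ * max Q(S',A'). The sketch below assumes experiences are stored as `(state, action, reward, next_state, done)` tuples and that `q_estimate` is whatever model currently approximates Q; both are illustrative choices, not the article's own code.

```python
GAMMA = 0.9  # discount factor (assumed value)


def build_regression_data(experiences, q_estimate, n_actions, gamma=GAMMA):
    """Turn experiences into (input, target) pairs for a regressor.

    `q_estimate(state, action)` returns the current estimate of Q(state, action).
    """
    inputs, targets = [], []
    for state, action, reward, next_state, done in experiences:
        if done:
            future = 0.0  # nothing follows a terminal state
        else:
            future = max(q_estimate(next_state, a) for a in range(n_actions))
        inputs.append((state, action))
        targets.append(reward + gamma * future)
    return inputs, targets


# Two hypothetical experiences and an untrained estimator that predicts 0 everywhere.
experiences = [
    (0, 1, 0.0, 1, False),
    (1, 1, 1.0, 2, True),
]
inputs, targets = build_regression_data(experiences, lambda s, a: 0.0, n_actions=2)
print(inputs, targets)
```

Because the targets depend on the current estimate of Q, they shift as the model improves, which is what distinguishes this from an ordinary fixed-label regression task.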

Training the experiences:

During training, a batch of experiences is fed to the neural network, and a loss function measures how far the predicted outcome is from the actual outcome.
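A minimal sketch of one such training step, assuming a mean squared error loss and plain gradient descent. To stay dependency-free, the "network" is replaced here by a lookup table with one weight per (state, action) pair, which behaves like a linear model on one-hot inputs; the replay memory, learning rate and function names are all hypothetical.

```python
import random


def mse_loss(predictions, targets):
    """Mean squared error between predicted and target Q-values."""
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)


def train_on_batch(weights, batch, n_actions, gamma=0.9, lr=0.5):
    """Take one gradient step on a batch; return the loss measured before the step."""
    predictions, targets = [], []
    for state, action, reward, next_state, done in batch:
        future = 0.0 if done else max(weights[(next_state, a)] for a in range(n_actions))
        predictions.append(weights[(state, action)])
        targets.append(reward + gamma * future)
    loss = mse_loss(predictions, targets)
    # Gradient of the MSE with respect to each weight that produced a prediction.
    for (state, action, _, _, _), p, t in zip(batch, predictions, targets):
        weights[(state, action)] -= lr * 2 * (p - t) / len(batch)
    return loss


# Hypothetical replay memory for a 3-cell corridor (cell 2 is the goal).
memory = [
    (0, 1, 0.0, 1, False),
    (1, 1, 1.0, 2, True),
    (1, 0, 0.0, 0, False),
    (0, 0, 0.0, 0, False),
]
weights = {(s, a): 0.0 for s in range(3) for a in range(2)}
rng = random.Random(0)
losses = [train_on_batch(weights, rng.sample(memory, 2), n_actions=2) for _ in range(500)]
print(round(weights[(1, 1)], 3), round(weights[(0, 1)], 3))
```

With a real neural network, the same structure applies, but the gradient step would be computed by backpropagation in a deep learning library instead of by hand.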

Building the model:

In the next step, we build a model that will learn a Q-function for the game.
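What such a model might look like, reduced to its bare shape: a network that takes a state as input and returns one Q-value per possible action. The sketch below is a hypothetical one-hidden-layer network in plain Python with random, untrained weights; the class name, layer sizes and `tanh` activation are assumptions, and a real implementation would normally use a deep learning library.

```python
import math
import random


class TinyQNetwork:
    """One-hidden-layer network: state features in, one Q-value per action out."""

    def __init__(self, n_inputs, n_hidden, n_actions, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            # Small random weights so the untrained outputs start near zero.
            return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

        self.w1 = init(n_hidden, n_inputs)
        self.b1 = [0.0] * n_hidden
        self.w2 = init(n_actions, n_hidden)
        self.b2 = [0.0] * n_actions

    def q_values(self, state):
        """Forward pass: return the list of estimated Q(state, a) for every action a."""
        hidden = [math.tanh(sum(w * x for w, x in zip(row, state)) + b)
                  for row, b in zip(self.w1, self.b1)]
        return [sum(w * h for w, h in zip(row, hidden)) + b
                for row, b in zip(self.w2, self.b2)]

    def best_action(self, state):
        """Index of the action with the highest estimated Q-value."""
        q = self.q_values(state)
        return max(range(len(q)), key=q.__getitem__)


net = TinyQNetwork(n_inputs=4, n_hidden=8, n_actions=2)
state = [0.1, -0.2, 0.3, 0.0]
print(net.q_values(state), net.best_action(state))
```

Producing all action values in a single forward pass makes the `max` over next actions in the Q-learning formula cheap to evaluate.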

Exploration:

This is the final step of Q-learning: the agent occasionally chooses a random action for exploration, which will not necessarily be the best one.
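One common way to implement this is the epsilon-greedy rule, which the text above does not name explicitly: with probability epsilon the agent picks a random action, otherwise it picks the action with the highest estimated Q-value. The function name and the value of epsilon below are illustrative assumptions.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability `epsilon`, else the greedy action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))  # explore: any action, possibly a poor one
    return max(range(len(q_values)), key=q_values.__getitem__)  # exploit


rng = random.Random(0)
q_values = [0.1, 0.7, 0.3]
choices = [epsilon_greedy(q_values, epsilon=0.2, rng=rng) for _ in range(1000)]
print({action: choices.count(action) for action in range(len(q_values))})
```

In practice epsilon is often started high and reduced over time, so the agent explores widely early on and relies more on what it has learned later.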

--------------------------------------------------------------------------------------------

Thank you for reading my post. I regularly write about Data & Technology on LinkedIn & Medium. If you would like to read my future posts then simply ‘Connect’ or ‘Follow’. Also feel free to connect on Slideshare
