登录查看更多内容

Lessons in A.I. from a Budding Machine Learning Engineer — A Brief Introduction to Reinforcement Learning Part II

Larry Johnson

发布日期: 2025年1月13日

Introduction

In our last article we introduced RL and explained what a policy is and how it applies to our fast food problem domain. In this article we will dive a bit deeper by discussing the reward and value function. In addition, we will close out this series and open up future conversations to discuss more popular AI topics in addition to building and deploying real solutions.

When one speaks of RL we have to go back in time and discuss the genesis of the field from one of the early contributors, Richard Bellman. His work goes back to the mid 20th century around 1957. Building off the work of the Russian mathematician Andrey Andreyevich Markov, Bellman utilized Markov Decision Processes or MDP to solve a class of problems that involved sequential decisions to reach some long term goal. Bellman, born in Brooklyn, was the son of Jewish parents who ran a local grocery store. However, he was raised atheist which helped fuel his desire for unconventional thinking at that time in American history. During World War II, he worked for a theoretical physics division of the military in Los Alamos, New Mexico.2 Returning back to college Bellman defended his PhD thesis at Princeton University within just 3 months on the “the stability of differential equations”.1

Through his experimentation, rigor, and academic excellence one of his many contributions to mathematics and control theory was the Bellman Equation.

领英推荐

Making the unapproachable approachable

David Knott 1 年前

Quick Intuition for Understanding GenAI by Thinking in…

Xiao-Fei Zhang 6 个月前

Motion Magnification: Deep Learning and Hidden…

Ahmet Alper Akis 2 周前

It looks like there is a lot going in this equation and in fact there is. However, we will break it down to its subcomponent parts, so that you have a fundamental understanding of its functioning. First, remember the pseudocode in our last article that showed how a typical RL policy is designed and subsequently implemented.

I Did A Thing!

410 位关注者

要查看或添加评论，请登录

Larry Johnson的更多文章

Lessons in A.I. from a Budding Machine Learning Engineer?-?A Brief Introduction to Reinforcement Learning Part?I

2025年1月6日

Lessons in A.I. from a Budding Machine Learning Engineer?-?A Brief Introduction to Reinforcement Learning Part?I

In our last article we closed out our discussion on Supervised and Unsupervised Learning, which are two of the other…

5 条评论
The Phoenix Project Executive Summary

2024年11月21日

The Phoenix Project Executive Summary

Synposis If there was ever a book that any executive, manager, or employee should read, it is The Phoenix Project. This…
Data Is The New Oil

2024年9月23日

Data Is The New Oil

Introduction “Data is the new oil.” There are some Information Scientists who have a great disdain of this notion.

5 条评论
Lessons in A.I. from a Budding Machine Learning Engineer — Finding the Right Recipe

2024年8月5日

Lessons in A.I. from a Budding Machine Learning Engineer — Finding the Right Recipe

Cooking a special meal for the holiday season or another special occasion takes time to prepare to serve the people you…
Lessons in A.I. from a Budding Machine Learning Engineer — Getting to the Core — Part II

2024年7月22日

Lessons in A.I. from a Budding Machine Learning Engineer — Getting to the Core — Part II

Introduction In our last article we discussed prediction and how this relates to but is different from inference in…
Up with jobs, down with code!

2024年6月6日

Up with jobs, down with code!

Breaking News!!! In another riot across the country, this time in Canton Ohio, residents are fed up with computers…

7 条评论
Throwback Thursdays - The Second Machine Age

2024年2月1日

Throwback Thursdays - The Second Machine Age

(Originally Published on June 17, 2016) Data Religion’s Book of the Month is “The Second Machine Age” by the tag team…
Lessons in A.I. from a Budding Machine Learning Engineer - Getting to the Core

2024年1月16日

Lessons in A.I. from a Budding Machine Learning Engineer - Getting to the Core

Introduction In our last article we closed out our discussion on Linear Algebra thereby introducing Matrix Algebra. In…

1 条评论
Lessons in A.I. from a Budding Machine Learning Engineer - Mixed Fruit

2023年8月22日

Lessons in A.I. from a Budding Machine Learning Engineer - Mixed Fruit

Introduction In our last article we covered Linear Algebra introducing matrices and the matrix equation. In this…
Lessons in A.I. from a Budding Machine Learning Engineer - Sweet Potato Pie

2023年8月14日

Lessons in A.I. from a Budding Machine Learning Engineer - Sweet Potato Pie

Introduction In our last article we talked about what is Data Literacy and introduced Linear Algebra to slice and dice…

1 条评论

See all articles

Lessons in A.I. from a Budding Machine Learning Engineer — A Brief Introduction to Reinforcement Learning Part II

Larry Johnson

Introduction

领英推荐

I Did A Thing!

410 位关注者

Larry Johnson的更多文章

社区洞察

其他会员也浏览了

Motion Magnification: Deep Learning and Hidden Vibrations Around Us

Machine Learning tribes

Scalars Vectors Matrices and Tensors

Math 2.0 - The fundamental importance of Machine Learning

Back Propagation: Holistic Overview

Classification Models for Machine Learning

AI LEARNING SCHEDULE FOR BEGINNERS

Deep Reinforcement Learning with Python Training Course

LLMs: What is softmax?

Introduction

领英推荐

I Did A Thing!

410 位关注者

Larry Johnson的更多文章