Advanced simulations, the key to successful Deep Reinforcement Learning-based AI deployments

Olivier Fontana

VP Marketing | Scaling B2B GTM & Partnerships for tech & AI for over 20 years | Strategy & Execution | Microsoft & Philips alumnus

Published: January 26, 2022

Deep reinforcement learning (DRL) is an AI training methodology that lets the AI learn on its own through trial and error instead of using a preexisting training data set. Because it is rarely possible to run that trial and error on the live system, DRL requires advanced simulators to effectively pre-train the AI agent before deployment.

There are multiple ways to build such simulators, and which approach is most appropriate for a project depends on the particular use case. At the highest level of abstraction, we can categorize simulator types into five main approaches. Three of them are focused on in-house “build” strategies, and two are “buy” strategies.

5 strategies to build advanced simulators:

Physics-based
Custom software
Off-the-shelf simulation software packages
Custom-built deep learning AI
Digital twins

The remainder of this article provides a brief introduction to the five approaches, and the embedded video allows you to explore each of them further.

Simulation strategies to train AI agents using Deep Reinforcement Learning

Physics-based simulations

When systems are of limited complexity and well understood, one option is to use physics-based simulators. This approach leverages well-known physics rules to build an accurate simulation of a real-life system. However, these approaches can quickly become complex when the system encompasses more than one device, process, or piece of equipment.

An example of such an approach is this robotic arm simulator used by a financial institution’s research labs to train their AI agent.
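
To make the idea of a physics-based simulator concrete, here is a minimal sketch in Python. It is not taken from any project mentioned in this article: the heated tank, the `HeatedTank` class, and every numeric constant are assumptions chosen purely for illustration. The model applies a simple energy balance (heat added by the heater minus heat lost to the surroundings) and advances it with explicit Euler steps.

```python
from dataclasses import dataclass


@dataclass
class HeatedTank:
    """Toy physics-based simulator of a heated tank (hypothetical example)."""

    ambient_c: float = 20.0        # surrounding temperature in deg C (assumed)
    heat_capacity: float = 4000.0  # joules per deg C (assumed)
    loss_coeff: float = 10.0       # watts lost per deg C above ambient (assumed)
    max_power_w: float = 1000.0    # heater power at full throttle (assumed)
    temp_c: float = 20.0           # current state of the simulation

    def step(self, throttle: float, dt: float = 1.0) -> float:
        """Advance the energy balance by dt seconds using explicit Euler."""
        throttle = min(max(throttle, 0.0), 1.0)  # clamp the action to [0, 1]
        heat_in = throttle * self.max_power_w
        heat_out = self.loss_coeff * (self.temp_c - self.ambient_c)
        self.temp_c += dt * (heat_in - heat_out) / self.heat_capacity
        return self.temp_c


tank = HeatedTank()
for _ in range(600):  # ten simulated minutes at full throttle
    tank.step(throttle=1.0)
print(round(tank.temp_c, 1))
```

A DRL agent would pick the `throttle` value at each step instead of the fixed `1.0` used here. Even this single-device model needs four physical parameters, which hints at why the approach becomes complex once several pieces of equipment interact.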

Custom software simulations

When the system does not require advanced physics to be simulated, it can be relatively simple to build custom simulators using standard programming languages such as Python.

However, real-life systems are rarely simple enough for that approach to be a viable solution.
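
As a rough illustration of what such a custom simulator can look like, the sketch below models a small inventory with random daily demand. Everything in it is hypothetical: the `InventorySim` class, the `reset`/`step` method names (a common convention for training environments, not a specific library's API), the demand range, and the reward weights.

```python
import random


class InventorySim:
    """Toy custom simulator with a reset/step interface (hypothetical example)."""

    def __init__(self, capacity: int = 50, seed: int = 0):
        self.capacity = capacity
        self.rng = random.Random(seed)  # seeded so that runs are repeatable
        self.stock = 0

    def reset(self) -> int:
        """Start a new episode with a half-full warehouse."""
        self.stock = self.capacity // 2
        return self.stock

    def step(self, order_qty: int) -> tuple[int, float]:
        """Receive an order, serve random demand, return (stock, reward)."""
        self.stock = min(self.stock + max(order_qty, 0), self.capacity)
        demand = self.rng.randint(0, 10)  # assumed demand distribution
        sold = min(demand, self.stock)
        self.stock -= sold
        lost_sales = demand - sold
        # Assumed reward: revenue minus holding cost minus stockout penalty.
        reward = 1.0 * sold - 0.1 * self.stock - 2.0 * lost_sales
        return self.stock, reward


sim = InventorySim()
state = sim.reset()
total_reward = 0.0
for _ in range(30):
    action = 10 if state < 15 else 0  # placeholder rule; a DRL agent would decide
    state, reward = sim.step(action)
    total_reward += reward
print(state, round(total_reward, 1))
```

The whole environment fits in a few dozen lines only because demand is a single random draw and there are no lead times, suppliers, or physical constraints. Adding those is where custom simulators tend to stop being simple.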

Off-the-shelf simulation software packages

The most popular approach is to leverage existing software packages that provide extensive libraries to simulate a broad range of system types, from discrete processes and process manufacturing to supply chains and more.

There are, of course, quite a few players in that space. However, two of the most popular ones used for Autonomous Systems DRL training are AnyLogic and Simulink.

An important element to keep in mind is that these platforms support many modeling techniques, including physics-based, custom models, and many others. Deciding whether to use them is therefore a simulation strategy decision (“build vs. buy”), not a modeling technique selection. Project leads need to decide whether to “build” from scratch or “buy” from simulation experts, based on what is most appropriate from a business and technology strategy standpoint.

Custom-built deep learning AI simulations

However, not every system can be modeled using physics-based approaches or simulation software packages. In these situations, an option is to develop a custom AI that does not simulate the behavior of every element in the system, but just the outputs the system produces for every input.

This kind of black-box approach requires a large amount of training data. This requirement by itself can be quite limiting for specific use cases. However, it allows the simulator to abstract away the system complexity while still delivering an effective simulation for DRL training purposes.

The best way to get around the training data issue is to measure the system’s real-life inputs and outputs as it functions today, which can generate a large-scale training data set quickly. Capturing these measurements may involve using additional (edge) technologies. For instance, one can use a vision AI to capture parameters describing an output’s visual aspect.
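
The black-box idea can be sketched in a few lines of Python. Note the substitution: to stay dependency-free, a one-variable least-squares line stands in for the deep learning model, and the logged measurements are made-up illustrative values, not data from any real system. The `fit_surrogate` helper and the variable names are likewise hypothetical.

```python
from typing import Callable, Sequence


def fit_surrogate(inputs: Sequence[float],
                  outputs: Sequence[float]) -> Callable[[float], float]:
    """Fit output = slope * input + intercept to logged input/output pairs."""
    n = len(inputs)
    mean_x = sum(inputs) / n
    mean_y = sum(outputs) / n
    sxx = sum((x - mean_x) ** 2 for x in inputs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(inputs, outputs))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    def surrogate(x: float) -> float:
        # Predicts the system's output without modeling its internals.
        return slope * x + intercept

    return surrogate


# Made-up measurements standing in for data logged from the live system.
logged_inputs = [0.0, 1.0, 2.0, 3.0]
logged_outputs = [1.0, 3.0, 5.0, 7.0]

simulate = fit_surrogate(logged_inputs, logged_outputs)
print(simulate(4.0))
```

In a real project the fitted function would typically be a neural network trained on far more measurements, but the workflow is the same: log inputs and outputs, fit a model, then let the DRL agent interact with the fitted model instead of the live system.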

In the Cheetos customer story, for example, Neal Analytics built an AI to simulate the combined extruder and baking process. It was the only effective and practical solution to train the Project Bonsai AI brain. To train this simulator, the Neal team leveraged a custom product characteristics measurement system developed by the PepsiCo team to programmatically measure the visual characteristics of the Cheetos coming out of the oven.

To learn more about this project, please refer to this customer story:

Digital twins

The last type of simulation strategy is to leverage existing digital twins that manufacturers may provide when they supply their equipment. However, those twins will only be available for specific pieces of equipment, certain manufacturers, and most likely only for their most recent devices.

Also, even if a digital twin is available, the system the AI agent needs to control often comprises multiple pieces of equipment. Therefore, for digital twins to work for DRL training purposes, a mechanism must be found to stitch all those twins together in one overarching simulation. This is often challenging, especially as some elements might be missing.

For instance, if the system has three components but only two have a digital twin, this could be problematic. Not only will you need to create a dedicated simulation for the third element, but you will also need to find a way to combine the three simulators in one overarching model.
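
The stitching problem can be illustrated with a deliberately simplified Python sketch. It assumes each component simulator can be reduced to a function from one number to another, which real digital twins generally cannot. The component names, their formulas, and the `compose` helper are all hypothetical.

```python
from typing import Callable, Sequence

Component = Callable[[float], float]


def mixer_twin(feed_rate: float) -> float:
    """Stand-in for a vendor-supplied digital twin (hypothetical)."""
    return 0.9 * feed_rate


def oven_twin(flow: float) -> float:
    """Stand-in for a second vendor-supplied digital twin (hypothetical)."""
    return flow + 5.0


def conveyor_custom(flow: float) -> float:
    """Custom-built model for the component that has no twin (hypothetical)."""
    return min(flow, 50.0)  # assumed throughput cap


def compose(stages: Sequence[Component]) -> Component:
    """Chain component simulators into one system-level simulator."""

    def system(x: float) -> float:
        for stage in stages:
            x = stage(x)  # each stage consumes the previous stage's output
        return x

    return system


line = compose([mixer_twin, oven_twin, conveyor_custom])
print(line(10.0))
```

Here the chaining is trivial because every stage shares the same interface. In practice, twins from different manufacturers rarely agree on inputs, outputs, units, or time steps, and reconciling those differences is much of the stitching effort.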

Soon, as more digital twins are developed and the way they are built becomes more standardized, it should become easier to create digital twin–based system-level simulators.

Video: Using simulations for Deep Reinforcement Learning training

This video, the fourth in our five-part series on Autonomous Systems, provides more details about the five types of simulations used to train Autonomous Systems with deep reinforcement learning. These simulators can then be integrated into the Microsoft Project Bonsai platform as part of the end-to-end AI agent design, training, and deployment process.

(This article was originally published on the Neal Analytics blog)
