Distributed Deep Learning with Horovod Training Course
Blue Chip Training and Consulting
Skills for the next step in your career
Horovod is an open source software framework, designed for processing fast and efficient distributed deep learning models using TensorFlow, Keras, PyTorch, and Apache MXNet. It can scale up a single-GPU training script to run on multiple GPUs or hosts with minimal code changes.
This course is aimed at developers or data scientists who wish to use Horovod to run distributed deep learning trainings and scale it up to run across multiple GPUs in parallel.
Course Outline
Introduction
Installing and Configuring Horovod
Running Distributed Training
Optimizing Distributed Training Processes
Troubleshooting
Summary and Conclusion
Contact us
email - [email protected]