课程目录:Distributed Deep Learning with Horovod培训
4401 人关注
(78637/99817)
课程大纲:

         Distributed Deep Learning with Horovod培训

 

 

 

Introduction

Overview of Horovod features and concepts
Understanding the supported frameworks
Installing and Configuring Horovod

Preparing the hosting environment
Building Horovod for TensorFlow, Keras, PyTorch, and Apache MXNet
Running Horovod
Running Distributed Training

Modifying and running training examples with TensorFlow
Modifying and running training examples with Keras
Modifying and running training examples with PyTorch
Modifying and running training examples with Apache MXNet
Optimizing Distributed Training Processes

Running concurrent operations on multiple GPUs
Tuning hyperparameters
Enabling performance autotuning
Troubleshooting

Summary and Conclusion