GTC 2020: Improve ML Training Performance with Amazon SageMaker Debugger (Presented by Amazon Web Services)
After clicking “Watch Now” you will be prompted to login or join.
Click “Watch Now” to login or join the NVIDIA Developer Program.
Improve ML Training Performance with Amazon SageMaker Debugger (Presented by Amazon Web Services)
Shashank Prasanna, Amazon Web Services | Satadal Bhattacharjee, Amazon Web Services
During ML model training, it's challenging to ensure that models are progressively learning the correct values for different parameters and to analyze and debug model characteristics without building additional tools, making the process time-consuming and cumbersome. With Amazon SageMaker Debugger, developers can get complete insights into the training process by automating data capture and analysis from training runs without code changes. We'll take a closer look at how you can define rules to monitor and analyze tensors and watch for issues in your model. By monitoring training flow, developers can improve GPU utilization, reduce troubleshooting time during training, and build high-quality models.