Sagar Shelke works as a deep learning software engineer at NVIDIA, focusing on autonomous driving applications. His interests include neural network optimization for deployment and machine learning systems. Sagar holds a master's degree in electrical and computer engineering from San Diego State University.
Sparsity in INT8: Training Workflow and Best Practices for NVIDIA TensorRT Acceleration

The training stage of deep learning (DL) models consists of learning numerous dense floating-point weight matrices, which results in a massive amount of... (12 min read)

Accelerating Quantized Networks with the NVIDIA QAT Toolkit for TensorFlow and NVIDIA TensorRT

We’re excited to announce the NVIDIA Quantization-Aware Training (QAT) Toolkit for TensorFlow 2 with the goal of accelerating the quantized networks with... (9 min read)