Technical Walkthrough 0

Int4 Precision for AI Inference

INT4 Precision Can Bring an Additional 59% Speedup Compared to INT8 If there’s one constant in AI and deep learning, it’s never-ending optimization to wring… 5 MIN READ
Technical Walkthrough 0

MLPerf Inference: NVIDIA Innovations Bring Leading Performance

New TensorRT 6 Features Combine with Open-Source Plugins to Further Accelerate Inference Inference is where AI goes to work. Identifying diseases. 7 MIN READ
Technical Walkthrough 0

NVIDIA Boosts AI Performance in MLPerf v0.6

The relentless pace of innovation is most apparent in the AI domain. Researchers and developers discovering new network architectures… 10 MIN READ
Figure 1: The Tesla V100 Accelerator with Volta GV100 GPU. SXM2 Form Factor.
Technical Walkthrough 0

Mixed-Precision ResNet-50 Using Tensor Cores with TensorFlow

Mixed-Precision combines different numerical precisions in a computational method. Using precision lower than FP32 reduces memory usage… 2 MIN READ