Technical Blog
Tag: A100
Subscribe
Technical Walkthrough
Jun 16, 2022
Accelerating Quantized Networks with the NVIDIA QAT Toolkit for TensorFlow and NVIDIA TensorRT
We’re excited to announce the NVIDIA Quantization-Aware Training (QAT) Toolkit for TensorFlow 2 with the goal of accelerating the quantized networks with...
9 MIN READ
Technical Walkthrough
Jun 02, 2022
Fueling High-Performance Computing with Full-Stack Innovation
High-performance computing (HPC) has become the essential instrument of scientific discovery. Whether it is discovering new, life-saving drugs, battling...
8 MIN READ
Technical Walkthrough
May 25, 2022
Training a State-of-the-Art ImageNet-1K Visual Transformer Model using NVIDIA DGX SuperPOD
Recent work has demonstrated that large transformer models can achieve or advance the SOTA in computer vision tasks such as semantic segmentation and object...
9 MIN READ
Technical Walkthrough
Aug 25, 2021
Deploying NVIDIA Triton at Scale with MIG and Kubernetes
NVIDIA Triton Inference Server is an open-source AI model serving software that simplifies the deployment of trained AI models at scale in production. Clients...
24 MIN READ
Technical Walkthrough
Jan 27, 2021
Accelerating AI Training with NVIDIA TF32 Tensor Cores
NVIDIA Ampere GPU architecture introduced the third generation of Tensor Cores, with the new TensorFloat32 (TF32) mode for accelerating FP32 convolutions and...
10 MIN READ
Technical Walkthrough
Jan 26, 2021
Adding More Support in NVIDIA GPU Operator
Editor's note: Interested in GPU Operator? Register for our upcoming webinar on January 20th, "How to Easily use GPUs with Kubernetes". Reliably provisioning...
6 MIN READ