DEVELOPER BLOG

Tag: Tensor Cores

AI / Deep Learning

Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT

TensorRT is an SDK for high-performance deep learning inference, and TensorRT 8.0 introduces support for sparsity that uses sparse tensor cores on NVIDIA… 8 MIN READ
Graphics / Simulation

Tips: Getting the Most out of the DLSS Unreal Engine 4 Plugin

DLSS is a deep learning super-resolution network that boosts frame rates by rendering fewer pixels and then using AI to construct sharp… 5 MIN READ
AI / Deep Learning

Accelerating AI Training with NVIDIA TF32 Tensor Cores

The NVIDIA Ampere GPU architecture introduced the third generation of Tensor Cores, with the new TensorFloat32 (TF32) mode for accelerating FP32 convolutions and… 10 MIN READ
HPC

Bringing Tensor Cores to Standard Fortran

Tuned math libraries are an easy and dependable way to extract the ultimate performance from your HPC system. However, for long-lived applications or those that… 10 MIN READ
AI / Deep Learning

Accelerating TensorFlow on NVIDIA A100 GPUs

The NVIDIA A100, based on the NVIDIA Ampere GPU architecture, offers a suite of exciting new features: third-generation Tensor Cores, Multi-Instance GPU (MIG)… 12 MIN READ
AI / Deep Learning

Defining AI Innovation with NVIDIA DGX A100

Built on the brand-new NVIDIA A100 Tensor Core GPU, DGX A100 is the third generation of DGX systems and is the universal system for AI infrastructure. 15 MIN READ