DEVELOPER BLOG

Tag: Volta

AI / Deep Learning

Case Study: ResNet50 with DALI

Let’s imagine a situation. You buy a brand-new, cutting-edge, Volta-powered DGX-2 server. You’ve done your math right, expecting a 2x performance increase in… 12 MIN READ
AI / Deep Learning

Machine Learning Acceleration in Vulkan with Cooperative Matrices

Machine learning harnesses computing power to solve a variety of ‘hard’ problems that seemed impossible to program using traditional languages and techniques. 8 MIN READ
HPC

Tensor Core Programming Using CUDA Fortran

The CUDA Fortran compiler from PGI now supports programming Tensor Cores with NVIDIA’s Volta V100 and Turing GPUs. This enables scientific programmers using… 12 MIN READ
AI / Deep Learning

Speeding Up Semantic Segmentation Using MATLAB Container from NVIDIA NGC

MATLAB makes it easy for engineers to train deep-learning models for semantic segmentation, taking advantage of NVIDIA GPU acceleration 8 MIN READ
Artificial Intelligence

Video Series: Mixed-Precision Training Techniques Using Tensor Cores for Deep Learning

Neural networks with thousands of layers and millions of neurons demand high performance and faster training times. The complexity and size of neural networks… 5 MIN READ
Accelerated Computing

Using Tensor Cores for Mixed-Precision Scientific Computing

Double-precision floating point (FP64) has been the de facto standard for doing scientific simulation for several decades. Most numerical methods used in… 9 MIN READ