DEVELOPER BLOG

Tag: Automatic Mixed Precision (AMP)

AI / Deep Learning

Accelerating TensorFlow on NVIDIA A100 GPUs

The NVIDIA A100, based on the NVIDIA Ampere GPU architecture, offers a suite of exciting new features: third-generation Tensor Cores, Multi-Instance GPU (MIG)… 12 MIN READ
AI / Deep Learning

Develop Smaller Speech Recognition Models with NVIDIA’s NeMo Framework

As computers and other personal devices have become increasingly prevalent, interest in conversational AI has grown due to its multitude of potential… 7 MIN READ
AI / Deep Learning

Neural Modules for Fast Development of Speech and Language Models

As a researcher building state-of-the-art conversational AI models, you need to be able to quickly experiment with novel network architectures. 6 MIN READ
AI / Deep Learning

Creating an Object Detection Pipeline for GPUs

Earlier this year in March, we showed retinanet-examples, an open-source example of how to accelerate the training and deployment of an object detection… 16 MIN READ
AI / Deep Learning

Automatic Mixed Precision for NVIDIA Tensor Core Architecture in TensorFlow

Whether to employ mixed precision to train your TensorFlow models is no longer a tough decision. NVIDIA’s Automatic Mixed Precision (AMP) feature for TensorFlow… 5 MIN READ
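For reference, the AMP feature described in that post can be turned on with a single optimizer wrapper. The lines below are a minimal sketch, assuming TensorFlow 1.14 or later and an NVIDIA GPU with Tensor Cores; the optimizer and learning rate are illustrative only.

    import tensorflow as tf

    # Any standard TF1-style optimizer works here.
    optimizer = tf.train.AdamOptimizer(learning_rate=1e-3)

    # Wrapping the optimizer enables the AMP graph rewrite: eligible ops are
    # cast to FP16 for Tensor Cores, and dynamic loss scaling is added
    # automatically.
    optimizer = tf.train.experimental.enable_mixed_precision_graph_rewrite(optimizer)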
AI / Deep Learning

NVIDIA Apex: Tools for Easy Mixed-Precision Training in PyTorch

Most deep learning frameworks, including PyTorch, train using 32-bit floating point (FP32) arithmetic by default. However, using FP32 for all operations is not… 8 MIN READ
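As a point of comparison, a minimal mixed-precision training step with Apex looks roughly like the sketch below, assuming the apex package is installed and a CUDA-capable GPU is available; the layer sizes, loss, and opt_level are illustrative only.

    import torch
    from apex import amp

    model = torch.nn.Linear(1024, 1024).cuda()
    optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)

    # "O1" patches common ops to run in FP16 while keeping numerically
    # sensitive ops (e.g. softmax, normalization) in FP32.
    model, optimizer = amp.initialize(model, optimizer, opt_level="O1")

    inputs = torch.randn(64, 1024).cuda()
    target = torch.randn(64, 1024).cuda()

    loss = torch.nn.functional.mse_loss(model(inputs), target)
    # Scale the loss before backward so small FP16 gradients do not underflow.
    with amp.scale_loss(loss, optimizer) as scaled_loss:
        scaled_loss.backward()
    optimizer.step()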