Tensor Cores

An image of an NVIDIA H200 Tensor Core GPU.

Mar 27, 2024

NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records

Generative AI is unlocking new computing applications that greatly augment human capability, enabled by continued model innovation. Generative AI...

11 MIN READ

Apr 05, 2023

Setting New Records in MLPerf Inference v3.0 with Full-Stack Optimizations for AI

The most exciting computing applications currently rely on training and running inference on complex AI models, often in demanding, real-time deployment...

15 MIN READ

Mar 22, 2022

NVIDIA Hopper Architecture In-Depth

Today during the 2022 NVIDIA GTC Keynote address, NVIDIA CEO Jensen Huang introduced the new NVIDIA H100 Tensor Core GPU based on the new NVIDIA Hopper GPU...

36 MIN READ

Sep 24, 2021

Explore and Test Experimental Models for DLSS Research

Today, NVIDIA is enabling developers to explore and evaluate experimental AI models for Deep Learning Super Sampling (DLSS). Developers can download...

2 MIN READ

Jul 20, 2021

Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT

This post was updated July 20, 2021 to reflect NVIDIA TensorRT 8.0 updates. When deploying a neural network, it's useful to think about how the network could be...

8 MIN READ

Feb 17, 2021

Tips: Getting the Most out of the DLSS Unreal Engine 4 Plugin

DLSS is a deep learning, super-resolution network that boosts frame rates by rendering fewer pixels and then using AI to construct sharp, higher-resolution...

5 MIN READ

Jan 27, 2021

Accelerating AI Training with NVIDIA TF32 Tensor Cores

NVIDIA Ampere GPU architecture introduced the third generation of Tensor Cores, with the new TensorFloat32 (TF32) mode for accelerating FP32 convolutions and...

10 MIN READ

Aug 07, 2020

Bringing Tensor Cores to Standard Fortran

Tuned math libraries are an easy and dependable way to extract the ultimate performance from your HPC system. However, for long-lived applications or those that...

10 MIN READ

Jul 24, 2020

Accelerating TensorFlow on NVIDIA A100 GPUs

The NVIDIA A100, based on the NVIDIA Ampere GPU architecture, offers a suite of exciting new features: third-generation Tensor Cores, Multi-Instance GPU (MIG)...

12 MIN READ

May 14, 2020

Defining AI Innovation with NVIDIA DGX A100

Organizations of all kinds are incorporating AI into their research, development, product, and business processes. This helps them meet and exceed their...

15 MIN READ

May 14, 2020

NVIDIA Ampere Architecture In-Depth

Today, during the 2020 NVIDIA GTC keynote address, NVIDIA founder and CEO Jensen Huang introduced the new NVIDIA A100 GPU based on the new NVIDIA Ampere GPU...

30 MIN READ

May 09, 2020

Accelerating Medical Image Segmentation with NVIDIA Tensor Cores and TensorFlow 2

Figure 1. Example of a serial section Transmission Electron Microscopy image (ssTEM) and its corresponding segmentation. Medical image segmentation is a hot...

11 MIN READ

Apr 28, 2020

Using Windows ML, ONNX, and NVIDIA Tensor Cores

As more and more deep learning models are being deployed into production environments, there is a growing need for a separation between the work on the model...

13 MIN READ

Apr 21, 2020

Speeding Up Deep Learning Inference Using TensorRT

This...

22 MIN READ

Apr 03, 2020

Accelerating WinML and NVIDIA Tensor Cores

Figure 1. TensorCores. Every year, clever researchers introduce ever more complex and interesting deep learning models to the world. There is of course a big...

13 MIN READ

Dec 19, 2019

NVIDIA Developer Blog 2019 Highlights

We published nearly 100 technical blogs this year on the NVIDIA Developer Blog to help developers across a variety of industries develop their GPU-accelerated...

4 MIN READ