Tensor Cores
Mar 27, 2024
NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records
Generative AI is unlocking new computing applications that greatly augment human capability, enabled by continued model innovation. Generative AI...
11 MIN READ
Apr 05, 2023
Setting New Records in MLPerf Inference v3.0 with Full-Stack Optimizations for AI
The most exciting computing applications currently rely on training and running inference on complex AI models, often in demanding, real-time deployment...
15 MIN READ
Mar 22, 2022
NVIDIA Hopper Architecture In-Depth
Today during the 2022 NVIDIA GTC Keynote address, NVIDIA CEO Jensen Huang introduced the new NVIDIA H100 Tensor Core GPU based on the new NVIDIA Hopper GPU...
36 MIN READ
Sep 24, 2021
Explore and Test Experimental Models for DLSS Research
Today, NVIDIA is enabling developers to explore and evaluate experimental AI models for Deep Learning Super Sampling (DLSS). Developers can download...
2 MIN READ
Jul 20, 2021
Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT
This post was updated July 20, 2021 to reflect NVIDIA TensorRT 8.0 updates. When deploying a neural network, it's useful to think about how the network could be...
8 MIN READ
Feb 17, 2021
Tips: Getting the Most out of the DLSS Unreal Engine 4 Plugin
DLSS is a deep learning, super-resolution network that boosts frame rates by rendering fewer pixels and then using AI to construct sharp, higher-resolution...
5 MIN READ
Jan 27, 2021
Accelerating AI Training with NVIDIA TF32 Tensor Cores
The NVIDIA Ampere GPU architecture introduced the third generation of Tensor Cores, with the new TensorFloat32 (TF32) mode for accelerating FP32 convolutions and...
10 MIN READ
Aug 07, 2020
Bringing Tensor Cores to Standard Fortran
Tuned math libraries are an easy and dependable way to extract the ultimate performance from your HPC system. However, for long-lived applications or those that...
10 MIN READ
Jul 24, 2020
Accelerating TensorFlow on NVIDIA A100 GPUs
The NVIDIA A100, based on the NVIDIA Ampere GPU architecture, offers a suite of exciting new features: third-generation Tensor Cores, Multi-Instance GPU (MIG)...
12 MIN READ
May 14, 2020
Defining AI Innovation with NVIDIA DGX A100
Organizations of all kinds are incorporating AI into their research, development, product, and business processes. This helps them meet and exceed their...
15 MIN READ
May 14, 2020
NVIDIA Ampere Architecture In-Depth
Today, during the 2020 NVIDIA GTC keynote address, NVIDIA founder and CEO Jensen Huang introduced the new NVIDIA A100 GPU based on the new NVIDIA Ampere GPU...
30 MIN READ
May 09, 2020
Accelerating Medical Image Segmentation with NVIDIA Tensor Cores and TensorFlow 2
Figure 1. Example of a serial section Transmission Electron Microscopy image (ssTEM) and its corresponding segmentation. Medical image segmentation is a hot...
11 MIN READ
Apr 28, 2020
Using Windows ML, ONNX, and NVIDIA Tensor Cores
As more and more deep learning models are being deployed into production environments, there is a growing need for a separation between the work on the model...
13 MIN READ
Apr 21, 2020
Speeding Up Deep Learning Inference Using TensorRT
This...
22 MIN READ
Apr 03, 2020
Accelerating WinML and NVIDIA Tensor Cores
Figure 1. Tensor Cores. Every year, clever researchers introduce ever more complex and interesting deep learning models to the world. There is of course a big...
13 MIN READ
Dec 19, 2019
NVIDIA Developer Blog 2019 Highlights
We published nearly 100 technical blogs this year on the NVIDIA Developer Blog to help developers across a variety of industries develop their GPU-accelerated...
4 MIN READ