Posts by Shar Narasimhan
News
Sep 14, 2022
NVIDIA, Arm, and Intel Publish FP8 Specification for Standardization as an Interchange Format for AI
AI processing requires full-stack innovation across hardware and software platforms to address the growing computational demands of neural networks. A key area...
4 MIN READ
Technical Walkthrough
May 11, 2022
Accelerating AI Inference Workloads with NVIDIA A30 GPU
NVIDIA A30 GPU is built on the latest NVIDIA Ampere Architecture to accelerate diverse workloads like AI inference at scale, enterprise training, and HPC...
6 MIN READ
Technical Walkthrough
Dec 01, 2021
Boosting NVIDIA MLPerf Training v1.1 Performance with Full Stack Optimization
Five months have passed since v1.0, so it is time for another round of the MLPerf training benchmark. In this v1.1 edition, optimization over the entire...
22 MIN READ
Technical Walkthrough
Jun 30, 2021
MLPerf v1.0 Training Benchmarks: Insights into a Record-Setting NVIDIA Performance
MLPerf is an industry-wide AI consortium tasked with developing a suite of performance benchmarks that cover a range of leading AI workloads widely in use. The...
31 MIN READ
Technical Walkthrough
Nov 23, 2020
Updating AI Product Performance from Throughput to Time-To-Solution
Data scientists and researchers work toward solving the grand challenges of humanity with AI projects such as developing autonomous cars or nuclear fusion...
9 MIN READ
Technical Walkthrough
Aug 13, 2019
NVIDIA Clocks World’s Fastest BERT Training Time and Largest Transformer Based Model, Paving Path For Advanced Conversational AI
NVIDIA DGX SuperPOD trains BERT-Large in just 47 minutes, and trains GPT-2 8B, the largest Transformer Network Ever with 8.3Bn parameters Conversational AI...
8 MIN READ