Benchmarks

May 15, 2023
Efficiently Scale LLM Training Across a Large GPU Cluster with Alpa and Ray
Recent years have seen a proliferation of large language models (LLMs) that extend beyond traditional language tasks to generative AI. This includes models like...
16 MIN READ

May 15, 2023
Accelerate Whole Exome Analysis with Deep Learning at 70% Cost Reduction Using NVIDIA Parabricks
The human exome is key to understanding and treating genetic disorders and disease. Although the exome makes up just over 1% of the human genome, it also...
9 MIN READ

May 05, 2023
Accelerating Redis Performance Using VMware vSphere 8 and NVIDIA BlueField DPUs
A shift to modern distributed workloads, along with higher networking speeds, has increased the overhead of infrastructure services. There are fewer CPU cycles...
10 MIN READ

Apr 18, 2023
Build High Performance Robotic Applications with NVIDIA Isaac ROS Developer Preview 3
Robots are increasing in complexity, with a higher degree of autonomy, a greater number and diversity of sensors, and more sensor fusion-based algorithms....
8 MIN READ

Apr 18, 2023
New GPU Library Lowers Compute Costs for Apache Spark ML
Spark MLlib is a key component of Apache Spark for large-scale machine learning and provides built-in implementations of many popular machine learning...
6 MIN READ

Apr 05, 2023
Setting New Records in MLPerf Inference v3.0 with Full-Stack Optimizations for AI
The most exciting computing applications currently rely on training and running inference on complex AI models, often in demanding, real-time deployment...
15 MIN READ

Feb 09, 2023
Massively Improved Multi-node NVIDIA GPU Scalability with GROMACS
GROMACS, a scientific software package widely used for simulating biomolecular systems, plays a crucial role in comprehending important biological processes...
8 MIN READ

Feb 02, 2023
Benchmarking Deep Neural Networks for Low-Latency Trading and Rapid Backtesting on NVIDIA GPUs
Lowering response times to new market events is a driving force in algorithmic trading. Latency-sensitive trading firms keep up with the ever-increasing pace of...
8 MIN READ

Dec 15, 2022
Best-in-Class Quantum Circuit Simulation at Scale with NVIDIA cuQuantum Appliance
Quantum algorithm researchers in government, enterprise, and academia are interested in developing and benchmarking novel quantum algorithms on ever-larger...
8 MIN READ

Nov 09, 2022
Tuning AI Infrastructure Performance with MLPerf HPC v2.0 Benchmarks
As the fusion of AI and simulation accelerates scientific discovery, the need has arisen for a means to measure and rank the speed and throughput for building...
14 MIN READ

Nov 09, 2022
Leading MLPerf Training 2.1 with Full Stack Optimizations for AI
MLPerf benchmarks, developed by MLCommons, are critical evaluation tools for organizations to measure the performance of their machine learning models' training...
14 MIN READ

Sep 08, 2022
Full-Stack Innovation Fuels Highest MLPerf Inference 2.1 Results for NVIDIA
Today’s AI-powered applications are enabling richer experiences, fueled by both larger and more complex AI models as well as the application of many models in...
14 MIN READ

Jun 30, 2022
The Full Stack Optimization Powering NVIDIA MLPerf Training v2.0 Performance
MLPerf benchmarks are developed by a consortium of AI leaders across industry, academia, and research labs, with the aim of providing standardized, fair, and...
14 MIN READ

Jun 22, 2022
Novel Transformer Model Achieves State-of-the-Art Benchmarks in 3D Medical Image Analysis
At the Computer Vision and Pattern Recognition Conference (CVPR), NVIDIA researchers are presenting over 35 papers. This includes work on Shifted WINdows UNEt...
6 MIN READ

Jun 02, 2022
Fueling High-Performance Computing with Full-Stack Innovation
High-performance computing (HPC) has become the essential instrument of scientific discovery. Whether it is discovering new, life-saving drugs, battling...
8 MIN READ

Apr 06, 2022
Getting the Best Performance on MLPerf Inference 2.0
Models like Megatron 530B are expanding the range of problems AI can address. However, as models continue to grow in complexity, they pose a twofold challenge for...
11 MIN READ