Benchmark
Apr 03, 2024
Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 2
Large-scale graph neural network (GNN) training presents formidable challenges, particularly concerning the scale and complexity of graph data. These challenges...
5 MIN READ
Mar 27, 2024
NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records
Generative AI is unlocking new computing applications that greatly augment human capability, enabled by continued model innovation. Generative AI...
11 MIN READ
Mar 20, 2024
Record-Breaking NVIDIA cuOpt Algorithms Deliver Route Optimization Solutions 100x Faster
NVIDIA cuOpt is an accelerated optimization engine for solving complex routing problems. It efficiently solves problems with different aspects such as breaks,...
13 MIN READ
Mar 19, 2024
NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy
Speech and translation AI models developed at NVIDIA are pushing the boundaries of performance and innovation. The NVIDIA Parakeet automatic speech recognition...
8 MIN READ
Mar 18, 2024
RAPIDS cuDF Accelerates pandas Nearly 150x with Zero Code Changes
At NVIDIA GTC 2024, it was announced that RAPIDS cuDF can now bring GPU acceleration to 9.5M million pandas users without requiring them to change their code....
5 MIN READ
Mar 18, 2024
Scaling Enterprise RAG with Accelerated Ethernet Networking and Networked Storage
In the era of generative AI, where machines are not just learning from data but generating human-like text, images, video, and more, retrieval-augmented...
10 MIN READ
Feb 22, 2024
Benchmarking NVIDIA Spectrum-X for AI Network Performance, Now Available from Supermicro
NVIDIA Spectrum-X is swiftly gaining traction as the leading networking platform tailored for AI in hyperscale cloud infrastructures. Spectrum-X networking...
6 MIN READ
Dec 18, 2023
Deploying Retrieval-Augmented Generation Applications on NVIDIA GH200 Delivers Accelerated Performance
Large language model (LLM) applications are essential in enhancing productivity across industries through natural language. However, their effectiveness is...
10 MIN READ
Dec 14, 2023
Achieving Top Inference Performance with the NVIDIA H100 Tensor Core GPU and NVIDIA TensorRT-LLM
Best-in-class AI performance requires an efficient parallel computing architecture, a productive tool stack, and deeply optimized algorithms. NVIDIA released...
4 MIN READ
Dec 12, 2023
Oracle Cloud Infrastructure Sets Quantitative Financial HPC Calculations Record with NVIDIA GPUs
NVIDIA A100 Tensor Core GPUs were featured in a stack that set several records in a recent STAC-A2â„¢ benchmark standard based on financial market risk...
1 MIN READ
Dec 12, 2023
Benchmarking Quantum Computing Applications with BMW Group and NVIDIA cuQuantum
Quantum computing has the potential to revolutionize various aspects of industry, ranging from numerical simulations and optimization of complex systems to...
5 MIN READ
Dec 04, 2023
NVIDIA TensorRT-LLM Enhancements Deliver Massive Large Language Model Speedups on NVIDIA H200
Large language models (LLMs) have seen dramatic growth over the last year, and the challenge of delivering great user experiences depends on both high-compute...
5 MIN READ
Dec 04, 2023
New NVIDIA NeMo Framework Features and NVIDIA H200 Supercharge LLM Training Performance and Versatility
The rapid growth in the size, complexity, and diversity of large language models (LLMs) continues to drive an insatiable need for AI training performance....
9 MIN READ
Nov 27, 2023
New Risk Calculation Record in Financial Services with Dell Technologies and NVIDIA H100 System for HPC and AI
End clients are working on converged HPC quant finance and AI business solutions. Dell Technologies, along with NVIDIA, is uniquely positioned to accelerate...
7 MIN READ
Nov 13, 2023
Simplifying GPU Programming for HPC with NVIDIA Grace Hopper Superchip
The new hardware developments in NVIDIA Grace Hopper Superchip systems enable some dramatic changes to the way developers approach GPU programming. Most...
17 MIN READ
Nov 08, 2023
Setting New Records at Data Center Scale Using NVIDIA H100 GPUs and NVIDIA Quantum-2 InfiniBand
Generative AI is rapidly transforming computing, unlocking new use cases and turbocharging existing ones. Large language models (LLMs), such as OpenAI’s GPT...
19 MIN READ