Benchmark

Decorative image of graphs as light web.

Apr 03, 2024

Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 2

Large-scale graph neural network (GNN) training presents formidable challenges, particularly concerning the scale and complexity of graph data. These challenges...

5 MIN READ

An image of an NVIDIA H200 Tensor Core GPU.

Mar 27, 2024

NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records

Generative AI is unlocking new computing applications that greatly augment human capability, enabled by continued model innovation. Generative AI...

11 MIN READ

Mar 20, 2024

Record-Breaking NVIDIA cuOpt Algorithms Deliver Route Optimization Solutions 100x Faster

NVIDIA cuOpt is an accelerated optimization engine for solving complex routing problems. It efficiently solves problems with different aspects such as breaks,...

13 MIN READ

Mar 19, 2024

NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy

Speech and translation AI models developed at NVIDIA are pushing the boundaries of performance and innovation. The NVIDIA Parakeet automatic speech recognition...

8 MIN READ

Decorative image of a computer screen against a purple background, with a dial on the side.

Mar 18, 2024

RAPIDS cuDF Accelerates pandas Nearly 150x with Zero Code Changes

At NVIDIA GTC 2024, it was announced that RAPIDS cuDF can now bring GPU acceleration to 9.5M million pandas users without requiring them to change their code....

5 MIN READ

Decorative image of a RAG pipeline against a black background.

Mar 18, 2024

Scaling Enterprise RAG with Accelerated Ethernet Networking and Networked Storage

In the era of generative AI, where machines are not just learning from data but generating human-like text, images, video, and more, retrieval-augmented...

10 MIN READ

Feb 22, 2024

Benchmarking NVIDIA Spectrum-X for AI Network Performance, Now Available from Supermicro

NVIDIA Spectrum-X is swiftly gaining traction as the leading networking platform tailored for AI in hyperscale cloud infrastructures. Spectrum-X networking...

6 MIN READ

Dec 18, 2023

Deploying Retrieval-Augmented Generation Applications on NVIDIA GH200 Delivers Accelerated Performance

Large language model (LLM) applications are essential in enhancing productivity across industries through natural language. However, their effectiveness is...

10 MIN READ

Dec 14, 2023

Achieving Top Inference Performance with the NVIDIA H100 Tensor Core GPU and NVIDIA TensorRT-LLM

Best-in-class AI performance requires an efficient parallel computing architecture, a productive tool stack, and deeply optimized algorithms. NVIDIA released...

4 MIN READ

Dec 12, 2023

Oracle Cloud Infrastructure Sets Quantitative Financial HPC Calculations Record with NVIDIA GPUs

NVIDIA A100 Tensor Core GPUs were featured in a stack that set several records in a recent STAC-A2™ benchmark standard based on financial market risk...

1 MIN READ

Dec 12, 2023

Benchmarking Quantum Computing Applications with BMW Group and NVIDIA cuQuantum

Quantum computing has the potential to revolutionize various aspects of industry, ranging from numerical simulations and optimization of complex systems to...

5 MIN READ

An illustration showing the steps "LLM" then "Optimize" then "Deploy."

Dec 04, 2023

NVIDIA TensorRT-LLM Enhancements Deliver Massive Large Language Model Speedups on NVIDIA H200

Large language models (LLMs) have seen dramatic growth over the last year, and the challenge of delivering great user experiences depends on both high-compute...

5 MIN READ

Illustration representing NeMo Framework.

Dec 04, 2023

New NVIDIA NeMo Framework Features and NVIDIA H200 Supercharge LLM Training Performance and Versatility

The rapid growth in the size, complexity, and diversity of large language models (LLMs) continues to drive an insatiable need for AI training performance....

9 MIN READ

Image of GPU on black background with an artful spotlight.

Nov 27, 2023

New Risk Calculation Record in Financial Services with Dell Technologies and NVIDIA H100 System for HPC and AI

End clients are working on converged HPC quant finance and AI business solutions. Dell Technologies, along with NVIDIA, is uniquely positioned to accelerate...

7 MIN READ

Nov 13, 2023

Simplifying GPU Programming for HPC with NVIDIA Grace Hopper Superchip

The new hardware developments in NVIDIA Grace Hopper Superchip systems enable some dramatic changes to the way developers approach GPU programming. Most...

17 MIN READ

Nov 08, 2023

Setting New Records at Data Center Scale Using NVIDIA H100 GPUs and NVIDIA Quantum-2 InfiniBand

Generative AI is rapidly transforming computing, unlocking new use cases and turbocharging existing ones. Large language models (LLMs), such as OpenAI’s GPT...

19 MIN READ