Supercomputing

May 10, 2024

Dynamic Control Flow in CUDA Graphs with Conditional Nodes

CUDA Graphs can provide a significant performance increase, as the driver is able to optimize execution using the complete description of tasks and...

7 MIN READ

Mar 13, 2024

An Introduction to Quantum Accelerated Supercomputing

The development of useful quantum computing is a massive global effort, spanning government, enterprise, and academia. The benefits of quantum computing could...

10 MIN READ

Decorative image of matrices on a black background, with the text, "Part 2."

Mar 08, 2024

cuTENSOR 2.0: Applications and Performance

While part 1 focused on the usage of the new NVIDIA cuTENSOR 2.0 CUDA math library, this post introduces a variety of usage modes beyond that, specifically...

9 MIN READ

Decorative image of matrices on a black background, with the text "Part 1."

Mar 08, 2024

cuTENSOR 2.0: A Comprehensive Guide for Accelerating Tensor Computations

NVIDIA cuTENSOR is a CUDA math library that provides optimized implementations of tensor operations where tensors are dense, multi-dimensional arrays or array...

17 MIN READ

Feb 07, 2024

NVIDIA CUDA-Q Introduces More Capabilities for Quantum Accelerated Supercomputing

NVIDIA CUDA-Q is an open-source programming model for building quantum-classical applications. Useful quantum computing workloads will run on heterogeneous...

6 MIN READ

Feb 01, 2024

Just Released: NVIDIA HPC SDK v24.1

This NVIDIA HPC SDK update includes the cuBLASMp preview library, along with minor bug fixes and enhancements.

1 MIN READ

Decorative image of light fields in green, purple, and blue.

Jan 05, 2024

Improving CUDA Initialization Times Using cgroups in Certain Scenarios

Many CUDA applications running on multi-GPU platforms usually use a single GPU for their compute needs. In such scenarios, a performance penalty is paid by...

5 MIN READ

Dec 12, 2023

Oracle Cloud Infrastructure Sets Quantitative Financial HPC Calculations Record with NVIDIA GPUs

NVIDIA A100 Tensor Core GPUs were featured in a stack that set several records in a recent STAC-A2™ benchmark standard based on financial market risk...

1 MIN READ

Nov 28, 2023

One Giant Superchip for LLMs, Recommenders, and GNNs: Introducing NVIDIA GH200 NVL32

At AWS re:Invent 2023, AWS and NVIDIA announced that AWS will be the first cloud provider to offer NVIDIA GH200 Grace Hopper Superchips interconnected with...

9 MIN READ

An illustration representing HPC applications.

Nov 16, 2023

Unlock the Power of NVIDIA Grace and NVIDIA Hopper Architectures with Foundational HPC Software

High-performance computing (HPC) powers applications in simulation and modeling, healthcare and life sciences, industry and engineering, and more. In the modern...

7 MIN READ

Nov 13, 2023

Optimize Energy Efficiency of Multi-Node VASP Simulations with NVIDIA Magnum IO

Computational energy efficiency has become a primary decision criterion for most supercomputing centers. Data centers, once built, are capped in terms of the...

17 MIN READ

Oct 05, 2023

Just Released: NVIDIA HPC SDK 23.9

This NVIDIA HPC SDK 23.9 update expands platform support and provides minor updates.

1 MIN READ

Abstract image with three different illustrations representing HPC applications.

Jul 31, 2023

Just Released: NVIDIA HPC SDK v23.7

NVIDIA HPC SDK version 23.7 is now available and provides minor updates and enhancements.

1 MIN READ

Jul 19, 2023

Programming the Quantum-Classical Supercomputer

Heterogeneous computing architectures—those that incorporate a variety of processor types working in tandem—have proven extremely valuable in the continued...

9 MIN READ

Jun 27, 2023

Breaking MLPerf Training Records with NVIDIA H100 GPUs

At the heart of the rapidly expanding set of AI-powered applications are powerful AI models. Before these models can be deployed, they must be trained through a...

15 MIN READ

Jun 21, 2023

Optimizing Ethernet-Based AI Management Fabrics with MLAG

For HPC clusters purposely built for AI training, such as the NVIDIA DGX BasePOD and NVIDIA DGX SuperPOD, fine-tuning the cluster is critical to increasing and...

7 MIN READ