Supercomputing
May 10, 2024
Dynamic Control Flow in CUDA Graphs with Conditional Nodes
CUDA Graphs can provide a significant performance increase, as the driver is able to optimize execution using the complete description of tasks and...
7 MIN READ
Mar 13, 2024
An Introduction to Quantum Accelerated Supercomputing
The development of useful quantum computing is a massive global effort, spanning government, enterprise, and academia. The benefits of quantum computing could...
10 MIN READ
Mar 08, 2024
cuTENSOR 2.0: Applications and Performance
While part 1 focused on the usage of the new NVIDIA cuTENSOR 2.0 CUDA math library, this post introduces a variety of usage modes beyond that, specifically...
9 MIN READ
Mar 08, 2024
cuTENSOR 2.0: A Comprehensive Guide for Accelerating Tensor Computations
NVIDIA cuTENSOR is a CUDA math library that provides optimized implementations of tensor operations where tensors are dense, multi-dimensional arrays or array...
17 MIN READ
Feb 07, 2024
NVIDIA CUDA-Q Introduces More Capabilities for Quantum Accelerated Supercomputing
NVIDIA CUDA-Q is an open-source programming model for building quantum-classical applications. Useful quantum computing workloads will run on heterogeneous...
6 MIN READ
Feb 01, 2024
Just Released: NVIDIA HPC SDK v24.1
This NVIDIA HPC SDK update includes the cuBLASMp preview library, along with minor bug fixes and enhancements.
1 MIN READ
Jan 05, 2024
Improving CUDA Initialization Times Using cgroups in Certain Scenarios
Many CUDA applications running on multi-GPU platforms usually use a single GPU for their compute needs. In such scenarios, a performance penalty is paid by...
5 MIN READ
Dec 12, 2023
Oracle Cloud Infrastructure Sets Quantitative Financial HPC Calculations Record with NVIDIA GPUs
NVIDIA A100 Tensor Core GPUs were featured in a stack that set several records in a recent STAC-A2™ benchmark standard based on financial market risk...
1 MIN READ
Nov 28, 2023
One Giant Superchip for LLMs, Recommenders, and GNNs: Introducing NVIDIA GH200 NVL32
At AWS re:Invent 2023, AWS and NVIDIA announced that AWS will be the first cloud provider to offer NVIDIA GH200 Grace Hopper Superchips interconnected with...
9 MIN READ
Nov 16, 2023
Unlock the Power of NVIDIA Grace and NVIDIA Hopper Architectures with Foundational HPC Software
High-performance computing (HPC) powers applications in simulation and modeling, healthcare and life sciences, industry and engineering, and more. In the modern...
7 MIN READ
Nov 13, 2023
Optimize Energy Efficiency of Multi-Node VASP Simulations with NVIDIA Magnum IO
Computational energy efficiency has become a primary decision criterion for most supercomputing centers. Data centers, once built, are capped in terms of the...
17 MIN READ
Oct 05, 2023
Just Released: NVIDIA HPC SDK 23.9
This NVIDIA HPC SDK 23.9 update expands platform support and provides minor updates.
1 MIN READ
Jul 31, 2023
Just Released: NVIDIA HPC SDK v23.7
NVIDIA HPC SDK version 23.7 is now available and provides minor updates and enhancements.
1 MIN READ
Jul 19, 2023
Programming the Quantum-Classical Supercomputer
Heterogeneous computing architectures—those that incorporate a variety of processor types working in tandem—have proven extremely valuable in the continued...
9 MIN READ
Jun 27, 2023
Breaking MLPerf Training Records with NVIDIA H100 GPUs
At the heart of the rapidly expanding set of AI-powered applications are powerful AI models. Before these models can be deployed, they must be trained through a...
15 MIN READ
Jun 21, 2023
Optimizing Ethernet-Based AI Management Fabrics with MLAG
For HPC clusters purposely built for AI training, such as the NVIDIA DGX BasePOD and NVIDIA DGX SuperPOD, fine-tuning the cluster is critical to increasing and...
7 MIN READ