Accelerated Computing Libraries
Nov 18, 2024
Accelerate Drug and Material Discovery with New Math Library NVIDIA cuEquivariance
AI models for science are often trained to make predictions about the workings of nature, such as predicting the structure of a biomolecule or the properties of...
8 MIN READ
Nov 14, 2024
Just Released: NVIDIA HPC SDK v24.11
The new release includes several enhancements to the Math Libraries and improvements for C++ programming.
1 MIN READ
Oct 09, 2024
Just Released: Updated Math Libraries in CUDA Toolkit 12.6.2
CUDA Toolkit 12.6.2 improves performance and provides new features in cuBLAS, cuSOLVER, and cuFFT LTO libraries.
1 MIN READ
Oct 03, 2024
Event: NVIDIA cuOpt at INFORMS 2024
Join NVIDIA cuOpt engineers at INFORMS 2024 on October 22-23 to learn how to revolutionize accelerated computing.
1 MIN READ
Sep 16, 2024
Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
8 MIN READ
Aug 01, 2024
Just Released: CUDA Toolkit 12.6
The release supports GB100 capabilities and new library enhancements to cuBLAS, cuFFT, cuSOLVER, cuSPARSE, as well as the release of Nsight Compute 2024.3.
1 MIN READ
Jul 11, 2024
Next Generation of FlashAttention
NVIDIA is excited to collaborate with Colfax, Together.ai, Meta, and Princeton University on their recent achievement to exploit the Hopper GPU architecture and...
1 MIN READ
Apr 19, 2024
Measuring the GPU Occupancy of Multi-stream Workloads
NVIDIA GPUs are becoming increasingly powerful with each new generation. This increase generally comes in two forms. Each streaming multi-processor (SM), the...
11 MIN READ
Apr 11, 2024
New Video Series: OpenUSD for Developers
Universal Scene Description, also called OpenUSD or USD, is an open and extensible framework for creating, editing, querying, rendering, collaborating, and...
3 MIN READ
Mar 27, 2024
Efficient CUDA Debugging: Using NVIDIA Compute Sanitizer with NVIDIA Tools Extension and Creating Custom Tools
NVIDIA Compute Sanitizer is a powerful tool that can save you time and effort while improving the reliability and performance of your CUDA applications....
14 MIN READ
Mar 25, 2024
Building High-Performance Applications in the Era of Accelerated Computing
AI is augmenting high-performance computing (HPC) with novel approaches to data processing, simulation, and modeling. Because of the computational requirements...
6 MIN READ
Mar 08, 2024
cuTENSOR 2.0: Applications and Performance
While part 1 focused on the usage of the new NVIDIA cuTENSOR 2.0 CUDA math library, this post introduces a variety of usage modes beyond that, specifically...
9 MIN READ
Mar 08, 2024
cuTENSOR 2.0: A Comprehensive Guide for Accelerating Tensor Computations
NVIDIA cuTENSOR is a CUDA math library that provides optimized implementations of tensor operations where tensors are dense, multi-dimensional arrays or array...
17 MIN READ
Oct 22, 2023
Differentiable Slang: Example Applications
Differentiable Slang easily integrates with existing codebases—from Python, PyTorch, and CUDA to HLSL—to aid multiple computer graphics tasks and enable...
6 MIN READ
Oct 22, 2023
Differentiable Slang: A Shading Language for Renderers That Learn
NVIDIA just released a SIGGRAPH Asia 2023 research paper, SLANG.D: Fast, Modular and Differentiable Shader Programming. The paper shows how a single language...
12 MIN READ
Oct 05, 2023
Power Optimization with NVIDIA Jetson
When working with embedded systems such as the Jetson modules, you must optimize your application based on your power budget and compute resources. To avoid...
7 MIN READ