Accelerated Computing Libraries
 
    
        
          Aug 04, 2025
        
      
      NVIDIA CUDA-Q 0.12 Expands Toolset for Developing Hardware-Performant Quantum Applications
          NVIDIA CUDA-Q 0.12 introduces new simulation tools for accelerating how researchers develop quantum applications and design performant quantum hardware. With...
        
      
        7 MIN READ
      
      
     
    
        
          Nov 18, 2024
        
      
      Accelerate Drug and Material Discovery with New Math Library NVIDIA cuEquivariance
          AI models for science are often trained to make predictions about the workings of nature, such as predicting the structure of a biomolecule or the properties of...
        
      
        8 MIN READ
      
      
     
    
        
          Nov 14, 2024
        
      
      Just Released: NVIDIA HPC SDK v24.11
          The new release includes several enhancements to the Math Libraries and improvements for C++ programming.
        
      
         1 MIN READ
      
      
     
    
        
          Oct 09, 2024
        
      
      Just Released: Updated Math Libraries in CUDA Toolkit 12.6.2
          CUDA Toolkit 12.6.2 improves performance and provides new features in cuBLAS, cuSOLVER, and cuFFT LTO libraries.
        
      
         1 MIN READ
      
      
     
    
        
          Oct 03, 2024
        
      
      Event: NVIDIA cuOpt at INFORMS 2024
          Join NVIDIA cuOpt engineers at INFORMS 2024 on October 22-23 to learn how to revolutionize accelerated computing.
        
      
         1 MIN READ
      
      
     
    
        
          Sep 16, 2024
        
      
      Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
          For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
        
      
        8 MIN READ
      
      
     
    
        
          Aug 01, 2024
        
      
      Just Released: CUDA Toolkit 12.6
          The release supports GB100 capabilities and new library enhancements to cuBLAS, cuFFT, cuSOLVER, cuSPARSE, as well as the release of Nsight Compute 2024.3.
        
      
         1 MIN READ
      
      
     
    
        
          Jul 11, 2024
        
      
      Next Generation of FlashAttention
          NVIDIA is excited to collaborate with Colfax, Together.ai, Meta, and Princeton University on their recent achievement to exploit the Hopper GPU architecture and...
        
      
         1 MIN READ
      
      
     
    
        
          Apr 19, 2024
        
      
      Measuring the GPU Occupancy of Multi-stream Workloads
          NVIDIA GPUs are becoming increasingly powerful with each new generation. This increase generally comes in two forms. Each streaming multiprocessor (SM), the...
        
      
        11 MIN READ
      
      
     
    
        
          Apr 11, 2024
        
      
      New Video Series: OpenUSD for Developers
          Universal Scene Description, also called OpenUSD or USD, is an open and extensible framework for creating, editing, querying, rendering, collaborating, and...
        
      
        3 MIN READ
      
      
     
    
        
          Mar 27, 2024
        
      
      Efficient CUDA Debugging: Using NVIDIA Compute Sanitizer with NVIDIA Tools Extension and Creating Custom Tools
          NVIDIA Compute Sanitizer is a powerful tool that can save you time and effort while improving the reliability and performance of your CUDA applications....
        
      
        14 MIN READ
      
      
     
    
        
          Mar 25, 2024
        
      
      Building High-Performance Applications in the Era of Accelerated Computing
          AI is augmenting high-performance computing (HPC) with novel approaches to data processing, simulation, and modeling. Because of the computational requirements...
        
      
        6 MIN READ
      
      
     
    
        
          Mar 08, 2024
        
      
      cuTENSOR 2.0: Applications and Performance
          While part 1 focused on the usage of the new NVIDIA cuTENSOR 2.0 CUDA math library, this post introduces a variety of usage modes beyond that, specifically...
        
      
        9 MIN READ
      
      
     
    
        
          Mar 08, 2024
        
      
      cuTENSOR 2.0: A Comprehensive Guide for Accelerating Tensor Computations
          NVIDIA cuTENSOR is a CUDA math library that provides optimized implementations of tensor operations where tensors are dense, multi-dimensional arrays or array...
        
      
        17 MIN READ
      
      
     
    
        
          Oct 22, 2023
        
      
      Differentiable Slang: Example Applications
          Differentiable Slang easily integrates with existing codebases—from Python, PyTorch, and CUDA to HLSL—to aid multiple computer graphics tasks and enable...
        
      
        6 MIN READ
      
      
     
    
        
          Oct 22, 2023
        
      
      Differentiable Slang: A Shading Language for Renderers That Learn
          NVIDIA just released a SIGGRAPH Asia 2023 research paper, SLANG.D: Fast, Modular and Differentiable Shader Programming. The paper shows how a single language...
        
      
        12 MIN READ