Boosting Application Performance with GPU Memory Access Tuning

In this post, we examine a method programmers can use to saturate memory bandwidth on a GPU. 13 MIN READ
Just Released: nvCOMP v2.3

The CUDA library nvCOMP now supports the Zstandard and Deflate standards, along with modified-CRC32 checksums and improved ANS performance. < 1
Just Released: cuTENSOR v1.5

The high-performance CUDA library for tensor primitives now features updates to increase support, fix bugs, stop false-positive CUDA API errors, and more. < 1
Just Released: cuSPARSELt v0.3

The NVIDIA cuSPARSELt update expands the high-performance CUDA library with support for vectors of alpha and beta scalars, GeLU scaling, Split-K mode, and more. < 1
Improve Guidance and Performance Visualization with the New Nsight Compute

Learn more about new features and ways to improve system performance using Nsight Compute 2022.2. 3 MIN READ
NVIDIA Releases Open-Source GPU Kernel Modules

The first open-source release of GPU kernel modules for the Linux community helps improve NVIDIA GPU driver quality and security. 8 MIN READ