Programming Languages / Compilers

Sep 11, 2024
Advanced Strategies for High-Performance GPU Programming with NVIDIA CUDA
Stephen Jones, a leading expert and distinguished NVIDIA CUDA architect, offers his guidance and insights with a deep dive into the complexities of mapping...
2 MIN READ

Aug 08, 2024
Improving GPU Performance by Reducing Instruction Cache Misses
GPUs are specially designed to crunch through massive amounts of data at high speed. They have a large amount of compute resources, called streaming...
12 MIN READ

Oct 22, 2023
Differentiable Slang: Example Applications
Differentiable Slang easily integrates with existing codebases—from Python, PyTorch, and CUDA to HLSL—to aid multiple computer graphics tasks and enable...
6 MIN READ

Oct 22, 2023
Differentiable Slang: A Shading Language for Renderers That Learn
NVIDIA just released a SIGGRAPH Asia 2023 research paper, SLANG.D: Fast, Modular and Differentiable Shader Programming. The paper shows how a single language...
12 MIN READ

Jul 25, 2023
Ask Me Anything: NVIDIA CUDA Toolkit 12
On July 26, connect with NVIDIA CUDA product team experts on the latest CUDA Toolkit 12.
1 MIN READ

Jul 06, 2023
CUDA Toolkit 12.2 Unleashes Powerful Features for Boosting Applications
The latest release of CUDA Toolkit 12.2 introduces a range of essential new features, modifications to the programming model, and enhanced support for hardware...
8 MIN READ

Jan 17, 2023
CUDA 12.0 Compiler Support for Runtime LTO Using nvJitLink Library
CUDA Toolkit 12.0 introduces a new nvJitLink library for Just-in-Time Link Time Optimization (JIT LTO) support. In the early days of CUDA, to get maximum...
14 MIN READ

Apr 28, 2022
Updating the CUDA Linux GPG Repository Key
To best ensure the security and reliability of our RPM and Debian package repositories, NVIDIA is updating and rotating the signing keys used by the apt,...
5 MIN READ

Apr 06, 2021
N Ways to SAXPY: Demonstrating the Breadth of GPU Programming Options
Back in 2012, NVIDIAN Mark Harris wrote Six Ways to Saxpy, demonstrating how to perform the SAXPY operation on a GPU in multiple ways, using different languages...
9 MIN READ

Nov 16, 2020
Accelerating Fortran DO CONCURRENT with GPUs and the NVIDIA HPC SDK
Fortran developers have long been able to accelerate their programs using CUDA Fortran or OpenACC. For more up-to-date information, please read Using Fortran...
13 MIN READ

Aug 04, 2020
Accelerating Standard C++ with GPUs Using stdpar
Historically, accelerating your C++ code with GPUs has not been possible in Standard C++ without using language extensions or additional libraries: CUDA C++...
19 MIN READ

Oct 25, 2017
High-Performance GPU Computing in the Julia Programming Language
Julia is a high-level programming language for mathematical computing that is as easy to use as Python, but as fast as C. The language has been created with...
10 MIN READ

Nov 11, 2014
12 Things You Should Know about the Tesla Accelerated Computing Platform
You may already know NVIDIA Tesla as a line of GPU accelerator boards optimized for high-performance, general-purpose computing. They are used for parallel...
14 MIN READ

Jul 02, 2012
Six Ways to SAXPY
For even more ways to SAXPY using the latest NVIDIA HPC SDK with standard language parallelism, see N Ways to SAXPY: Demonstrating the Breadth of GPU...
8 MIN READ