Author: Mike Murphy | NVIDIA Technical Blog

Decorative image of light fields in green, purple, and blue.

Agentic AI / Generative AI Jun 18, 2024

Runtime Fatbin Creation Using the NVIDIA CUDA Toolkit 12.4 Compiler

CUDA Toolkit 12.4 introduced a new nvFatbin library for creating fatbins at runtime. Fatbins, otherwise known as NVIDIA device code fat binaries, are containers... 11 MIN READ

Simulation / Modeling / Design Jan 17, 2023

CUDA 12.0 Compiler Support for Runtime LTO Using nvJitLink Library

CUDA Toolkit 12.0 introduces a new nvJitLink library for Just-in-Time Link Time Optimization (JIT LTO) support. In the early days of CUDA, to get maximum... 14 MIN READ

Simulation / Modeling / Design Oct 25, 2021

Reducing Application Build Times Using CUDA C++ Compilation Aids

The CUDA 11.5 C++ compiler addresses a growing customer request. Specifically, how to reduce CUDA application build times. Along with eliminating unused... 13 MIN READ

Computer Vision / Video Analytics Apr 15, 2021

Programming Efficiently with the NVIDIA CUDA 11.3 Compiler Toolchain

The CUDA 11.3 release of the CUDA C++ compiler toolchain incorporates new features aimed at improving developer productivity and code performance. NVIDIA is... 15 MIN READ

Data Center / Cloud Feb 12, 2021

Boosting Productivity and Performance with the NVIDIA CUDA 11.2 C++ Compiler

The 11.2 CUDA C++ compiler incorporates features and enhancements aimed at improving developer productivity and the performance of GPU-accelerated applications.... 21 MIN READ

Data Center / Cloud Feb 12, 2021

Improving GPU Application Performance with NVIDIA CUDA 11.2 Device Link Time Optimization

CUDA 11.2 features the powerful link time optimization (LTO) feature for device code in GPU-accelerated applications. Device LTO brings the performance... 14 MIN READ

Mike Murphy