Posts by Mike Murphy
Generative AI
Jun 18, 2024
Runtime Fatbin Creation Using the NVIDIA CUDA Toolkit 12.4 Compiler
CUDA Toolkit 12.4 introduced a new nvFatbin library for creating fatbins at runtime. Fatbins, otherwise known as NVIDIA device code fat binaries, are containers...
11 MIN READ
Simulation / Modeling / Design
Jan 17, 2023
CUDA 12.0 Compiler Support for Runtime LTO Using nvJitLink Library
CUDA Toolkit 12.0 introduces a new nvJitLink library for Just-in-Time Link Time Optimization (JIT LTO) support. In the early days of CUDA, to get maximum...
14 MIN READ
Simulation / Modeling / Design
Oct 25, 2021
Reducing Application Build Times Using CUDA C++ Compilation Aids
The CUDA 11.5 C++ compiler addresses a growing customer request. Specifically, how to reduce CUDA application build times. Along with eliminating unused...
13 MIN READ
Computer Vision / Video Analytics
Apr 15, 2021
Programming Efficiently with the NVIDIA CUDA 11.3 Compiler Toolchain
The CUDA 11.3 release of the CUDA C++ compiler toolchain incorporates new features aimed at improving developer productivity and code performance. NVIDIA is...
15 MIN READ
Data Center / Cloud
Feb 12, 2021
Boosting Productivity and Performance with the NVIDIA CUDA 11.2 C++ Compiler
The 11.2 CUDA C++ compiler incorporates features and enhancements aimed at improving developer productivity and the performance of GPU-accelerated applications....
21 MIN READ
Data Center / Cloud
Feb 12, 2021
Improving GPU Application Performance with NVIDIA CUDA 11.2 Device Link Time Optimization
CUDA 11.2 features the powerful link time optimization (LTO) feature for device code in GPU-accelerated applications. Device LTO brings the performance...
14 MIN READ