Posts by Matthew Nicely
Models / Libraries / Frameworks
Feb 03, 2025
Just Released: CUTLASS 3.8
CUTLASS 3.8 provides support for the NVIDIA Blackwell SM100 architecture. CUTLASS is a collection of CUDA C++ templates and abstractions for implementing high-performance...
1 MIN READ
Models / Libraries / Frameworks
Jan 31, 2025
Just Released: NVIDIA cuDNN 9.7
Bringing support for the NVIDIA Blackwell architecture across data center and GeForce products, NVIDIA cuDNN 9.7 delivers speedups of up to 84% for FP8 Flash...
1 MIN READ
Generative AI
May 24, 2024
Accelerating Transformers with NVIDIA cuDNN 9
The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library of deep learning primitives with state-of-the-art performance....
12 MIN READ
Simulation / Modeling / Design
Dec 12, 2022
CUDA Toolkit 12.0 Released for General Availability
NVIDIA announces the newest CUDA Toolkit software release, 12.0. This is the first major release in many years, and it focuses on new programming models...
12 MIN READ