Posts by Matthew Nicely
Generative AI / LLMs
May 24, 2024
Accelerating Transformers with NVIDIA cuDNN 9
The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library of deep learning primitives with state-of-the-art performance....
12 MIN READ
Simulation / Modeling / Design
Dec 12, 2022
CUDA Toolkit 12.0 Released for General Availability
NVIDIA announces the newest CUDA Toolkit software release, 12.0. This release is the first major release in many years and it focuses on new programming models...
12 MIN READ
Simulation / Modeling / Design
Nov 29, 2021
Programming Distributed Multi-GPU Tensor Operations with cuTENSOR v1.4
Today, NVIDIA is announcing the availability of cuTENSOR, version 1.4, which supports up to 64-dimensional tensors, distributed multi-GPU tensor...
2 MIN READ
Simulation / Modeling / Design
Nov 23, 2021
Implementing High Performance Matrix Multiplication Using CUTLASS v2.8
NVIDIA continues to enhance CUTLASS to provide extensive support for mixed-precision computations, providing specialized data-movement, and multiply-accumulate...
2 MIN READ