Matthew Nicely

Matthew Nicely is a senior product manager for Deep Learning Compilers at NVIDIA, working with cuDNN and CUTLASS. At NVIDIA, he has worked as a public sector solution architect and CUDA Math Libraries product manager. In 2019, he received his Ph.D. in computer engineering, focusing on algorithm optimizations on GPUs.

Posts by Matthew Nicely

Generative AI / LLMs

Accelerating Transformers with NVIDIA cuDNN 9

The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library for accelerating deep learning primitives with state-of-the-art performance.... 12 MIN READ
Simulation / Modeling / Design

CUDA Toolkit 12.0 Released for General Availability

NVIDIA announces the newest CUDA Toolkit software release, 12.0. This release is the first major release in many years and it focuses on new programming models... 12 MIN READ
Simulation / Modeling / Design

Programming Distributed Multi-GPU Tensor Operations with cuTENSOR v1.4

Today, NVIDIA is announcing the availability of cuTENSOR, version 1.4, which supports up to 64-dimensional tensors, distributed multi-GPU tensor... 2 MIN READ
Simulation / Modeling / Design

Implementing High Performance Matrix Multiplication Using CUTLASS v2.8

NVIDIA continues to enhance CUTLASS to provide extensive support for mixed-precision computations, providing specialized data-movement, and multiply-accumulate... 2 MIN READ