Matthew Nicely

Matthew Nicely joined NVIDIA in March 2019, having previously worked at the U.S. Army Aviation and Missile Research Development and Engineering Center, Huntsville, AL, USA. There, he focused on CUDA algorithm development and optimizations on the Jetson series. At NVIDIA, he has worked in the Federal segment assisting with CUDA development and optimizations, along with education and proof of concepts for customers on various NVIDIA tool sets, before recently transitioning to math libraries product manager. In 2019, he received his Ph.D. degree in computer engineering, focusing on algorithm optimizations on GPUs.
Avatar photo

Posts by Matthew Nicely

Technical Walkthrough 11

CUDA Toolkit 12.0 Released for General Availability

NVIDIA announces the newest CUDA Toolkit software release, 12.0. This release is the first major release in many years and it focuses on new programming models... 12 MIN READ
News 0

Programming Distributed Multi-GPU Tensor Operations with cuTENSOR v1.4

Today, NVIDIA is announcing the availability of cuTENSOR, version 1.4, which supports up to 64-dimensional tensors, distributed multi-GPU tensor... 2 MIN READ
News 1

Implementing High Performance Matrix Multiplication Using CUTLASS v2.8

NVIDIA continues to enhance CUTLASS to provide extensive support for mixed-precision computations, providing specialized data-movement, and multiply-accumulate... 2 MIN READ
News 0

Accelerating ReLu and GeLu Activation Functions, and Batched Sparse GEMM in cuSPARSELt v0.2.0

Today, NVIDIA is announcing the availability of cuSPARSELt, version 0.2.0, which increases performance on activation functions, bias vectors, and Batched Sparse... 2 MIN READ