Matthew Nicely

Matthew Nicely joined NVIDIA in March 2019 after working at the U.S. Army Aviation and Missile Research, Development, and Engineering Center in Huntsville, AL, USA, where he focused on CUDA algorithm development and optimization on the Jetson series. At NVIDIA, he has worked in the Federal segment, assisting customers with CUDA development and optimization and providing education and proofs of concept on various NVIDIA tool sets, before recently transitioning to the role of product manager for math libraries. He received his Ph.D. in computer engineering in 2019, with a focus on algorithm optimization on GPUs.

Posts by Matthew Nicely

News

Programming Distributed Multi-GPU Tensor Operations with cuTENSOR v1.4

Today, NVIDIA is announcing the availability of cuTENSOR, version 1.4, which supports up to 64-dimensional tensors, distributed multi-GPU tensor... 2 MIN READ

News

Implementing High Performance Matrix Multiplication Using CUTLASS v2.8

NVIDIA continues to enhance CUTLASS to provide extensive support for mixed-precision computations, providing specialized data-movement, and multiply-accumulate... 2 MIN READ
News

Accelerating ReLu and GeLu Activation Functions, and Batched Sparse GEMM in cuSPARSELt v0.2.0

Today, NVIDIA is announcing the availability of cuSPARSELt, version 0.2.0, which increases performance on activation functions, bias vectors, and Batched Sparse... 2 MIN READ
News

Using Fully Redesigned Batch API and Performance Optimizations in nvCOMP v2.1.0

Today, NVIDIA is announcing the availability of nvCOMP, version 2.1.0. This software can be downloaded now free of charge. What's New?... 2 MIN READ