Posts by John Tran
Technical Walkthrough
Jun 10, 2019
Tips for Optimizing GPU Performance Using Tensor Cores
Our most popular question is "What can I do to get great GPU performance for deep learning?" We’ve recently published a detailed Deep Learning Performance Guide…
13 MIN READ
Technical Walkthrough
Dec 05, 2017
CUTLASS: Fast Linear Algebra in CUDA C++
Update May 21, 2018: CUTLASS 1.0 is now available as Open Source software at the CUTLASS repository. CUTLASS 1.0 has changed substantially from our preview…
25 MIN READ