Greg Ruetsch

Greg Ruetsch is a senior applied engineer at NVIDIA, where he works on CUDA Fortran and performance optimization of HPC codes. He holds a bachelor’s degree in mechanical and aerospace engineering from Rutgers University and a Ph.D. in applied mathematics from Brown University. Prior to joining NVIDIA, he held research positions at Stanford University’s Center for Turbulence Research and Sun Microsystems Laboratories.

Posts by Greg Ruetsch

Technical Walkthrough

Using Tensor Cores in CUDA Fortran

Tensor Cores, which are programmable matrix multiply and accumulate units, were first introduced in the V100 GPUs where they operated on half-precision (16-bit)... 28 MIN READ
CUDA Fortran for Scientists and Engineers shows how high-performance application developers can leverage the power of GPUs using Fortran.
Technical Walkthrough

Pro Tip: Pinpointing Runtime Errors in CUDA Fortran

CUDA Fortran for Scientists and Engineers shows how high-performance application developers can... 4 MIN READ
GPU Pro Tip
Technical Walkthrough

CUDA Pro Tip: How to Call Batched cuBLAS routines from CUDA Fortran

CUDA Fortran for Scientists and Engineers shows how high-performance application developers can... 7 MIN READ
Technical Walkthrough

Peer-to-Peer Multi-GPU Transpose in CUDA Fortran (Book Excerpt)

This post is an excerpt from Chapter 4 of the book CUDA Fortran for Scientists and Engineers, by Gregory Ruetsch and Massimiliano Fatica. In this excerpt we... 12 MIN READ
Technical Walkthrough

Finite Difference Methods in CUDA Fortran, Part 2

CUDA Fortran for Scientists and Engineers shows how high-performance application developers can... 6 MIN READ
Technical Walkthrough

Finite Difference Methods in CUDA Fortran, Part 1

CUDA Fortran for Scientists and Engineers shows how high-performance application developers can... 9 MIN READ