Posts by Jeff Larkin
Technical Walkthrough
Jun 12, 2022
Using Fortran Standard Parallel Programming for GPU Acceleration
We present lessons learned from refactoring a Fortran application to use modern DO CONCURRENT loops in place of OpenACC for GPU acceleration.
12 MIN READ
Technical Walkthrough
Apr 18, 2022
Multi-GPU Programming with Standard Parallel C++, Part 2
By developing applications using MPI and standard C++ language features, it is possible to program for GPUs without sacrificing portability or performance.
13 MIN READ
Technical Walkthrough
Apr 18, 2022
Multi-GPU Programming with Standard Parallel C++, Part 1
By developing applications using MPI and standard C++ language features, it is possible to program for GPUs without sacrificing portability or performance.
17 MIN READ
Technical Walkthrough
Jan 12, 2022
Developing Accelerated Code with Standard Language Parallelism
Learn how standard language parallelism can be used for programming accelerated computing applications on NVIDIA GPUs with ISO C++, ISO Fortran, or Python.
11 MIN READ
Technical Walkthrough
Jul 13, 2015
Getting Started with OpenACC
This week NVIDIA has released the NVIDIA OpenACC Toolkit, a starting point for anyone interested in using OpenACC. OpenACC gives scientists and researchers a…
16 MIN READ
Technical Walkthrough
May 05, 2015
GPU Pro Tip: Track MPI Calls In The NVIDIA Visual Profiler
Often when profiling GPU-accelerated applications that run on clusters, one needs to visualize MPI (Message Passing Interface) calls on the GPU timeline in the…
5 MIN READ