Continents Apart, Two Students Share a Common Language: They Speak Robot

Corporate, mobile, Embedded Computing, Jetson, mobility

Lan Malin, posted Oct 28 2015

Two students, separated by different cultures and thousands of miles, share a common language. They speak robot. Doctoral candidate Raffaello Bonghi splits his time between Italy and France. Ryan Beethe is an undergrad in Colorado. They’re at the leading edge of bot builds.

Read more

Open, Reproducible Computational Chemistry with Python and CUDA

CUDA Spotlight, Computational Chemistry, CUDA, Python

Nadeem Mohammad, posted Oct 12 2015

Increasingly, computational chemistry researchers use GPUs to push the boundaries of discovery. This motivated Christopher Cooper, an Instructor at Universidad Técnica Federico Santa María in Chile, to move to a Python-based software stack. Cooper’s recent paper, “Probing protein orientation near charged nanosurfaces for simulation-assisted biosensor design,” was recently accepted in J. Chemical Physics. Brad: Can you talk a

Read more

Combine OpenACC and Unified Memory for Productivity and Performance

Features, CUDA, LULESH, OpenACC, Optimization, Unified Memory

Nadeem Mohammad, posted Oct 01 2015

The post Getting Started with OpenACC covered four steps to progressively accelerate your code with OpenACC. It’s often necessary to use OpenACC directives to express both loop parallelism and data locality in order to get good performance with accelerators. After expressing available parallelism, excessive data movement generated by the compiler can be a bottleneck, and correcting this by

Read more

Customize CUDA Fortran Profiling with NVTX

CUDA Pro Tip, CUDA Fortran, Optimization, Profiling

Nadeem Mohammad, posted Sep 29 2015

The NVIDIA Tools Extension (NVTX) library lets developers annotate custom events and ranges within the profiling timelines generated using tools such as the NVIDIA Visual Profiler (NVVP) and NSight. In my own optimization work, I rely heavily on NVTX to better understand internal as well as customer codes and to spot opportunities for better interaction

Read more

Simple, Portable Parallel C++ with Hemi 2 and CUDA 7.5

Features, C++, C++11, Hemi, Lambda

Nadeem Mohammad, posted Sep 24 2015

The last two releases of CUDA have added support for the powerful new features of C++. In the post The Power of C++11 in CUDA 7 I discussed the importance of C++11 for parallel programming on GPUs, and in the post New Features in CUDA 7.5 I introduced a new experimental feature in the NVCC CUDA C++ compiler:

Read more