Features, CUDA, LULESH, OpenACC, Optimization, Unified Memory
The post Getting Started with OpenACC covered four steps to progressively accelerate your code with OpenACC.
CUDA Pro Tip, CUDA Fortran, Optimization, Profiling
The NVIDIA Tools Extension (NVTX) library lets developers annotate custom events and ranges within the profiling timelines generated using tools such as the NVIDIA Visual Profiler (NVVP) and NSight.
Features, CUDA 7.5, Optimization, Profiling, tools
[Note: Thejaswi Rao also contributed to the code optimizations shown in this post.] Today NVIDIA released CUDA 7.5, the latest release of the powerful CUDA Toolkit.