Posts by Rajeshwari Devaramani
Models / Libraries / Frameworks
Sep 29, 2025
Unlock GPU Performance: Global Memory Access in CUDA
Managing memory is one of the most important performance characteristics to consider when writing a GPU kernel. This post walks you through the important...
15 MIN READ
Simulation / Modeling / Design
Aug 04, 2025
CUDA Pro Tip: Increase Performance with Vectorized Memory Access
Many CUDA kernels are bandwidth bound, and the increasing ratio of flops to bandwidth in new hardware results in more bandwidth bound kernels. This makes it...
6 MIN READ