Posts by Jake Hemstad
Development & Optimization
Jun 18, 2025
Compiler Explorer: An Essential Kernel Playground for CUDA Developers
Have you ever wondered exactly what the CUDA compiler generates when you write GPU kernels? Ever wanted to share a minimal CUDA example with a colleague...
7 MIN READ
Data Science
Mar 06, 2023
Maximizing Performance with Massively Parallel Hash Maps on GPUs
Decades of computer science history have been devoted to devising solutions for efficient storage and retrieval of information. Hash maps (or hash tables) are a...
19 MIN READ
Data Center / Cloud
Jul 27, 2021
Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 2
In part 1 of this series, we introduced new API functions, cudaMallocAsync and cudaFreeAsync, that enable memory allocation and deallocation to be...
9 MIN READ
Data Center / Cloud
Jul 27, 2021
Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 1
Most CUDA developers are familiar with the cudaMalloc and cudaFree API functions to allocate GPU accessible memory. However, there has long been an obstacle...
14 MIN READ