Mark Harris

Mark is an NVIDIA Distinguished Engineer working on RAPIDS. Mark has over twenty years of experience developing software for GPUs, ranging from graphics and games, to physically-based simulation, to parallel algorithms and high-performance computing. While a Ph.D. student at The University of North Carolina he recognized a nascent trend and coined a name for it: GPGPU (General-Purpose computing on Graphics Processing Units).

Posts by Mark Harris

Technical Walkthrough 1

Implementing High-Precision Decimal Arithmetic with CUDA int128

This post details CUDA's new int128 support and how to implement decimal fixed-point arithmetic on top of it. 19 MIN READ
Image depicting NVIDIA CEO Jen-Hsun Huang explaining the importance of the RAPIDS launch demo at GTC Europe 2018.
Technical Walkthrough 0

Fast, Flexible Allocation for NVIDIA CUDA with RAPIDS Memory Manager

When I joined the RAPIDS team in 2018, NVIDIA CUDA device memory allocation was a performance problem. RAPIDS cuDF allocates and deallocates memory at high… 24 MIN READ
Technical Walkthrough 0

CUDA Pro Tip: The Fast Way to Query Device Properties

CUDA applications often need to know the maximum available shared memory per block or to query the number of multiprocessors in the active GPU. One way to do… 3 MIN READ
Technical Walkthrough 0

RAPIDS Accelerates Data Science End-to-End

Today's data science problems demand a dramatic increase in the scale of data as well as the computational power required to process it. Unfortunately… 10 MIN READ
Technical Walkthrough 0

Cooperative Groups: Flexible CUDA Thread Programming

In efficient parallel algorithms, threads cooperate and share data to perform collective computations. To share data, the threads must synchronize. 16 MIN READ
Technical Walkthrough 0

Unified Memory for CUDA Beginners

This post introduces CUDA programming with Unified Memory, a single memory address space that is accessible from any GPU or CPU in a system. 16 MIN READ