Nader Al Awar

Nader Al Awar is a senior software engineer at NVIDIA and a member of the CUDA Core Compute Libraries (CCCL) team, where he focuses on the development of CUB and cuda.compute. He earned his doctorate in electrical and computer engineering from the University of Texas at Austin, specializing in high-performance computing for Python. Nader is passionate about bridging the gap between high-level languages and hardware by accelerating Python code using GPUs.
Avatar photo

Posts by Nader Al Awar

Decorative image.
Developer Tools & Techniques

Controlling Floating-Point Determinism in NVIDIA CCCL

A computation is considered deterministic if multiple runs with the same input data produce the same bitwise result. While this may seem like a simple property... 7 MIN READ
Developer Tools & Techniques

Topping the GPU MODE Kernel Leaderboard with NVIDIA cuda.compute

Python dominates machine learning for its ergonomics, but writing truly fast GPU code has historically meant dropping into C++ to write custom kernels and to... 5 MIN READ