Fred Oh

Fred is a senior product marketing manager for CUDA, CUDA on WSL, and CUDA Python. Fred has a B.S. in Computer Science and Math from UC Davis. He began his career as a UNIX software engineer porting kernel services and device drivers to x86 architectures. He loves Star Wars, Star Trek and the NBA Warriors.
Avatar photo

Posts by Fred Oh

Agentic AI / Generative AI

Accelerating Long-Context Inference with Skip Softmax in NVIDIA TensorRT-LLM

For machine learning engineers deploying LLMs at scale, the equation is familiar and unforgiving: as context length increases, attention computation costs... 6 MIN READ
Networking / Communications

New Scaling Algorithm and Initialization with NVIDIA Collective Communications Library 2.23

The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode communication primitives optimized for NVIDIA GPUs and networking. NCCL... 9 MIN READ
Simulation / Modeling / Design

Dynamic Loading in the CUDA Runtime

Historically, the GPU device code is compiled alongside the application with offline tools such as nvcc. In this case, the GPU device code is managed internally... 8 MIN READ
Simulation / Modeling / Design

CUDA Toolkit Now Available for NVIDIA Blackwell 

The latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and... 9 MIN READ
Simulation / Modeling / Design

Upcoming Event: CUDA Developer Meet Up in Silicon Valley

Whether you're just starting your GPU programming journey or you're a CUDA ninja looking to share advanced techniques, join us in San Jose on 1/30/25. 1 MIN READ
Decorative image of a cube of green cubes, surrounded by other cubes on a dark background.
Networking / Communications

Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22

For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes.... 8 MIN READ