Posts by Misbah Mubarak
Networking / Communications
Jul 22, 2025
Understanding NCCL Tuning to Accelerate GPU-to-GPU Communication
The NVIDIA Collective Communications Library (NCCL) is essential for fast GPU-to-GPU communication in AI workloads, using various optimizations and tuning to...
14 MIN READ
Data Center / Cloud
Jul 14, 2025
Enabling Fast Inference and Resilient Training with NCCL 2.27
As AI workloads scale, fast and reliable GPU communication becomes vital, not just for training, but increasingly for inference at scale. The NVIDIA Collective...
9 MIN READ
Networking / Communications
Jul 14, 2025
NCCL Deep Dive: Cross Data Center Communication and Network Topology Awareness
As the scale of AI training increases, a single data center (DC) is not sufficient to deliver the required computational power. Most recent approaches to...
9 MIN READ