Posts by Sylvain Jeaugey
Networking / Communications
Jan 31, 2025
New Scaling Algorithm and Initialization with NVIDIA Collective Communications Library 2.23
The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode communication primitives optimized for NVIDIA GPUs and networking. NCCL...
9 MIN READ
Networking / Communications
Sep 16, 2024
Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
8 MIN READ
Data Science
Feb 28, 2022
Doubling all2all Performance with NVIDIA Collective Communication Library 2.12
Collective communications are a performance-critical ingredient of modern distributed AI training workloads such as recommender systems and natural language...
8 MIN READ
Data Science
Feb 04, 2019
Massively Scale Your Deep Learning Training with NCCL 2.4
Imagine using tens of thousands of GPUs to train your neural network. Using multiple GPUs to train neural networks has become quite common with all deep...
8 MIN READ
Simulation / Modeling / Design
Sep 26, 2018
Scaling Deep Learning Training with NCCL
NVIDIA Collective Communications Library (NCCL) provides optimized implementations of inter-GPU communication operations, such as allreduce and variants....
6 MIN READ