Posts by Karthik Mandakolathur
Data Science
Feb 28, 2022
Doubling all2all Performance with NVIDIA Collective Communication Library 2.12
Collective communications are a performance-critical ingredient of modern distributed AI training workloads such as recommender systems and natural language...
8 MIN READ
Data Center / Cloud
Dec 01, 2021
Boosting NVIDIA MLPerf Training v1.1 Performance with Full Stack Optimization
Five months have passed since v1.0, so it is time for another round of the MLPerf training benchmark. In this v1.1 edition, optimization over the entire...
22 MIN READ