Pavel Shamis

Pavel (Pasha) Shamis is a distinguished engineer at NVIDIA in the AI Data-Infra Optimization group where his primary focus lies in optimizing efficiency of the AI software and hardware stack. Before joining NVIDIA, Pasha served as a senior principal research engineer at Arm for six years, working on co-designing software and hardware building blocks for large-scale distributed systems.
Avatar photo

Posts by Pavel Shamis

Image shows cloud-based GPU clusters dedicated to AI training.
Data Center / Cloud

Ensuring Reliable Model Training on NVIDIA DGX Cloud

Training AI models on massive GPU clusters presents significant challenges for model builders. Because manual intervention becomes impractical as job scale... 8 MIN READ