Swati Gupta

Swati Gupta is a systems software engineer at NVIDIA, where she works on GPU cloud infrastructure, orchestration, and monitoring. She focuses on enabling GPU-accelerated DL and AI workloads in container orchestration systems such as Kubernetes, OpenShift, and Docker.

Posts by Swati Gupta

Simulation / Modeling / Design

Monitoring GPUs in Kubernetes with DCGM

Monitoring GPUs is critical for infrastructure or site reliability engineering (SRE) teams who manage large-scale GPU clusters for AI or HPC workloads. GPU... 12 MIN READ