DEVELOPER BLOG

Swati Gupta

Swati Gupta is a systems software engineer at NVIDIA where she works on GPU cloud infrastructure, orchestration and monitoring. She is focused on enabling GPU-accelerated DL and AI workloads in container orchestration systems such as Kubernetes, OpenShift, and Docker.

Posts by Swati Gupta

AI / Deep Learning

Monitoring GPUs in Kubernetes with DCGM

Monitoring GPUs is critical for infrastructure or site reliability engineering (SRE) teams who manage large-scale GPU clusters for AI or HPC workloads. 12 MIN READ