Mark Chmarny

Mark Chmarny, a principal cloud architect in the NVIDIA DGX Cloud organization, specializes in large-scale distributed systems, container orchestration, and GPU-accelerated compute. He focuses on Kubernetes-based AI/ML platforms, high-performance GPU clusters, and multi-cloud infrastructure for training and inference.

Posts by Mark Chmarny

Data Center / Cloud

Automate Kubernetes AI Cluster Health with NVSentinel

Kubernetes underpins a large portion of all AI workloads in production. Yet, maintaining GPU nodes and ensuring that applications are running, training jobs are... 7 MIN READ