Mark Chmarny

Mark Chmarny, a principal cloud architect in the NVIDIA DGX Cloud organization, specializes in large-scale distributed systems, container orchestration, and GPU-accelerated compute. He focuses on Kubernetes-based AI/ML platforms, high-performance GPU clusters, and multi-cloud infrastructure for training and inference.
Avatar photo

Posts by Mark Chmarny

Decorative image.
Data Center / Cloud

Validate Kubernetes for GPU Infrastructure with Layered, Reproducible Recipes

Every AI cluster running on Kubernetes requires a full software stack that works together, from low-level driver and kernel settings to high-level operator and... 5 MIN READ
Data Center / Cloud

Automate Kubernetes AI Cluster Health with NVSentinel

Kubernetes underpins a large portion of all AI workloads in production. Yet, maintaining GPU nodes and ensuring that applications are running, training jobs are... 7 MIN READ