Nathan Taber

Nathan Taber is a product manager who has helped define how modern cloud and AI infrastructure are built. At AWS, he was a founding member of the Amazon EKS team and helped define Kubernetes at AWS through work on EKS, Karpenter, and the broader OSS ecosystem. At NVIDIA he helps define GPU-accelerated Kubernetes and health-automation patterns for large-scale AI infrastructure, influencing how cloud providers and their customers run production GPU workloads reliably at scale.
Avatar photo

Posts by Nathan Taber

Data Center / Cloud

Automate Kubernetes AI Cluster Health with NVSentinel

Kubernetes underpins a large portion of all AI workloads in production. Yet, maintaining GPU nodes and ensuring that applications are running, training jobs are... 7 MIN READ