Omri Cohen

Omri Cohen is a senior software engineer at NVIDIA. He is a maintainer for KAI Scheduler, and has been a developer in the Run:ai scheduler team for four years. Before that, he managed large-scale, multi-tenant AI Kubernetes clusters, making sure research teams get access to the resources they need, and helping researchers navigate Kubernetes for training and inference.
Avatar photo

Posts by Omri Cohen

Data Center / Cloud

Ensuring Balanced GPU Allocation in Kubernetes Clusters with Time-Based Fairshare

NVIDIA Run:ai v2.24 introduces time-based fairshare, a new scheduling mode that brings fair-share scheduling with time awareness for over-quota resources to... 11 MIN READ