Omer Dayan

Omer Dayan is a software developer at RunAI, a startup acquired by NVIDIA, specializing in AI infrastructure. Omer’s research focuses on improving the efficiency and performance of inference software. Omer is a maintainer of the open-source KAI Scheduler project and has a deep technical interest in Nintendo Game Boy architecture.

Posts by Omer Dayan

AI Platforms / Deployment

Reducing Cold Start Latency for LLM Inference with NVIDIA Run:ai Model Streamer

Deploying large language models (LLMs) poses a challenge in optimizing inference efficiency. In particular, cold start delays—where models take significant... 13 MIN READ