Dan Feigin

Dan Feigin is a senior software engineer at NVIDIA with an emphasis on container state recovery and GPU checkpoint/restore mechanisms. He focuses on developing solutions utilizing Snapshot restore, CRIU, and CUDA to minimize startup times for large-scale inference services and fractional GPU workloads on Kubernetes.
Avatar photo

Posts by Dan Feigin

Top Stories

NVIDIA Dynamo Snapshot: Fast Startup for Inference Workloads on Kubernetes

The cold-start problem In production inference deployments, demand fluctuates over time, requiring inference replicas to scale elastically. However,... 15 MIN READ