Kimi Zhao

Kimi Zhao is a solution architect at NVIDIA, focusing on large model inference acceleration, analysis, and reinforcement learning. He holds a B.S. in physics and an M.S. in signal and information processing.

Posts by Kimi Zhao

Agentic AI / Generative AI

Removing the Guesswork from Disaggregated Serving

Deploying and optimizing large language models (LLMs) for high-performance, cost-effective serving can be an overwhelming engineering problem. The ideal...

10 MIN READ