Jason Zhou

Jason Zhou is a software engineer at NVIDIA, focused on LLM inference performance and optimization. He joined NVIDIA at the end of 2025. Previously, he worked at ByteDance on large-scale training frameworks and, prior to that, at Alibaba Group and Microsoft on distributed cloud storage systems. Outside of work, Jason enjoys watching movies and traveling around the world.
Avatar photo

Posts by Jason Zhou

Agentic AI / Generative AI

Removing the Guesswork from Disaggregated Serving

Deploying and optimizing large language models (LLMs) for high-performance, cost-effective serving can be an overwhelming engineering problem. The ideal... 10 MIN READ