Hongkuan Zhou

Dr. Hongkuan is a senior Deep Learning Algorithm Engineer. His work focuses on developing efficient and scalable LLM inference systems. Previously, he worked on acceleration and application of Graph Neural Networks.
Avatar photo

Posts by Hongkuan Zhou

Three icons, with text LLMs, Optimize, Deploy.
Data Center / Cloud

NVIDIA Dynamo Adds GPU Autoscaling, Kubernetes Automation, and Networking Optimizations

At NVIDIA GTC 2025, we announced NVIDIA Dynamo, a high-throughput, low-latency open-source inference serving framework for deploying generative AI and reasoning... 7 MIN READ