Zihao Kong

Zihao Kong is a software engineer on the TensorRT team at NVIDIA. He focuses on delivering world-class deep learning inference performance on NVIDIA data center GPUs and on Jetson platform GPUs at the edge. He also has experience with performance analysis, profiling, and deep learning accelerators. He holds a bachelor’s degree in Computer Engineering from UC San Diego.

Posts by Zihao Kong

Data Center / Cloud

NVIDIA Blackwell Platform Sets New LLM Inference Records in MLPerf Inference v4.1

Large language model (LLM) inference is a full-stack challenge. Powerful GPUs, high-bandwidth GPU-to-GPU interconnects, efficient acceleration libraries, and a... 13 MIN READ