Author: Zhihan Jiang | NVIDIA Technical Blog

Zhihan Jiang

Zhihan Jiang is a senior system software engineer on the TensorRT team at NVIDIA, where he focuses on delivering world-class inference results in MLPerf Inference. Before working on MLPerf, he worked on TensorRT autonomous safety libraries and infrastructure, and on NVIDIA CPU architecture modeling. Zhihan holds an M.S. degree in electrical engineering from Stanford University and a B.S. degree in computer engineering from Georgia Tech.

Posts by Zhihan Jiang

Generative AI Mar 27, 2024

NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records

Generative AI is unlocking new computing applications that greatly augment human capability, enabled by continued model innovation. Generative AI... 11 MIN READ

Data Center / Cloud Sep 09, 2023

Leading MLPerf Inference v3.1 Results with NVIDIA GH200 Grace Hopper Superchip Debut

AI is transforming computing, and inference is how the capabilities of AI are deployed in the world’s applications. Intelligent chatbots, image and video... 13 MIN READ

Data Center / Cloud Apr 05, 2023

Setting New Records in MLPerf Inference v3.0 with Full-Stack Optimizations for AI

The most exciting computing applications currently rely on training and running inference on complex AI models, often in demanding, real-time deployment... 15 MIN READ

Simulation / Modeling / Design Sep 08, 2022

Full-Stack Innovation Fuels Highest MLPerf Inference 2.1 Results for NVIDIA

Today’s AI-powered applications are enabling richer experiences, fueled by both larger and more complex AI models as well as the application of many models in... 14 MIN READ