Jie-Fang Zhang

Jie-Fang Zhang is a senior deep learning architect at NVIDIA focusing on inference performance and optimization of data center AI workloads. He holds a Ph.D. degree in electrical and computer engineering from University of Michigan and a bachelor's degree in electrical engineering from National Taiwan University.
Avatar photo

Posts by Jie-Fang Zhang

Image of an HGX H200
Data Center / Cloud

NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200

Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series... 5 MIN READ
Data Center / Cloud

Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch

The continued growth of LLMs capability, fueled by increasing parameter counts and support for longer contexts, has led to their usage in a wide variety of... 8 MIN READ