Jie-Fang Zhang

Jie-Fang Zhang is a senior deep learning architect at NVIDIA focusing on inference performance and optimization of data center AI workloads. He holds a Ph.D. degree in electrical and computer engineering from University of Michigan and a bachelor's degree in electrical engineering from National Taiwan University.

Posts by Jie-Fang Zhang

Data Center / Cloud Nov 21, 2024

NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200

Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series... 5 MIN READ

Data Center / Cloud Oct 09, 2024

Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch

The continued growth of LLMs capability, fueled by increasing parameter counts and support for longer contexts, has led to their usage in a wide variety of... 8 MIN READ