Shiqing Fan

Shiqing Fan is a senior architect in Compute Architecture at NVIDIA, where he works on improving the end-to-end performance of neural network training both at single-node scale and supercomputer scale. He received his M.S. and B.S. from Nanjing University.
Avatar photo

Posts by Shiqing Fan

Agentic AI / Generative AI

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of... 11 MIN READ