Yu Yao

Yu Yao is a senior deep learning algorithm engineer at NVIDIA, where he contributes to the NVIDIA NeMo framework for large-scale generative AI. His work focuses on LLM training, multimodal models, model compression, and GPU-accelerated optimization. Yu holds a PhD in Physics and a master’s degree in Computer Science from the University of Southern California, combining research depth with hands-on AI systems engineering experience.
Avatar photo

Posts by Yu Yao

Developer Tools & Techniques

Implementing Falcon-H1 Hybrid Architecture in NVIDIA Megatron Core

In the rapidly evolving landscape of large language model (LLM) development, NVIDIA Megatron Core has emerged as the foundational framework for training massive... 9 MIN READ