Shizhe Diao

Shizhe Diao is a research scientist at NVIDIA Research, working on efficient training and alignment of foundation models. He completed his PhD at the Hong Kong University of Science and Technology. Shizhe has seven years of experience in machine learning and natural language processing, and is the first author of the popular post-training project LMFlow.

Posts by Shizhe Diao

Generative AI

Scaling LLM Reinforcement Learning with Prolonged Training Using ProRL v2

Currently, one of the most compelling questions in AI is whether large language models (LLMs) can continue to improve through sustained reinforcement learning... 8 MIN READ

Generative AI

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,... 12 MIN READ