Jian Hu

Jian Hu is a senior deep learning engineer at NVIDIA, focusing on large language models (LLMs) and reinforcement learning from human feedback (RLHF). He received his master’s degree in Computer Science from National Taiwan University and began a PhD program at HKUST(GZ), which he later chose to leave. Jian has five years of working experience in computer engineering and machine learning, and is the first author of popular RLHF projects OpenRLHF and REINFORCE++. His interests include reinforcement learning, artificial general intelligence (AGI), and the model-system co-optimization.

Posts by Jian Hu

Agentic AI / Generative AI Nov 19, 2025

Breaking Through Reinforcement Learning Training Limits with Scaling Rollouts in BroRL

When training large language models (LLMs) with reinforcement learning from verifiable rewards (RLVR), one of the most compelling questions is how to overcome... 7 MIN READ

Agentic AI / Generative AI Aug 13, 2025

Scaling LLM Reinforcement Learning with Prolonged Training Using ProRL v2

Currently, one of the most compelling questions in AI is whether large language models (LLMs) can continue to improve through sustained reinforcement learning... 8 MIN READ