Jian Hu

Jian Hu is a senior deep learning engineer at NVIDIA, focusing on large language models (LLMs) and reinforcement learning from human feedback (RLHF). He received his master’s degree in Computer Science from National Taiwan University and began a PhD program at HKUST(GZ), which he later chose to leave. Jian has five years of working experience in computer engineering and machine learning, and is the first author of popular RLHF projects OpenRLHF and REINFORCE++. His interests include reinforcement learning, artificial general intelligence (AGI), and the model-system co-optimization.
Avatar photo

Posts by Jian Hu

Generative AI

Scaling LLM Reinforcement Learning with Prolonged Training Using ProRL v2

Currently, one of the most compelling questions in AI is whether large language models (LLMs) can continue to improve through sustained reinforcement learning... 8 MIN READ