Roger Waleffe

Roger Waleffe is an applied deep learning research scientist at NVIDIA. His work focuses on studying and developing efficient large language model architectures for training and inference such as the hybrid Mamba-Transformer architecture used in Nemotron-H. He holds a Ph.D. in Computer Science from the University of Wisconsin-Madison.
Avatar photo

Posts by Roger Waleffe

Generative AI

Introducing the Nemotron-H Reasoning Model Family: Throughput Gains Without Compromise

As large language models increasingly take on reasoning-intensive tasks in areas like math and science, their output lengths are getting significantly... 7 MIN READ