Bernard Nguyen

Bernard Nguyen is a deep learning engineering director at NVIDIA. He leads development of the NVIDIA NeMo Framework for GPU accelerated, distributed pre-training, and post-training of generative AI models from single GPU to thousand-node clusters. Previously, he led the development of PyTorch distributed and large-scale AI systems at Meta.
Avatar photo

Posts by Bernard Nguyen

Agentic AI / Generative AI

Accelerating Large-Scale Mixture-of-Experts Training in PyTorch

Training massive mixture-of-experts (MoE) models has long been the domain of a few advanced users with deep infrastructure and distributed-systems expertise.... 7 MIN READ
Agentic AI / Generative AI

Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework

As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By... 6 MIN READ