Dheevatsa Mudigere

Dheevatsa Mudigere is a senior distinguished engineer in the NVIDIA Compute Architecture group, focusing on the application-driven co-design of large-scale AI systems. He and his team work on understanding current and future AI applications and developing HW/SW technology to enable more capable and efficient AI systems. Before NVIDIA, he worked on designing, building, and deploying production hyperscale AI systems.
Avatar photo

Posts by Dheevatsa Mudigere

Developer Tools & Techniques

Advancing Emerging Optimizers for Accelerated LLM Training with NVIDIA Megatron

Higher-order optimization algorithms such as Shampoo have been effectively applied in neural network training for at least a decade. These methods have achieved... 9 MIN READ
A multi-data center illustration.
Data Center / Cloud

Turbocharge LLM Training Across Long-Haul Data Center Networks with NVIDIA Nemo Framework

Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for... 6 MIN READ