Bita Darvish Rouhani

Bita Darvish Rouhani is a distinguished engineer and manager at NVIDIA, leading algorithms, software, and hardware co-design initiatives for cost-optimized generative AI inference. Prior to joining Nvidia, Bita was a partner group manager at Microsoft, where she co-founded and led the OCP MX consortium. This consortium has standardized the first set of 4- and 6-bit data types for AI training and inference for nearly all mainstream AI chips. Bita holds a Ph.D. in computer engineering from UC San Diego.
Avatar photo

Posts by Bita Darvish Rouhani

Data Center / Cloud

How NVIDIA GB200 NVL72 and NVIDIA Dynamo Boost Inference Performance for MoE Models

The latest wave of open source large language models (LLMs), like DeepSeek R1, Llama 4, and Qwen3, have embraced Mixture of Experts (MoE) architectures. Unlike... 12 MIN READ