Utkarsh Uppal

Utkarsh Uppal is a senior applied deep learning solutions architect at NVIDIA, where he specializes in building high-performance deep learning pipelines across domains like language and speech. His primary focus is on developing end-to-end conversational AI systems, including training LLMs from scratch, particularly for Indic languages and building domain-specific models with enterprises. He also has deep expertise in designing and optimizing inference architectures for production, with a focus on low-precision formats (FP4, FP8), decoding strategies, and KV-cache optimizations.

Posts by Utkarsh Uppal

Agentic AI / Generative AI Feb 18, 2026

Utkarsh Uppal

Posts by Utkarsh Uppal

How NVIDIA Extreme Hardware-Software Co-Design Delivered a Large Inference Boost for Sarvam AI’s Sovereign Models

Faster Training Throughput in FP8 Precision with NVIDIA NeMo

Per-Tensor and Per-Block Scaling Strategies for Effective FP8 Training

Floating-Point 8: An Introduction to Efficient, Lower-Precision AI Training