Posts by Utkarsh Uppal
Agentic AI / Generative AI
Sep 23, 2025
Faster Training Throughput in FP8 Precision with NVIDIA NeMo
In previous posts on FP8 training, we explored the fundamentals of FP8 precision and took a deep dive into the various scaling recipes for practical large-scale...
12 MIN READ
Agentic AI / Generative AI
Jul 01, 2025
Per-Tensor and Per-Block Scaling Strategies for Effective FP8 Training
In this blog post, we’ll break down the main FP8 scaling strategies—per-tensor scaling, delayed and current scaling, and per-block scaling (including the...
10 MIN READ
Agentic AI / Generative AI
Jun 04, 2025
Floating-Point 8: An Introduction to Efficient, Lower-Precision AI Training
With the growth of large language models (LLMs), deep learning is advancing both model architecture design and computational efficiency. Mixed precision...
11 MIN READ