Posts by Farshad Ghodsian
Agentic AI / Generative AI
Oct 09, 2025
NVIDIA Blackwell Leads on SemiAnalysis InferenceMAX™ v1 Benchmarks
SemiAnalysis recently launched InferenceMAX™ v1, a new open source initiative that provides a comprehensive methodology to evaluate inference hardware...
11 MIN READ
Agentic AI / Generative AI
Sep 11, 2025
How Quantization Aware Training Enables Low-Precision Accuracy Recovery
After training AI models, a variety of compression techniques can be used to optimize them for deployment. The most common is post-training quantization (PTQ),...
10 MIN READ
Agentic AI / Generative AI
Aug 25, 2025
NVFP4 Trains with Precision of 16-Bit and Speed and Efficiency of 4-Bit
In recent years, AI workloads have grown exponentially—not only in the deployment of large language models (LLMs) but also in the demand to process ever more...
9 MIN READ