Posts by Farshad Ghodsian
Networking / Communications
Nov 10, 2025
Fusing Communication and Compute with New Device API and Copy Engine Collectives in NVIDIA NCCL 2.28
The latest release of the NVIDIA Collective Communications Library (NCCL) introduces a groundbreaking fusion of communication and computation for higher...
9 MIN READ
Agentic AI / Generative AI
Oct 13, 2025
NVIDIA Blackwell Leads on SemiAnalysis InferenceMAX v1 Benchmarks
SemiAnalysis recently launched InferenceMAX v1, a new open source initiative that provides a comprehensive methodology to evaluate inference hardware...
11 MIN READ
Agentic AI / Generative AI
Sep 11, 2025
How Quantization Aware Training Enables Low-Precision Accuracy Recovery
After training AI models, a variety of compression techniques can be used to optimize them for deployment. The most common is post-training quantization (PTQ),...
10 MIN READ
Agentic AI / Generative AI
Aug 25, 2025
NVFP4 Trains with Precision of 16-Bit and Speed and Efficiency of 4-Bit
In recent years, AI workloads have grown exponentially, not only in the deployment of large language models (LLMs) but also in the demand to process ever more...
9 MIN READ