Posts by Ashraf Eassa
Data Center / Cloud
Apr 01, 2026
NVIDIA Platform Delivers Lowest Token Cost Enabled by Extreme Co-Design
Co-designed hardware, software, and models are key to delivering the highest AI factory throughput and lowest token cost. Measuring this goes far beyond peak...
10 MIN READ
Data Center / Cloud
Feb 06, 2026
3 Ways NVFP4 Accelerates AI Training and Inference
The latest AI models continue to grow in size and complexity, demanding increasing amounts of compute performance for training and inference—far beyond what...
6 MIN READ
Data Center / Cloud
Jan 08, 2026
Delivering Massive Performance Leaps for Mixture of Experts Inference on NVIDIA Blackwell
As AI models continue to get smarter, people can rely on them for an expanding set of tasks. This leads users—from consumers to enterprises—to interact with...
6 MIN READ
Data Center / Cloud
Dec 11, 2025
NVIDIA Blackwell Enables 3x Faster Training and Nearly 2x Training Performance Per Dollar than Previous-Gen Architecture
AI innovation continues to be driven by three scaling laws: pre-training, post-training, and test-time scaling. Training is foundational to building smarter...
7 MIN READ
Agentic AI / Generative AI
Nov 12, 2025
NVIDIA Blackwell Architecture Sweeps MLPerf Training v5.1 Benchmarks
The NVIDIA Blackwell architecture powered the fastest time to train across every MLPerf Training v5.1 benchmark, marking a clean sweep in the latest round of...
10 MIN READ
Data Center / Cloud
Sep 09, 2025
NVIDIA Blackwell Ultra Sets New Inference Records in MLPerf Debut
As large language models (LLMs) grow larger, they get smarter, with open models from leading developers now featuring hundreds of billions of parameters. At the...
10 MIN READ