Ben Hamm

Ben Hamm is a technical product manager at NVIDIA, with a focus on LLM inference performance and optimization. He joined NVIDIA in 2024 as part of the acquisition of OctoAI, where he served as the PM for Octo’s LLM hosting service. Prior to that, he was a PM at Amazon where he worked on Alexa’s ML stack for wakeword detection. Ben is also a computer vision hobbyist, and invented an AI-powered cat door.

Posts by Ben Hamm

Agentic AI / Generative AI Mar 09, 2026

Removing the Guesswork from Disaggregated Serving

Deploying and optimizing large language models (LLMs) for high-performance, cost-effective serving can be an overwhelming engineering problem. The ideal... 10 MIN READ

Data Center / Cloud May 22, 2025

Blackwell Breaks the 1,000 TPS/User Barrier With Meta’s Llama 4 Maverick

NVIDIA has achieved a world-record large language model (LLM) inference speed. A single NVIDIA DGX B200 node with eight NVIDIA Blackwell GPUs can achieve over... 9 MIN READ