Posts by Sanjay Chatterjee
Data Center / Cloud
Mar 23, 2026
Deploying Disaggregated LLM Inference Workloads on Kubernetes
As large language model (LLM) inference workloads grow in complexity, a single monolithic serving process starts to hit its limits. Prefill and decode stages...
14 MIN READ
Agentic AI / Generative AI
Nov 10, 2025
Streamline Complex AI Inference on Kubernetes with NVIDIA Grove
Over the past few years, AI inference has evolved from single-model, single-pod deployments into complex, multicomponent systems. A model deployment may now...
10 MIN READ