Sanjay Chatterjee

Sanjay Chatterjee is an engineering manager at NVIDIA. He works on GPU compute infrastructure with a focus on GPU scheduling to enable AI and HPC workloads to scale on Kubernetes. He is the creator and architect of the open source NVIDIA Grove project. Previously he worked on multiple DoE/DARPA funded advanced technology projects towards designing the first exascale systems. His interests include novel programming models, parallel languages, and runtime systems.
Avatar photo

Posts by Sanjay Chatterjee

Data Center / Cloud

Deploying Disaggregated LLM Inference Workloads on Kubernetes

As large language model (LLM) inference workloads grow in complexity, a single monolithic serving process starts to hit its limits. Prefill and decode stages... 14 MIN READ
Agentic AI / Generative AI

Streamline Complex AI Inference on Kubernetes with NVIDIA Grove

Over the past few years, AI inference has evolved from single-model, single-pod deployments into complex, multicomponent systems. A model deployment may now... 10 MIN READ