Suyog Gupta

Suyog Gupta is a distinguished engineer and manager at NVIDIA, where he works on inference software architecture for large-scale AI systems. He received his PhD from Stanford University and previously worked on machine learning hardware/software codesign at IBM Research, Google, and GM Cruise.

Posts by Suyog Gupta

Developer Tools & Techniques

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

NVIDIA TensorRT LLM enables developers to build high-performance inference engines for large language models (LLMs), but deploying a new architecture...

9 MIN READ