Shobhit Verma

Shobhit Verma is a software engineer on the TensorRT team at NVIDIA, where he focuses on MLPerf Inference. He has experience in the design and verification of ML accelerators and in developing high-performance computing applications and distributed systems. Shobhit holds an M.Sc. in computer science from the University of Chicago and a B.Sc. in computer engineering from Delhi Technological University.

Posts by Shobhit Verma

Generative AI / LLMs

NVIDIA Triton Inference Server Achieves Outstanding Performance in MLPerf Inference 4.1 Benchmarks

Six years ago, we embarked on a journey to develop an AI inference serving solution specifically designed for high-throughput and time-sensitive production use...

8 min read