Ganesh Kudleppanavar

Ganesh Kudleppanavar is a System Software Manager at NVIDIA, dedicated to optimizing the performance of Machine Learning and Generative AI models. He leads the Triton tools team, utilizing the powerful Triton Tools to meticulously benchmark these models, ensuring their efficient deployment and seamless utilization across diverse applications. Ganesh holds a Master of Science degree in Electrical Engineering from California State University, Long Beach.
Avatar photo

Posts by Ganesh Kudleppanavar

Decorative image of a datacenter with floating icons overlaid.
Generative AI

LLM Inference Benchmarking Guide: NVIDIA GenAI-Perf and NIM

This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ... 11 MIN READ
Generative AI

LLM Inference Benchmarking: Fundamental Concepts

This is the first post in the large language model latency-throughput benchmarking series, which aims to instruct developers on common metrics used for LLM... 15 MIN READ
Decorative image.
Generative AI

Measuring Generative AI Model Performance Using NVIDIA GenAI-Perf and an OpenAI-Compatible API

NVIDIA offers tools like Perf Analyzer and Model Analyzer to assist machine learning engineers with measuring and balancing the trade-off between latency and... 6 MIN READ