Harry Kim

Harry Kim is a Principal Product Manager at NVIDIA enabling performant and scalable AI/ML inference with Triton. He has experience working on recommendation systems at Meta, AI infrastructure at Intel AI, and Ads ranking and recommendation at Google. He holds a PhD in Statistics from UC Berkeley.
Avatar photo

Posts by Harry Kim

Decorative image.
Generative AI / LLMs

Measuring Generative AI Model Performance Using NVIDIA GenAI-Perf and an OpenAI-Compatible API

NVIDIA offers tools like Perf Analyzer and Model Analyzer to assist machine learning engineers with measuring and balancing the trade-off between latency and... 6 MIN READ