​​Lucas Liebenwein

​​Lucas Liebenwein is a tech lead and senior engineer with the TensorRT-LLM team at NVIDIA, where he co-leads the development of AutoDeploy for deploying new and emerging LLM architectures with high-performance inference. Lucas joined NVIDIA through the acquisition of OmniML, Inc., where he was a founding engineer and chief architect. He received his PhD from MIT CSAIL, where his research focused on efficient deep learning.
Avatar photo

Posts by ​​Lucas Liebenwein

Developer Tools & Techniques

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

NVIDIA TensorRT LLM enables developers to build high-performance inference engines for large language models (LLMs), but deploying a new architecture... 9 MIN READ