Chenjie Luo

Chenjie Luo is a manager in the Deep Learning Algorithm and Software team at NVIDIA, leading the initiative of the TensorRT Model Optimizer user experience and production. Chenjie joined NVIDIA through the acquisition of OmniML, Inc. as an early team member. Before that, he was a SW manager at Nuro developing self-driving robots and a SW engineer at Google on edge acceleration. He received his master’s degree in Electrical Engineering from Stanford, and bachelor’s degree from Zhejiang University.
Chenjie Luo

Posts by Chenjie Luo

Illustration showing models and NeMo.
Generative AI

Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer

As large language models (LLMs) are becoming even bigger, it is increasingly important to provide easy-to-use and efficient deployment paths because the cost of... 10 MIN READ
Generative AI

Accelerate Generative AI Inference Performance with NVIDIA TensorRT Model Optimizer, Now Publicly Available

In the fast-evolving landscape of generative AI, the demand for accelerated inference speed remains a pressing concern. With the exponential growth in model... 9 MIN READ