Daisy Chu

Daisy Chu is a senior systems software engineer on the NVIDIA TensorRT team, specializing in multi-device architectures. Her work centers on building production-grade inference systems, with an emphasis on performance optimization, correctness validation, and scalable execution across single- and multi-GPU environments. Daisy is instrumental in enabling efficient multi-GPU inference for large language and multimodal models, ensuring high scalability and robustness. She holds a master’s degree in Computer Science from the University of Illinois Urbana-Champaign.

Posts by Daisy Chu

Developer Tools & Techniques Jun 25, 2026

Scaling AI Inference Across Multiple GPUs Using NVIDIA TensorRT with Multi-Device Inference Support

Generative AI workloads are rapidly outgrowing the memory and compute budget of single GPUs. For inference developers building media generation pipelines, the... 11 MIN READ