Yao (Jason) Lu

Yao (Jason) Lu is a principal research scientist at NVIDIA Research. His current research interest is efficient large language models (LLM) and vision language models (VLM). Before joining NVIDIA, he was a TLM at Google Deepmind where he worked on reinforcement learning, imitation learning on embodied AI. He co-led the SayCan, RT-1, RT-2, and RT-X algorithms that have been featured extensively by media, such as New York Times, Washington Post, Forbes, Reuters, TechCrunch, The WIRED, and so on.
Avatar photo

Posts by Yao (Jason) Lu

Decorative image of VILA and Jetson Orin workflow.
Generative AI

Visual Language Intelligence and Edge AI 2.0

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest... 8 MIN READ
Decorative image.
Generative AI

Visual Language Models on NVIDIA Hardware with VILA

Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among... 11 MIN READ