Yu Sun

Yu is a researcher at NVIDIA and a postdoc at Stanford University. His research focuses on continual learning, specifically a conceptual framework called test-time training, where each test instance defines its own learning problem.

Posts by Yu Sun

Agentic AI / Generative AI

Reimagining LLM Memory: Using Context as Training Data Unlocks Models That Learn at Test-Time

We keep seeing LLMs with larger context windows in the news, along with promises that they can hold entire conversation histories, volumes of books, or multiple...

6 min read