Sarah Yurick

Sarah Yurick is a senior software engineer on the RAPIDS team at NVIDIA. Her efforts are focused on accelerating data science workflows on the GPU. Prior to NVIDIA, Sarah received her M.S. degree in Computer Science from Case Western Reserve University.
Avatar photo

Posts by Sarah Yurick

Generative AI

Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator

Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable... 7 MIN READ