Jared Casper

Jared is a Senior Deep Learning Scientist on the Applied Deep Learning Research team at NVIDIA. Prior to joining NVIDIA in 2017, he worked on systems for deep learning at Baidu’s Silicon Valley AI Lab and on domain-specific hardware accelerators at Oracle Labs. He received his Ph.D. from Stanford in 2015, focusing on computer architecture.

Posts by Jared Casper

AI / Deep Learning

Scaling Language Model Training to a Trillion Parameters Using Megatron

Natural Language Processing (NLP) has seen rapid progress in recent years as computation at scale has become more available and datasets have become larger.
17 MIN READ

AI / Deep Learning

State-of-the-Art Language Modeling Using Megatron on the NVIDIA A100 GPU

Recent work has demonstrated that larger language models dramatically advance the state of the art in natural language processing (NLP) applications such as…
9 MIN READ

Artificial Intelligence

NVVL Accelerates Machine Learning on Video Datasets

Loading data onto GPUs for training has historically been a minor issue for most deep learning practitioners. Data read from a local spinning hard drive or NAS…
11 MIN READ