Posts by Jared Casper
Technical Walkthrough
Apr 12, 2021
Scaling Language Model Training to a Trillion Parameters Using Megatron
Natural Language Processing (NLP) has seen rapid progress in recent years as computation at scale has become more available and datasets have become larger. At...
17 MIN READ
Technical Walkthrough
May 14, 2020
State-of-the-Art Language Modeling Using Megatron on the NVIDIA A100 GPU
Recent work has demonstrated that larger language models dramatically advance the state of the art in natural language processing (NLP) applications such as...
9 MIN READ
Technical Walkthrough
May 02, 2018
NVVL Accelerates Machine Learning on Video Datasets
Loading data onto GPUs for training has historically been a minor issue for most deep learning practitioners. Data read from a local spinning hard drive or NAS...
11 MIN READ