Technical Walkthrough 3

Applying Language Model Techniques to Compose AI Music

Language models such as the NVIDIA Megatron-LM and OpenAI GPT-2 and GPT-3 have been used to enhance human productivity and creativity. Specifically, these... 11 MIN READ
Technical Walkthrough 4

Adapting P-Tuning to Solve Non-English Downstream Tasks

With the increasing demand for access to pretrained large language model (LLM) weights, the climate around LLM sharing is changing. Recently, Meta released Open... 15 MIN READ
Technical Walkthrough 1

The Full Stack Optimization Powering NVIDIA MLPerf Training v2.0 Performance

MLPerf benchmarks are developed by a consortium of AI leaders across industry, academia, and research labs, with the aim of providing standardized, fair, and... 14 MIN READ
Technical Walkthrough 0

Training a State-of-the-Art ImageNet-1K Visual Transformer Model using NVIDIA DGX SuperPOD

Recent work has demonstrated that large transformer models can achieve or advance the SOTA in computer vision tasks such as semantic segmentation and object... 9 MIN READ
Technical Walkthrough 0

Saving Time and Money in the Cloud with the Latest NVIDIA-Powered Instances

AI is transforming every industry, enabling powerful new applications and use cases that simply weren’t possible with traditional software. As AI continues to... 9 MIN READ
Technical Walkthrough 3

Doubling all2all Performance with NVIDIA Collective Communication Library 2.12

Collective communications are a performance-critical ingredient of modern distributed AI training workloads such as recommender systems and natural language... 8 MIN READ