Technical Walkthrough 2

Improving Japanese Language ASR by Combining Convolutions with Attention Mechanisms

Automatic speech recognition (ASR) research generally focuses on high-resource languages such as English, which is supported by hundreds of thousands of hours... 5 MIN READ
Technical Walkthrough 3

Accelerated Inference for Large Transformer Models Using NVIDIA Triton Inference Server

This is the first part of a two-part series discussing the NVIDIA Triton Inference Server’s FasterTransformer (FT) library, one of the fastest libraries for... 10 MIN READ
Technical Walkthrough 3

Deploying GPT-J and T5 with NVIDIA Triton Inference Server

This is the second part of a two-part series about NVIDIA tools that allow you to run large transformer models for accelerated inference. For an introduction to... 16 MIN READ
Technical Walkthrough 4

NVIDIA AI Platform Delivers Big Gains for Large Language Models

As the size and complexity of large language models (LLMs) continue to grow, NVIDIA is today announcing updates to the NeMo Megatron framework that provide... 7 MIN READ
News 1

Developing NLP Applications for Healthcare

Natural language processing (NLP) can be defined as the combination of artificial intelligence (AI), computer science, and computational linguistics to... 4 MIN READ
Technical Walkthrough 4

Adapting P-Tuning to Solve Non-English Downstream Tasks

With the increasing demand for access to pretrained large language model (LLM) weights, the climate around LLM sharing is changing. Recently, Meta released Open... 15 MIN READ