Posts by Arts Yang
Conversational AI
Jan 12, 2023
Autoscaling NVIDIA Riva Deployment with Kubernetes for Speech AI in Production
Speech AI applications, from call centers to virtual assistants, rely heavily on automatic speech recognition (ASR) and text-to-speech (TTS). ASR can process...
13 MIN READ
Data Center / Cloud
Aug 25, 2021
Deploying NVIDIA Triton at Scale with MIG and Kubernetes
NVIDIA Triton Inference Server is an open-source AI model serving software that simplifies the deployment of trained AI models at scale in production. Clients...
24 MIN READ
Data Science
Jul 26, 2021
Accelerating Volkswagen Connected Car Data Pipelines 100x Faster with NVIDIA RAPIDS
Connected cars are vehicles that communicate with other vehicles using backend systems to enhance usability, enable convenient services, and keep distributed...
18 MIN READ
Data Science
Nov 30, 2020
Getting Kubernetes Ready for the NVIDIA A100 GPU with Multi-Instance GPU
Multi-Instance GPU (MIG) is a new feature of the latest generation of NVIDIA GPUs, such as A100. It enables users to maximize the utilization of a single GPU by...
13 MIN READ