Posts by Davide Onofrio
Conversational AI
Jan 12, 2023
Autoscaling NVIDIA Riva Deployment with Kubernetes for Speech AI in Production
Speech AI applications, from call centers to virtual assistants, rely heavily on automatic speech recognition (ASR) and text-to-speech (TTS). ASR can process...
13 MIN READ
Simulation / Modeling / Design
Dec 08, 2022
Introducing NVIDIA Riva: A GPU-Accelerated SDK for Developing Speech AI Applications
This post was updated in March 2023. Sign up for the latest Speech AI news from NVIDIA. Speech AI is used in a variety of applications, including contact...
8 MIN READ
Conversational AI
Sep 14, 2022
Developing the Next Generation of Extended Reality Applications with Speech AI
Virtual reality (VR), augmented reality (AR), and mixed reality (MR) environments can feel incredibly real due to the physically immersive experience. Adding a...
12 MIN READ
Data Center / Cloud
Aug 30, 2022
Dividing NVIDIA A30 GPUs and Conquering Multiple Workloads
Multi-Instance GPU (MIG) is an important feature of NVIDIA H100, A100, and A30 Tensor Core GPUs, as it can partition a GPU into multiple instances. Each...
9 MIN READ
Data Center / Cloud
May 11, 2022
Accelerating AI Inference Workloads with NVIDIA A30 GPU
NVIDIA A30 GPU is built on the latest NVIDIA Ampere Architecture to accelerate diverse workloads like AI inference at scale, enterprise training, and HPC...
6 MIN READ
Data Center / Cloud
Aug 25, 2021
Deploying NVIDIA Triton at Scale with MIG and Kubernetes
NVIDIA Triton Inference Server is an open-source AI model serving software that simplifies the deployment of trained AI models at scale in production. Clients...
24 MIN READ