News 0

Just Released: New Updates to NVIDIA Riva

Build better GPU-accelerated Speech AI applications with the latest NVIDIA Riva updates, including enterprise support. < 1
Technical Walkthrough 3

Low-Code Building Blocks for Speech AI Robotics

When examining an intricate speech AI robotic system, it’s easy for developers to feel intimidated by its complexity. Arthur C. Clarke claimed, “Any... 8 MIN READ
Technical Walkthrough 8

Text Normalization and Inverse Text Normalization with NVIDIA NeMo

Text normalization (TN) converts text from written form into its verbalized form, and it is an essential preprocessing step before text-to-speech (TTS). TN... 9 MIN READ
Technical Walkthrough 2

Dynamic Scale Weighting Through Multiscale Speaker Diarization

Speaker diarization is the process of segmenting audio recordings by speaker labels and aims to answer the question “Who spoke when?”. It makes a clear... 10 MIN READ
Technical Walkthrough 4

Developing the Next Generation of Extended Reality Applications with Speech AI

Virtual reality (VR), augmented reality (AR), and mixed reality (MR) environments can feel incredibly real due to the physically immersive experience. Adding a... 12 MIN READ
Technical Walkthrough 4

Changing CTC Rules to Reduce Memory Consumption in Training and Decoding

Loss functions for training automatic speech recognition (ASR) models are not set in stone. The older rules of loss functions are not necessarily optimal.... 8 MIN READ