Speech Recognition / Diarization
Apr 18, 2024
New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model
NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...
4 MIN READ
Apr 18, 2024
Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT
NVIDIA NeMo, an end-to-end platform for developing multimodal generative AI models at scale anywhere—on any cloud and on-premises—recently released...
6 MIN READ
Apr 18, 2024
Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models
NVIDIA NeMo, an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises—released the...
6 MIN READ
Mar 19, 2024
NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy
Speech and translation AI models developed at NVIDIA are pushing the boundaries of performance and innovation. The NVIDIA Parakeet automatic speech recognition...
8 MIN READ
Jan 16, 2024
New Support for Dutch and Persian Released by NVIDIA NeMo ASR
Breaking barriers in speech recognition, NVIDIA NeMo proudly presents pretrained models tailored for Dutch and Persian—languages often overlooked in the AI...
2 MIN READ
Jan 09, 2024
Enhancing Phone Customer Service with ASR Customization
At the core of understanding people correctly and having natural conversations is automatic speech recognition (ASR). To make customer-led voice assistants and...
7 MIN READ
Jan 08, 2024
Building Lifelike Digital Avatars with NVIDIA ACE Microservices
Generative AI technologies are revolutionizing how games are produced and played. Game developers are exploring how these technologies can accelerate their...
5 MIN READ
Nov 29, 2023
Boost Meeting Productivity with AI-Powered Note-Taking and Summarization
Meetings are the lifeblood of an organization. They foster collaboration and informed decision-making. They eliminate silos through brainstorming and...
6 MIN READ
Nov 07, 2023
Video: Exploring Speech AI from Research to Practical Production Applications
The integration of speech and translation AI into our daily lives is rapidly reshaping our interactions, from virtual assistants to call centers and augmented...
2 MIN READ
Sep 20, 2023
Workshop: Building Conversational AI Applications
Learn how to build and deploy production-quality conversational AI apps with real-time transcription and NLP.
1 MIN READ
Aug 29, 2023
How to Deploy NVIDIA Riva Speech and Translation AI in the Public Cloud
From start-ups to large enterprises, businesses use cloud marketplaces to find the new solutions needed to quickly transform their businesses. Cloud...
16 MIN READ
Jun 23, 2023
Speech AI Spotlight: Visualizing Spoken Language and Sounds on AR Glasses
Audio can include a wide range of sounds, from human speech to non-speech sounds like barking dogs and sirens. When designing accessible applications for people...
4 MIN READ
Jun 06, 2023
Unlocking Speech AI Technology for Global Language Users: Top Q&As
Voice-enabled technology is becoming ubiquitous. But many are being left behind by an anglocentric and demographically biased algorithmic world. Mozilla Common...
10 MIN READ
May 30, 2023
How Language Neutralization Is Transforming Customer Service Contact Centers
According to Gartner,® "Nearly half of digital workers struggle to find the data they need to do their jobs, and close to one-third have made a wrong business...
6 MIN READ
May 30, 2023
Enhancing Customer Experience in Telecom with NVIDIA Customized Speech AI
The telecom sector is transforming how communication happens. Striving to provide reliable, uninterrupted service, businesses are tackling the challenge of...
10 MIN READ
May 02, 2023
How Speech Recognition Improves Customer Service in Telecommunications
The telecommunication industry has seen a proliferation of AI-powered technologies in recent years, with speech recognition and translation leading the charge....
7 MIN READ