Speech Recognition / Diarization

Apr 18, 2024

New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model

NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...

4 MIN READ

Apr 18, 2024

Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT

NVIDIA NeMo, an end-to-end platform for developing multimodal generative AI models at scale anywhere—on any cloud and on-premises—recently released...

6 MIN READ

Image of two people sitting in their cubicles with speech recognition visualizations in the background.

Apr 18, 2024

Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models

NVIDIA NeMo, an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises—released the...

6 MIN READ

Mar 19, 2024

NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy

Speech and translation AI models developed at NVIDIA are pushing the boundaries of performance and innovation. The NVIDIA Parakeet automatic speech recognition...

8 MIN READ

Person sitting at a desk having a conversation with a speech AI chatbot.

Jan 16, 2024

New Support for Dutch and Persian Released by NVIDIA NeMo ASR

Breaking barriers in speech recognition, NVIDIA NeMo proudly presents pretrained models tailored for Dutch and Persian—languages often overlooked in the AI...

2 MIN READ

Jan 09, 2024

Enhancing Phone Customer Service with ASR Customization

At the core of understanding people correctly and having natural conversations is automatic speech recognition (ASR). To make customer-led voice assistants and...

7 MIN READ

Still image from the Kairos demo of an NPC at a bar.

Jan 08, 2024

Building Lifelike Digital Avatars with NVIDIA ACE Microservices

Generative AI technologies are revolutionizing how games are produced and played. Game developers are exploring how these technologies can accelerate their...

5 MIN READ

Nov 29, 2023

Boost Meeting Productivity with AI-Powered Note-Taking and Summarization

Meetings are the lifeblood of an organization. They foster collaboration and informed decision-making. They eliminate silos through brainstorming and...

6 MIN READ

Nov 07, 2023

Video: Exploring Speech AI from Research to Practical Production Applications

The integration of speech and translation AI into our daily lives is rapidly reshaping our interactions, from virtual assistants to call centers and augmented...

2 MIN READ

Sep 20, 2023

Workshop: Building Conversational AI Applications

Learn how to build and deploy production-quality conversational AI apps with real-time transcription and NLP.

1 MIN READ

Image of two boxes with text, in two languages, with speech icons joining them to a central box symbolizing translation. The English language box displays, "One language is never enough."

Aug 29, 2023

How to Deploy NVIDIA Riva Speech and Translation AI in the Public Cloud

From start-ups to large enterprises, businesses use cloud marketplaces to find the new solutions needed to quickly transform their businesses. Cloud...

16 MIN READ

Image of glasses with computer screen reflected.

Jun 23, 2023

Speech AI Spotlight: Visualizing Spoken Language and Sounds on AR Glasses

Audio can include a wide range of sounds, from human speech to non-speech sounds like barking dogs and sirens. When designing accessible applications for people...

4 MIN READ

Jun 06, 2023

Unlocking Speech AI Technology for Global Language Users: Top Q&As

Voice-enabled technology is becoming ubiquitous. But many are being left behind by an anglocentric and demographically biased algorithmic world. Mozilla Common...

10 MIN READ

May 30, 2023

How Language Neutralization Is Transforming Customer Service Contact Centers

According to Gartner®, "Nearly half of digital workers struggle to find the data they need to do their jobs, and close to one-third have made a wrong business...

6 MIN READ

Image of a chatbot as the interface between customers, with speech bubbles.

May 30, 2023

Enhancing Customer Experience in Telecom with NVIDIA Customized Speech AI

The telecom sector is transforming how communication happens. Striving to provide reliable, uninterrupted service, businesses are tackling the challenge of...

10 MIN READ

May 02, 2023

How Speech Recognition Improves Customer Service in Telecommunications

The telecommunication industry has seen a proliferation of AI-powered technologies in recent years, with speech recognition and translation leading the charge....

7 MIN READ