Taejin Park

Taejin Park is a senior research scientist at NVIDIA focused on NeMo Speech AI. His research focuses on deep learning for speech processing, including context-aware speaker diarization, and multi-speaker automatic speech recognition (ASR). He received his Ph.D. in Electrical and Computer Engineering and M.S. in Computer Science from the University of Southern California (USC) in 2021, where he was part of the Signal Analysis and Interpretation Laboratory (SAIL). Before that, he earned his B.S. and M.S. in Electrical Engineering and Computer Science from Seoul National University (SNU), South Korea. Prior to joining NVIDIA, he worked as a researcher at the Electronics and Telecommunications Research Institute (ETRI) and held internships at Microsoft, Amazon Alexa Speech, and Capio Inc., where he contributed to advancements in federated continual learning, ASR, and speaker diarization. Taejin Park has published extensively in signal processing-related conferences and journals such as ICASSP, ICML, Interspeech, and IEEE SPL.

Posts by Taejin Park

Agentic AI / Generative AI Aug 18, 2025

Identify Speakers in Meetings, Calls, and Voice Apps in Real-Time with NVIDIA Streaming Sortformer

In every meeting, call, crowded room, or voice-enabled app, technology has a core question: who is speaking, and when? For decades, answering that question in... 5 MIN READ

Conversational AI / NLP Sep 16, 2022

Dynamic Scale Weighting Through Multiscale Speaker Diarization

Speaker diarization is the process of segmenting audio recordings by speaker labels and aims to answer the question “Who spoke when?”. It makes a clear... 10 MIN READ