Taejin Park

Taejin Park is a senior research scientist at NVIDIA focused on NeMo Speech AI. His research focuses on deep learning for speech processing, including context-aware speaker diarization, and multi-speaker automatic speech recognition (ASR). He received his Ph.D. in Electrical and Computer Engineering and M.S. in Computer Science from the University of Southern California (USC) in 2021, where he was part of the Signal Analysis and Interpretation Laboratory (SAIL). Before that, he earned his B.S. and M.S. in Electrical Engineering and Computer Science from Seoul National University (SNU), South Korea. Prior to joining NVIDIA, he worked as a researcher at the Electronics and Telecommunications Research Institute (ETRI) and held internships at Microsoft, Amazon Alexa Speech, and Capio Inc., where he contributed to advancements in federated continual learning, ASR, and speaker diarization. Taejin Park has published extensively in signal processing-related conferences and journals such as ICASSP, ICML, Interspeech, and IEEE SPL.
Avatar photo

Posts by Taejin Park

Decorative image.
Conversational AI

Identify Speakers in Meetings, Calls, and Voice Apps in Real-Time with NVIDIA Streaming Sortformer

In every meeting, call, crowded room, or voice-enabled app, technology has a core question: who is speaking, and when? For decades, answering that question in... 5 MIN READ
Conversational AI

Dynamic Scale Weighting Through Multiscale Speaker Diarization

Speaker diarization is the process of segmenting audio recordings by speaker labels and aims to answer the question “Who spoke when?”. It makes a clear... 10 MIN READ