Huck Yang

Huck Yang is a senior research scientist at NVIDIA Research. He received his Ph.D. and M.Sc. from Georgia Institute of Technology. His primary research lies in the area of speech-language modeling, robust speech recognition, and multimodal post-editing. He served as an area chair and committee members in IEEE ICASSP 2022 to 2025, EMNLP 2024, and SLT 2024.
Avatar photo

Posts by Huck Yang

Generative AI

Multi-Agent AI and GPU-Powered Innovation in Sound-to-Text Technology

The Automated Audio Captioning task centers around generating natural language descriptions from audio inputs. Given the distinct modalities between the input... 7 MIN READ