Aleksandr Laptev is a Ph.D. student at ITMO University and a senior research scientist at NVIDIA. His research interests include automatic speech recognition, speech synthesis (TTS), and natural language processing. He writes open-access scientific articles, contributes to open-source software, and participates in international speech recognition competitions. His current research focuses on differentiable weighted finite-state transducers.

Changing CTC Rules to Reduce Memory Consumption in Training and Decoding

Loss functions for training automatic speech recognition (ASR) models are not set in stone. The older rules of loss functions are not necessarily optimal...