Exploring neural transducers for end-to-end speech recognition E Battenberg, J Chen, R Child, A Coates, YGY Li, H Liu, S Satheesh, ... 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 238* | 2017 |
On the comparison of popular end-to-end models for large scale speech recognition J Li, Y Wu, Y Gaur, C Wang, R Zhao, S Liu arXiv preprint arXiv:2005.14327, 2020 | 106 | 2020 |
Robust speech recognition using generative adversarial networks A Sriram, H Jun, Y Gaur, S Satheesh 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 62 | 2018 |
Serialized output training for end-to-end overlapped speech recognition N Kanda, Y Gaur, X Wang, Z Meng, T Yoshioka arXiv preprint arXiv:2003.12687, 2020 | 56 | 2020 |
Internal language model estimation for domain-adaptive end-to-end speech recognition Z Meng, S Parthasarathy, E Sun, Y Gaur, N Kanda, L Lu, X Chen, R Zhao, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 243-250, 2021 | 53 | 2021 |
The effects of automatic speech recognition quality on human transcription latency Y Gaur, WS Lasecki, F Metze, JP Bigham Proceedings of the 13th International Web for All Conference, 1-8, 2016 | 47 | 2016 |
Minimum latency training strategies for streaming sequence-to-sequence ASR H Inaguma, Y Gaur, L Lu, J Li, Y Gong ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 41 | 2020 |
Joint speaker counting, speech recognition, and speaker identification for overlapped speech of any number of speakers N Kanda, Y Gaur, X Wang, Z Meng, Z Chen, T Zhou, T Yoshioka arXiv preprint arXiv:2006.10930, 2020 | 40 | 2020 |
Domain adaptation via teacher-student learning for end-to-end speech recognition Z Meng, J Li, Y Gaur, Y Gong 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 38 | 2019 |
Speaker adaptation for attention-based end-to-end speech recognition Z Meng, Y Gaur, J Li, Y Gong arXiv preprint arXiv:1911.03762, 2019 | 35 | 2019 |
Internal language model training for domain-adaptive end-to-end speech recognition Z Meng, N Kanda, Y Gaur, S Parthasarathy, E Sun, L Lu, X Chen, J Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 28 | 2021 |
A Federated Approach in Training Acoustic Models. D Dimitriadis, RG Ken'ichi Kumatani, R Gmyr, Y Gaur, SE Eskimez Interspeech, 981-985, 2020 | 26 | 2020 |
Large-scale pre-training of end-to-end multi-talker ASR for meeting transcription with single distant microphone N Kanda, G Ye, Y Wu, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka arXiv preprint arXiv:2103.16776, 2021 | 24 | 2021 |
Investigation of end-to-end speaker-attributed ASR for continuous multi-talker recordings N Kanda, X Chang, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka 2021 IEEE Spoken Language Technology Workshop (SLT), 809-816, 2021 | 22 | 2021 |
End-to-end speaker-attributed ASR with transformer N Kanda, G Ye, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka arXiv preprint arXiv:2104.02128, 2021 | 15 | 2021 |
Combination of End-to-End and Hybrid Models for Speech Recognition. JHM Wong, Y Gaur, R Zhao, L Lu, E Sun, J Li, Y Gong Interspeech, 1783-1787, 2020 | 14 | 2020 |
Streaming multi-talker ASR with token-level serialized output training N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ... arXiv preprint arXiv:2202.00842, 2022 | 13 | 2022 |
Federated transfer learning with dynamic gradient aggregation D Dimitriadis, K Kumatani, R Gmyr, Y Gaur, SE Eskimez arXiv preprint arXiv:2008.02452, 2020 | 13 | 2020 |
Algorithms for speech segmentation at syllable-level for text-to-speech synthesis system in Gujarati HA Patil, T Patel, S Talesara, N Shah, H Sailor, B Vachhani, J Akhani, ... 2013 International Conference Oriental COCOSDA held jointly with 2013 …, 2013 | 13 | 2013 |
Exploring end-to-end multi-channel ASR with bias information for meeting transcription X Wang, N Kanda, Y Gaur, Z Chen, Z Meng, T Yoshioka 2021 IEEE Spoken Language Technology Workshop (SLT), 833-840, 2021 | 12 | 2021 |