An investigation of deep neural networks for noise robust speech recognition ML Seltzer, D Yu, Y Wang 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 749 | 2013 |
CNTK: Microsoft's open-source deep-learning toolkit F Seide, A Agarwal Proceedings of the 22nd ACM SIGKDD international conference on knowledge …, 2016 | 559 | 2016 |
Towards end-to-end spoken language understanding D Serdyuk, Y Wang, C Fuegen, A Kumar, B Liu, Y Bengio 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 215 | 2018 |
Transformer-based acoustic modeling for hybrid speech recognition Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 202 | 2020 |
Transformer-transducer: End-to-end speech recognition with self-attention CF Yeh, J Mahadeokar, K Kalgaonkar, Y Wang, D Le, M Jain, K Schubert, ... arXiv preprint arXiv:1910.12977, 2019 | 124 | 2019 |
Efficient lattice rescoring using recurrent neural network language models X Liu, Y Wang, X Chen, MJF Gales, PC Woodland 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 110 | 2014 |
Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition Y Shi, Y Wang, C Wu, CF Yeh, J Chan, F Zhang, D Le, M Seltzer ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 103 | 2021 |
Simplifying long short-term memory acoustic models for fast training and decoding Y Miao, J Li, Y Wang, SX Zhang, Y Gong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 90 | 2016 |
Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch X Chen, Y Wang, X Liu, MJF Gales, PC Woodland Fifteenth Annual Conference of the International Speech Communication …, 2014 | 81 | 2014 |
Adaptation of deep neural network acoustic models using factorised i-vectors. P Karanasou, Y Wang, MJF Gales, PC Woodland Interspeech 2014, 2180-2184, 2014 | 80 | 2014 |
Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022 | 78 | 2022 |
Speaker and noise factorization for robust speech recognition Y Wang, MJF Gales IEEE Transactions on Audio, Speech, and Language Processing 20 (7), 2149-2158, 2012 | 58 | 2012 |
Investigations on speaker adaptation of LSTM RNN models for speech recognition C Liu, Y Wang, K Kumar, Y Gong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 51 | 2016 |
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory C Wu, Y Wang, Y Shi, CF Yeh, F Zhang arXiv preprint arXiv:2005.08042, 2020 | 49 | 2020 |
Small-footprint high-performance deep neural network-based speech recognition using split-VQ Y Wang, J Li, Y Gong 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 45 | 2015 |
End-to-end contextual speech recognition using class language models and a token passing decoder Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 36 | 2019 |
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR. Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen Interspeech, 3490-3494, 2019 | 36 | 2019 |
Speaker and noise factorisation on the AURORA4 task YQ Wang, MJF Gales Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International …, 2011 | 29 | 2011 |
Model-based approaches to handling additive noise in reverberant environments MJF Gales, YQ Wang Hands-free Speech Communication and Microphone Arrays (HSCMA), 2011 Joint …, 2011 | 28 | 2011 |
Streaming simultaneous speech translation with augmented memory transformer X Ma, Y Wang, MJ Dousti, P Koehn, J Pino ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 25 | 2021 |