Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling J Cho, MK Baskar, R Li, M Wiesner, SH Mallidi, N Yalta, M Karafiat, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 521-527, 2018 | 113 | 2018 |
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ... Proc. CHiME-5, 6-10, 2018 | 51 | 2018 |
BAT System Description for NIST LRE 2015. O Plchot, P Matejka, O Glembek, R Fer, O Novotný, J Pesan, L Burget, ... Odyssey, 166-173, 2016 | 24 | 2016 |
Stream attention-based multi-array end-to-end speech recognition X Wang, R Li, SH Mallidi, T Hori, S Watanabe, H Hermansky ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 22 | 2019 |
Multi-stream end-to-end speech recognition R Li, X Wang, SH Mallidi, S Watanabe, T Hori, H Hermansky IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 646-655, 2019 | 18 | 2019 |
The MIT-LL, JHU and LRDE NIST 2016 Speaker Recognition Evaluation System. PA Torres-Carrasquillo, F Richardson, SC Nercessian, DE Sturim, ... Interspeech, 1333-1337, 2017 | 18 | 2017 |
Stream Attention for Distributed Multi-Microphone Speech Recognition. X Wang, R Li, H Hermansky Interspeech, 3033-3037, 2018 | 14 | 2018 |
Exploiting Hidden Layer Responses of Deep Neural Networks for Language Recognition R Li, SH Mallidi, L Burget, O Plchot, N Dehak Johns Hopkins University Baltimore United States, 2016 | 13 | 2016 |
Multi-encoder multi-resolution framework for end-to-end speech recognition R Li, X Wang, SH Mallidi, T Hori, S Watanabe, H Hermansky arXiv preprint arXiv:1811.04897, 2018 | 12 | 2018 |
M-vectors: sub-band based energy modulation features for multi-stream automatic speech recognition S Sadhu, R Li, H Hermansky ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 11 | 2019 |
Stream attention-based multi-array end-to-end speech recognition. ICASSP 2019–2019 IEEE international conference on acoustics. Speech and signal processing (ICASSP) X Wang, R Li, SH Mallidi, T Hori, S Watanabe, H Hermansky IEEE, 2019 | 9 | 2019 |
A practical two-stage training strategy for multi-stream end-to-end speech recognition R Li, G Sell, X Wang, S Watanabe, H Hermansky ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 8 | 2020 |
Deriving spectro-temporal properties of hearing from speech data L Ondel, R Li, G Sell, H Hermansky ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 5 | 2019 |
Performance monitoring for end-to-end speech recognition R Li, G Sell, H Hermansky arXiv preprint arXiv:1904.04896, 2019 | 2 | 2019 |
Exploring methods for the automatic detection of errors in manual transcription X Wang, J Yang, R Li, S Sadhu, H Hermansky arXiv preprint arXiv:1904.04294, 2019 | 2 | 2019 |
Two-stage augmentation and adaptive CTC fusion for improved robustness of multi-stream end-to-end ASR R Li, G Sell, H Hermansky 2021 IEEE Spoken Language Technology Workshop (SLT), 229-235, 2021 | 1 | 2021 |
An Efficient and Robust Multi-stream Framework for End-to-end Speech Recognition R Li The Johns Hopkins University, 2020 | | 2020 |