Ruizhi Li
Verified email at microsoft.com
Title
Cited by
Year
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling
J Cho, MK Baskar, R Li, M Wiesner, SH Mallidi, N Yalta, M Karafiat, ...
2018 IEEE Spoken Language Technology Workshop (SLT), 521-527, 2018
Cited by: 133; Year: 2018
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays
N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ...
Proc. CHiME-5, 6-10, 2018
Cited by: 52; Year: 2018
Stream attention-based multi-array end-to-end speech recognition
X Wang, R Li, SH Mallidi, T Hori, S Watanabe, H Hermansky
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 27; Year: 2019
BAT System Description for NIST LRE 2015.
O Plchot, P Matejka, O Glembek, R Fer, O Novotný, J Pesan, L Burget, ...
Odyssey, 166-173, 2016
Cited by: 25; Year: 2016
Multi-stream end-to-end speech recognition
R Li, X Wang, SH Mallidi, S Watanabe, T Hori, H Hermansky
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 646-655, 2019
Cited by: 23; Year: 2019
The MIT-LL, JHU and LRDE NIST 2016 Speaker Recognition Evaluation System.
PA Torres-Carrasquillo, F Richardson, SC Nercessian, DE Sturim, ...
Interspeech, 1333-1337, 2017
Cited by: 19; Year: 2017
M-vectors: sub-band based energy modulation features for multi-stream automatic speech recognition
S Sadhu, R Li, H Hermansky
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 15; Year: 2019
Stream Attention for Distributed Multi-Microphone Speech Recognition.
X Wang, R Li, H Hermansky
Interspeech, 3033-3037, 2018
Cited by: 14; Year: 2018
Multi-encoder multi-resolution framework for end-to-end speech recognition
R Li, X Wang, SH Mallidi, T Hori, S Watanabe, H Hermansky
arXiv preprint arXiv:1811.04897, 2018
Cited by: 13; Year: 2018
Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition.
R Li, SHR Mallidi, L Burget, O Plchot, N Dehak
INTERSPEECH, 3265-3269, 2016
Cited by: 11; Year: 2016
A practical two-stage training strategy for multi-stream end-to-end speech recognition
R Li, G Sell, X Wang, S Watanabe, H Hermansky
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 8; Year: 2020
Deriving spectro-temporal properties of hearing from speech data
L Ondel, R Li, G Sell, H Hermansky
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 5; Year: 2019
Performance monitoring for end-to-end speech recognition
R Li, G Sell, H Hermansky
arXiv preprint arXiv:1904.04896, 2019
Cited by: 2; Year: 2019
Exploring methods for the automatic detection of errors in manual transcription
X Wang, J Yang, R Li, S Sadhu, H Hermansky
arXiv preprint arXiv:1904.04294, 2019
Cited by: 2; Year: 2019
Two-stage augmentation and adaptive CTC fusion for improved robustness of multi-stream end-to-end ASR
R Li, G Sell, H Hermansky
2021 IEEE Spoken Language Technology Workshop (SLT), 229-235, 2021
Cited by: 1; Year: 2021
An Efficient and Robust Multi-stream Framework for End-to-end Speech Recognition
R Li
The Johns Hopkins University, 2020
Year: 2020