Acoustic-to-word attention-based model complemented with character-level CTC-based model S Ueno, H Inaguma, M Mimura, T Kawahara 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 76 | 2018 |
Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition M Mimura, S Ueno, H Inaguma, S Sakai, T Kawahara 2018 IEEE Spoken Language Technology Workshop (SLT), 477-484, 2018 | 71 | 2018 |
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara arXiv preprint arXiv:2008.03822, 2020 | 60 | 2020 |
End-to-End Speech Emotion Recognition Combined with Acoustic-to-Word ASR Model. H Feng, S Ueno, T Kawahara INTERSPEECH, 501-505, 2020 | 55 | 2020 |
Multi-speaker Sequence-to-sequence Speech Synthesis for Data Augmentation in Acoustic-to-word Speech Recognition S Ueno, M Mimura, S Sakai, T Kawahara ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 41 | 2019 |
Data Augmentation for ASR Using TTS Via a Discrete Representation S Ueno, M Mimura, S Sakai, T Kawahara 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 68-75, 2021 | 26 | 2021 |
Speech corpus of Ainu folklore and end-to-end speech recognition for Ainu language K Matsuura, S Ueno, M Mimura, S Sakai, T Kawahara arXiv preprint arXiv:2002.06675, 2020 | 21 | 2020 |
Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition S Ueno, T Moriya, M Mimura, S Sakai, Y Shinohara, Y Yamaguchi, ... Proc. Interspeech 2018, 2424-2428, 2018 | 13 | 2018 |
End-to-end speech-to-dialog-act recognition VT Dang, T Zhao, S Ueno, H Inaguma, T Kawahara arXiv preprint arXiv:2004.11419, 2020 | 12 | 2020 |
Multi-task Learning with Augmentation Strategy for Acoustic-to-word Attention-based Encoder-decoder Speech Recognition. T Moriya, S Ueno, Y Shinohara, M Delcroix, Y Yamaguchi, Y Aono INTERSPEECH, 2399-2403, 2018 | 8 | 2018 |
Synthesizing waveform sequence-to-sequence to augment training data for sequence-to-sequence speech recognition S Ueno, M Mimura, S Sakai, T Kawahara Acoustical Science and Technology 42 (6), 333-343, 2021 | 2 | 2021 |