Domain adaptation of dnn acoustic models using knowledge distillation T Asami, R Masumura, Y Yamaguchi, H Masataki, Y Aono 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 80 | 2017 |
A transformer-based audio captioning model with keyword estimation Y Koizumi, R Masumura, K Nishida, M Yasuda, S Saito arXiv preprint arXiv:2007.00222, 2020 | 48 | 2020 |
Soft-target training with ambiguous emotional utterances for dnn-based speech emotion classification A Ando, S Kobashikawa, H Kamiyama, R Masumura, Y Ijima, Y Aono 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 38 | 2018 |
Online End-of-Turn Detection from Speech Based on Stacked Time-Asynchronous Sequential Networks. R Masumura, T Asami, H Masataki, R Ishii, R Higashinaka Interspeech 2017, 1661-1665, 2017 | 38 | 2017 |
Large context end-to-end automatic speech recognition via extension of hierarchical recurrent encoder-decoder models R Masumura, T Tanaka, T Moriya, Y Shinohara, T Oba, Y Aono ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 28 | 2019 |
Neural Dialogue Context Online End-of-Turn Detection R Masumura, T Tanaka, A Ando, R Ishii, R Higashinaka, Y Aono Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue …, 2018 | 28 | 2018 |
Neural confnet classification: Fully neural network based spoken utterance classification using word confusion networks R Masumura, Y Ijima, T Asami, H Masataki, R Higashinaka 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 26 | 2018 |
Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls. A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono INTERSPEECH, 1716-1720, 2017 | 23 | 2017 |
Improving neural text normalization with data augmentation at character-and morphological levels I Saito, J Suzuki, K Nishida, K Sadamitsu, S Kobashikawa, R Masumura, ... Proceedings of the Eighth International Joint Conference on Natural Language …, 2017 | 21 | 2017 |
Neural Error Corrective Language Models for Automatic Speech Recognition. T Tanaka, R Masumura, H Masataki, Y Aono INTERSPEECH, 401-405, 2018 | 20 | 2018 |
Sequence-level consistency training for semi-supervised end-to-end automatic speech recognition R Masumura, M Ihori, A Takashima, T Moriya, A Ando, Y Shinohara ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 19 | 2020 |
Adversarial training for multi-task and multi-lingual joint modeling of utterance intent classification R Masumura, Y Shinohara, R Higashinaka, Y Aono Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018 | 19 | 2018 |
Parallel phonetically aware DNNs and LSTM-RNNs for frame-by-frame discriminative modeling of spoken language identification R Masumura, T Asami, H Masataki, Y Aono Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017 | 18 | 2017 |
Hierarchical transformer-based large-context end-to-end asr with large-context knowledge distillation R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 17 | 2021 |
Customer satisfaction estimation in contact center calls based on a hierarchical multi-task model A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono, T Toda IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 715-728, 2020 | 17 | 2020 |
Training a language model using webdata for large vocabulary Japanese spontaneous speech recognition R Masumura, S Hahm, A Ito Twelfth Annual Conference of the International Speech Communication Association, 2011 | 16 | 2011 |
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020 | 15 | 2020 |
Speech Emotion Recognition Based on Multi-Label Emotion Existence Model. A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono INTERSPEECH, 2818-2822, 2019 | 14 | 2019 |
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge. T Tanaka, R Masumura, T Moriya, T Oba, Y Aono INTERSPEECH, 2210-2214, 2019 | 14 | 2019 |
Use of latent words language models in ASR: a sampling-based implementation R Masumura, H Masataki, T Oba, O Yoshioka, S Takahashi 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 13 | 2013 |