Sequence-level consistency training for semi-supervised end-to-end automatic speech recognition R Masumura, M Ihori, A Takashima, T Moriya, A Ando, Y Shinohara ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 20 | 2020 |
Hierarchical transformer-based large-context end-to-end ASR with large-context knowledge distillation R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 17 | 2021 |
Large-context pointer-generator networks for spoken-to-written style conversion M Ihori, A Takashima, R Masumura ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 11 | 2020 |
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi INTERSPEECH, 2822-2826, 2020 | 8 | 2020 |
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Makishima, M Ihori, ... arXiv preprint arXiv:2206.08174, 2022 | 7 | 2022 |
Improving speech-based end-of-turn detection via cross-modal representation learning with punctuated text data R Masumura, M Ihori, T Tanaka, A Ando, R Ishii, T Oba, R Higashinaka 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 7 | 2019 |
Cross-modal transformer-based neural correction models for automatic speech recognition T Tanaka, R Masumura, M Ihori, A Takashima, T Moriya, T Ashihara, ... arXiv preprint arXiv:2107.01569, 2021 | 6 | 2021 |
Generalized large-context language models based on forward-backward hierarchical recurrent encoder-decoder models R Masumura, M Ihori, T Tanaka, I Saito, K Nishida, T Oba 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 5 | 2019 |
Audio-visual speech separation using cross-modal correspondence loss N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi, R Masumura ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 4 | 2021 |
Enrollment-less training for personalized voice activity detection N Makishima, M Ihori, T Tanaka, A Takashima, S Orihashi, R Masumura arXiv preprint arXiv:2106.12132, 2021 | 3 | 2021 |
End-to-end automatic speech recognition with deep mutual learning R Masumura, M Ihori, A Takashima, T Tanaka, T Ashihara 2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020 | 3 | 2020 |
Memory attentive fusion: External language model integration for transformer-based sequence-to-sequence model M Ihori, R Masumura, N Makishima, T Tanaka, A Takashima, S Orihashi arXiv preprint arXiv:2010.15437, 2020 | 3 | 2020 |
Parallel corpus for Japanese spoken-to-written style conversion M Ihori, A Takashima, R Masumura Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 3 | 2020 |
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens M Ihori, N Makishima, T Tanaka, A Takashima, S Orihashi, R Masumura arXiv preprint arXiv:2106.12131, 2021 | 2 | 2021 |
MAPGN: Masked pointer-generator network for sequence-to-sequence pre-training M Ihori, N Makishima, T Tanaka, A Takashima, S Orihashi, R Masumura ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 2 | 2021 |
Large-context conversational representation learning: Self-supervised learning for conversational documents R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi 2021 IEEE Spoken Language Technology Workshop (SLT), 1012-1019, 2021 | 2 | 2021 |
Unsupervised Domain Adaptation for Dialogue Sequence Labeling Based on Hierarchical Adversarial Training S Orihashi, M Ihori, T Tanaka, R Masumura INTERSPEECH, 1575-1579, 2020 | 2 | 2020 |
Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks T Tanaka, R Masumura, H Sato, M Ihori, K Matsuura, T Ashihara, T Moriya Proc. Interspeech, 1066-1070, 2022 | 1 | 2022 |
Hierarchical knowledge distillation for dialogue sequence labeling S Orihashi, Y Yamazaki, N Makishima, M Ihori, A Takashima, T Tanaka, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 1 | 2021 |
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages S Orihashi, Y Yamazaki, N Makishima, M Ihori, A Takashima, T Tanaka, ... ACM Multimedia Asia, 1-5, 2021 | 1 | 2021 |