Ryo Masumura

Cited by

	All	Since 2019
Citations	1208	1065
h-index	18	18
i10-index	33	28

Citations per year

Year	Citations
2010	6
2011	3
2012	6
2013	8
2014	10
2015	20
2016	11
2017	17
2018	61
2019	122
2020	130
2021	233
2022	235
2023	257
2024	83

Co-authors

Tomohiro Tanaka, NTT Computer & Data Science Laboratories (verified email at hco.ntt.co.jp)
Atsushi Ando, NTT Corporation (verified email at hco.ntt.co.jp)
Taichi Asami, NTT Corporation (verified email at ntt.com)
Mana Ihori, NTT Computer & Data Science Laboratories (verified email at hco.ntt.co.jp)
Oba Takanobu, Manager, Service Innovation Department, NTT Docomo Inc. (verified email at nttdocomo.com)
Takanori Ashihara, NTT (verified email at ntt.com)
Hiroshi Sato, NTT Corporation (verified email at ntt.com)
Yusuke Ijima, NTT Corporation (verified email at lab.ntt.co.jp)
Akinori Ito, Tohoku University (verified email at spcom.ecei.tohoku.ac.jp)
Ryuichiro Higashinaka, Nagoya University (verified email at i.nagoya-u.ac.jp)
Yusuke Shinohara, LY Corporation (verified email at lycorp.co.jp)
Kyosuke Nishida, NTT Human Informatics Laboratories, NTT Corporation (verified email at lab.ntt.co.jp)
Nobukatsu Hojo, NTT Human Informatics Laboratories (verified email at ntt.com)
Ryo Ishii, Distinguished Researcher, Human Informatics Laboratories, NTT Corporation (verified email at hco.ntt.co.jp)
Sayaka Shiota, Tokyo Metropolitan University (verified email at tmu.ac.jp)
Shinnosuke Takamichi, Keio University (verified email at keio.jp)
Satoshi Suzuki, NTT (verified email at hco.ntt.co.jp)
Yuma Koizumi, Google (verified email at google.com)
Yuki Saito, Lecturer, The University of Tokyo (verified email at ipc.i.u-tokyo.ac.jp)
Koichi SHINODA, Tokyo Institute of Technology (verified email at cs.titech.ac.jp)

Ryo Masumura

Distinguished Research Scientist, NTT Computer and Data Science Laboratories, NTT Corporation

Verified email at lab.ntt.co.jp

Speech Recognition, Spoken Language Processing, Natural Language Processing, Computer Vision


Title	Cited by	Year
Domain adaptation of DNN acoustic models using knowledge distillation. T Asami, R Masumura, Y Yamaguchi, H Masataki, Y Aono. 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017	95	2017
A transformer-based audio captioning model with keyword estimation. Y Koizumi, R Masumura, K Nishida, M Yasuda, S Saito. arXiv preprint arXiv:2007.00222, 2020	67	2020
Soft-target training with ambiguous emotional utterances for DNN-based speech emotion classification. A Ando, S Kobashikawa, H Kamiyama, R Masumura, Y Ijima, Y Aono. 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	46	2018
Online end-of-turn detection from speech based on stacked time-asynchronous sequential networks. R Masumura, T Asami, H Masataki, R Ishii, R Higashinaka. Interspeech 2017, 1661-1665, 2017	43	2017
Neural Dialogue Context Online End-of-Turn Detection. R Masumura, T Tanaka, A Ando, R Ishii, R Higashinaka, Y Aono. Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue …, 2018	35	2018
Hierarchical transformer-based large-context end-to-end ASR with large-context knowledge distillation. R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	34	2021
Large context end-to-end automatic speech recognition via extension of hierarchical recurrent encoder-decoder models. R Masumura, T Tanaka, T Moriya, Y Shinohara, T Oba, Y Aono. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	34	2019
Neural ConfNet classification: Fully neural network based spoken utterance classification using word confusion networks. R Masumura, Y Ijima, T Asami, H Masataki, R Higashinaka. 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	30	2018
Neural Error Corrective Language Models for Automatic Speech Recognition. T Tanaka, R Masumura, H Masataki, Y Aono. INTERSPEECH, 401-405, 2018	29	2018
Customer satisfaction estimation in contact center calls based on a hierarchical multi-task model. A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono, T Toda. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 715-728, 2020	28	2020
Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls. A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono. INTERSPEECH, 1716-1720, 2017	28	2017
Improving neural text normalization with data augmentation at character- and morphological levels. I Saito, J Suzuki, K Nishida, K Sadamitsu, S Kobashikawa, R Masumura, ... Proceedings of the Eighth International Joint Conference on Natural Language …, 2017	24	2017
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020	23	2020
Sequence-level consistency training for semi-supervised end-to-end automatic speech recognition. R Masumura, M Ihori, A Takashima, T Moriya, A Ando, Y Shinohara. ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	21	2020
Adversarial training for multi-task and multi-lingual joint modeling of utterance intent classification. R Masumura, Y Shinohara, R Higashinaka, Y Aono. Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018	21	2018
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge. T Tanaka, R Masumura, T Moriya, T Oba, Y Aono. INTERSPEECH, 2210-2214, 2019	20	2019
End-to-end Japanese multi-dialect speech recognition and dialect identification with multi-task learning. R Imaizumi, R Masumura, S Shiota, H Kiya. APSIPA Transactions on Signal and Information Processing 11 (1), 2022	19	2022
Parallel phonetically aware DNNs and LSTM-RNNs for frame-by-frame discriminative modeling of spoken language identification. R Masumura, T Asami, H Masataki, Y Aono. Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017	19	2017
Speech Emotion Recognition Based on Multi-Label Emotion Existence Model. A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono. INTERSPEECH, 2818-2822, 2019	18	2019
Training a language model using webdata for large vocabulary Japanese spontaneous speech recognition. R Masumura, S Hahm, A Ito. Twelfth Annual Conference of the International Speech Communication Association, 2011	17	2011
