Follow
Mana Ihori
Mana Ihori
NTTコンピュータ&データサイエンス研究所
Verified email at hco.ntt.co.jp
Title
Cited by
Cited by
Year
Sequence-level consistency training for semi-supervised end-to-end automatic speech recognition
R Masumura, M Ihori, A Takashima, T Moriya, A Ando, Y Shinohara
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
202020
Hierarchical transformer-based large-context end-to-end asr with large-context knowledge distillation
R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
172021
Large-context pointer-generator networks for spoken-to-written style conversion
M Ihori, A Takashima, R Masumura
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
112020
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition.
R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi
INTERSPEECH, 2822-2826, 2020
82020
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations
H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Makishima, M Ihori, ...
arXiv preprint arXiv:2206.08174, 2022
72022
Improving speech-based end-of-turn detection via cross-modal representation learning with punctuated text data
R Masumura, M Ihori, T Tanaka, A Ando, R Ishii, T Oba, R Higashinaka
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
72019
Cross-modal transformer-based neural correction models for automatic speech recognition
T Tanaka, R Masumura, M Ihori, A Takashima, T Moriya, T Ashihara, ...
arXiv preprint arXiv:2107.01569, 2021
62021
Generalized large-context language models based on forward-backward hierarchical recurrent encoder-decoder models
R Masumura, M Ihori, T Tanaka, I Saito, K Nishida, T Oba
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
52019
Audio-visual speech separation using cross-modal correspondence loss
N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi, R Masumura
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
42021
Enrollment-less training for personalized voice activity detection
N Makishima, M Ihori, T Tanaka, A Takashima, S Orihashi, R Masumura
arXiv preprint arXiv:2106.12132, 2021
32021
End-to-end automatic speech recognition with deep mutual learning
R Masumura, M Ihori, A Takashima, T Tanaka, T Ashihara
2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020
32020
Memory attentive fusion: External language model integration for transformer-based sequence-to-sequence model
M Ihori, R Masumura, N Makishima, T Tanaka, A Takashima, S Orihashi
arXiv preprint arXiv:2010.15437, 2020
32020
Parallel corpus for Japanese spoken-to-written style conversion
M Ihori, A Takashima, R Masumura
Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020
32020
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens
M Ihori, N Makishima, T Tanaka, A Takashima, S Orihashi, R Masumura
arXiv preprint arXiv:2106.12131, 2021
22021
Mapgn: Masked pointer-generator network for sequence-to-sequence pre-training
M Ihori, N Makishima, T Tanaka, A Takashima, S Orihashi, R Masumura
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
22021
Large-context conversational representation learning: Self-supervised learning for conversational documents
R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi
2021 IEEE Spoken Language Technology Workshop (SLT), 1012-1019, 2021
22021
Unsupervised Domain Adaptation for Dialogue Sequence Labeling Based on Hierarchical Adversarial Training.
S Orihashi, M Ihori, T Tanaka, R Masumura
INTERSPEECH, 1575-1579, 2020
22020
Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks
T Tanaka, R Masumura, H Sato, M Ihori, K Matsuura, T Ashihara, T Moriya
Proc. Interspeech, 1066-1070, 2022
12022
Hierarchical knowledge distillation for dialogue sequence labeling
S Orihashi, Y Yamazaki, N Makishima, M Ihori, A Takashima, T Tanaka, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
12021
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages
S Orihashi, Y Yamazaki, N Makishima, M Ihori, A Takashima, T Tanaka, ...
ACM Multimedia Asia, 1-5, 2021
12021
The system can't perform the operation now. Try again later.
Articles 1–20