Attentive statistics pooling for deep speaker embedding K Okabe, T Koshinaka, K Shinoda arXiv preprint arXiv:1803.10963, 2018 | 611 | 2018 |
MDL-based context-dependent subword modeling for speech recognition K Shinoda, T Watanabe Acoustical Science and Technology 21 (2), 79-86, 2000 | 371 | 2000 |
A structural Bayes approach to speaker adaptation K Shinoda, CH Lee IEEE Transactions on Speech and Audio Processing 9 (3), 276-287, 2001 | 220 | 2001 |
Acoustic modeling based on the MDL principle for speech recognition. K Shinoda, T Watanabe Eurospeech, 99-102, 1997 | 202 | 1997 |
Multimodal fusion of bert-cnn and gated cnn representations for depression detection M Rodrigues Makiuchi, T Warnita, K Uto, K Shinoda Proceedings of the 9th International on Audio/Visual Emotion Challenge and …, 2019 | 127 | 2019 |
Structural MAP speaker adaptation using hierarchical priors K Shinoda, CH Lee 1997 IEEE Workshop on Automatic Speech Recognition and Understanding …, 1997 | 114 | 1997 |
An online attention-based model for speech recognition R Fan, P Zhou, W Chen, J Jia, G Liu arXiv preprint arXiv:1811.05247, 2018 | 87* | 2018 |
GINGA observation of the X-ray pulsar 1E 2259+ 586 in the supernova remnant G109. 1-1.0 K Koyama, F Nagase, Y Ogawara, K Shinoda, N Kawai, MH Jones, ... Astronomical Society of Japan, Publications (ISSN 0004-6264), vol. 41, no. 3 …, 1989 | 82 | 1989 |
Multimodal emotion recognition with high-level speech and text features MR Makiuchi, K Uto, K Shinoda 2021 IEEE automatic speech recognition and understanding workshop (ASRU …, 2021 | 81 | 2021 |
Technique for adaptation of hidden markov models for speech recognition CH Lee, K Shinoda US Patent 6,151,574, 2000 | 77 | 2000 |
Speaker adaptation with autonomous model complexity control by MDL principle K Shinoda, T Watanabe 1996 IEEE International Conference on Acoustics, Speech, and Signal …, 1996 | 69 | 1996 |
A fast and accurate video semantic-indexing system using fast MAP adaptation and GMM supervectors N Inoue, K Shinoda IEEE Transactions on Multimedia 14 (4), 1196-1205, 2012 | 59 | 2012 |
Detecting Alzheimer's disease using gated convolutional neural network from audio data T Warnita, N Inoue, K Shinoda arXiv preprint arXiv:1803.11344, 2018 | 54 | 2018 |
User adaptation of convolutional neural network for human activity recognition S Matsui, N Inoue, Y Akagi, G Nagino, K Shinoda 2017 25th European Signal Processing Conference (EUSIPCO), 753-757, 2017 | 53 | 2017 |
Speaker adaptation techniques for automatic speech recognition K Shinoda Proc. APSIPA ASC 2011, 2011 | 51 | 2011 |
High speed speech recognition using tree-structured probability density function T Watanabe, K Shinoda, K Takagi, KI Iso 1995 International Conference on Acoustics, Speech, and Signal Processing 1 …, 1995 | 50 | 1995 |
Speaker adaptation with autonomous control using tree structure K Shinoda Proc. EuroSpeech-95, 1143-1146, 1995 | 49 | 1995 |
Spectral graph skeletons for 3D action recognition T Kerola, N Inoue, K Shinoda Asian conference on computer vision, 417-432, 2014 | 47 | 2014 |
Speech recognition apparatus K Shinoda US Patent 7,437,288, 2008 | 47 | 2008 |
Hidden Markov model for automatic transcription of MIDI signals H Takeda, N Saito, T Otsuki, M Nakai, H Shimodaira, S Sagayama 2002 IEEE Workshop on Multimedia Signal Processing., 428-431, 2002 | 45 | 2002 |