Phoneme recognition using time-delay neural networks A Waibel, T Hanazawa, G Hinton, K Shikano, KJ Lang Backpropagation, 35-61, 2013 | 4206 | 2013 |
Julius---an open source real-time large vocabulary recognition engine A Lee, T Kawahara, K Shikano | 942 | 2001 |
Voice conversion through vector quantization M Abe, S Nakamura, K Shikano, H Kuwabara Journal of the Acoustical Society of Japan (E) 11 (2), 71-76, 1990 | 874 | 1990 |
ATR Japanese speech database as a tool of speech recognition and synthesis A Kurematsu, K Takeda, Y Sagisaka, S Katagiri, H Kuwabara, K Shikano Speech communication 9 (4), 357-363, 1990 | 418 | 1990 |
JNAS: Japanese speech corpus for large vocabulary continuous speech recognition research K Itou, M Yamamoto, K Takeda, T Takezawa, T Matsuoka, T Kobayashi, ... JOURNAL-ACOUSTICAL SOCIETY OF JAPAN-ENGLISH EDITION- 20, 199-206, 1999 | 408 | 1999 |
Blind source separation combining independent component analysis and beamforming H Saruwatari, S Kurita, K Takeda, F Itakura, T Nishikawa, K Shikano EURASIP Journal on Advances in Signal Processing 2003, 1-12, 2003 | 310 | 2003 |
Modularity and scaling in large phonemic neural networks A Waibel, H Sawai, K Shikano IEEE Transactions on Acoustics, Speech, and Signal Processing 37 (12), 1888-1898, 1989 | 265 | 1989 |
Blind source separation based on a fast-convergence algorithm combining ICA and beamforming H Saruwatari, T Kawamura, T Nishikawa, A Lee, K Shikano IEEE Transactions on Audio, speech, and language processing 14 (2), 666-678, 2006 | 262 | 2006 |
Phoneme recognition: neural networks vs. hidden Markov models vs. hidden Markov models A Waibel, T Hanazawa, G Hinton, K Shikano, K Lang ICASSP-88., International Conference on Acoustics, Speech, and Signal …, 1988 | 252 | 1988 |
Speaker adaptation through vector quantization K Shikano, KF Lee, R Reddy ICASSP'86. IEEE International Conference on Acoustics, Speech, and Signal …, 1986 | 251 | 1986 |
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum T Toda, H Saruwatari, K Shikano 2001 IEEE international conference on acoustics, speech, and signal …, 2001 | 234 | 2001 |
Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech K Nakamura, T Toda, H Saruwatari, K Shikano Speech communication 54 (1), 134-146, 2012 | 221 | 2012 |
Statistical voice conversion techniques for body-conducted unvoiced speech enhancement T Toda, M Nakagiri, K Shikano IEEE Transactions on Audio, Speech, and Language Processing 20 (9), 2505-2517, 2012 | 212 | 2012 |
Consonant recognition by modular construction of large phonemic time-delay neural networks A Waibel Advances in neural information processing systems 1, 1988 | 205 | 1988 |
Lip movement synthesis from speech based on Hidden Markov Models E Yamamoto, S Nakamura, K Shikano Speech Communication 26 (1-2), 105-115, 1998 | 169 | 1998 |
Free software toolkit for Japanese large vocabulary continuous speech recognition. T Kawahara, A Lee, T Kobayashi, K Takeda, N Minematsu, S Sagayama, ... INTERSPEECH, 476-479, 2000 | 163 | 2000 |
Blind spatial subtraction array for speech enhancement in noisy environment Y Takahashi, T Takatani, K Osako, H Saruwatari, K Shikano IEEE transactions on audio, speech, and language processing 17 (4), 650-664, 2009 | 158 | 2009 |
Eigenvoice conversion based on Gaussian mixture model T Toda, Y Ohtani, K Shikano | 152 | 2006 |
実環境研究プラットホームとしての音声情報案内システムの運用 西村竜一, 西原洋平, 鶴身玲典, 李晃伸, 猿渡洋, 鹿野清宏 電子情報通信学会論文誌 D 87 (3), 789-798, 2004 | 147* | 2004 |
One-to-many and many-to-one voice conversion based on eigenvoices T Toda, Y Ohtani, K Shikano 2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007 | 143 | 2007 |