Singing Voice Synthesis Based on Deep Neural Networks. M Nishimura, K Hashimoto, K Oura, Y Nankaku, K Tokuda Interspeech, 2478-2482, 2016 | 108 | 2016 |
Singing voice synthesis based on generative adversarial networks Y Hono, K Hashimoto, K Oura, Y Nankaku, K Tokuda ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 70 | 2019 |
The effect of neural networks in statistical parametric speech synthesis K Hashimoto, K Oura, Y Nankaku, K Tokuda 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 64 | 2015 |
Privacy-preserving sound to degrade automatic speaker verification performance K Hashimoto, J Yamagishi, I Echizen 2016 IEEE international conference on acoustics, speech and signal …, 2016 | 49 | 2016 |
Sinsy: A deep neural network-based singing voice synthesis system Y Hono, K Hashimoto, K Oura, Y Nankaku, K Tokuda IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2803-2815, 2021 | 41 | 2021 |
Recent development of the DNN-based singing voice synthesis system—sinsy Y Hono, S Murata, K Nakamura, K Hashimoto, K Oura, Y Nankaku, ... 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 40 | 2018 |
Trajectory training considering global variance for speech synthesis based on neural networks K Hashimoto, K Oura, Y Nankaku, K Tokuda 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 39 | 2016 |
Singing voice synthesis based on convolutional neural networks K Nakamura, K Hashimoto, K Oura, Y Nankaku, K Tokuda arXiv preprint arXiv:1904.06868, 2019 | 36 | 2019 |
Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis T Fujimoto, K Hashimoto, K Oura, Y Nankaku, K Tokuda 10th ISCA Speech Synthesis Workshop. ISCA, Vienna, Austria, 2019 | 29 | 2019 |
Hierarchical multi-grained generative model for expressive speech synthesis Y Hono, K Tsuboi, K Sawada, K Hashimoto, K Oura, Y Nankaku, ... arXiv preprint arXiv:2009.08474, 2020 | 28 | 2020 |
Statistical voice conversion based on WaveNet J Niwa, T Yoshimura, K Hashimoto, K Oura, Y Nankaku, K Tokuda 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 24 | 2018 |
A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech. K Nakamura, K Hashimoto, K Oura, Y Nankaku, K Tokuda Interspeech, 2494-2498, 2014 | 23 | 2014 |
Fast and high-quality singing voice synthesis system based on convolutional neural networks K Nakamura, S Takaki, K Hashimoto, K Oura, Y Nankaku, K Tokuda ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 22 | 2020 |
Temporal modeling in neural network based statistical parametric speech synthesis. K Tokuda, K Hashimoto, K Oura, Y Nankaku SSW, 106-111, 2016 | 22 | 2016 |
Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis K Shirota, K Nakamura, K Hashimoto, K Oura, Y Nankaku, K Tokuda 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 22 | 2014 |
Impacts of machine translation and speech synthesis on speech-to-speech translation K Hashimoto, J Yamagishi, W Byrne, S King, K Tokuda Speech Communication 54 (7), 857-866, 2012 | 21 | 2012 |
Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition. K Hashimoto, H Zen, Y Nankaku, A Lee, K Tokuda INTERSPEECH, 936-939, 2008 | 21 | 2008 |
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components Y Hono, S Takaki, K Hashimoto, K Oura, Y Nankaku, K Tokuda ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 18 | 2021 |
An analysis of machine translation and speech synthesis in speech-to-speech translation system K Hashimoto, J Yamagishi, W Byrne, S King, K Tokuda 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 17 | 2011 |
Mel-cepstrum-based quantization noise shaping applied to neural-network-based speech waveform synthesis T Yoshimura, K Hashimoto, K Oura, Y Nankaku, K Tokuda IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (7), 1177 …, 2018 | 16 | 2018 |