Yusuke Yasuda
Verified email at g.sp.m.is.nagoya-u.ac.jp
Title
Cited by
Year
Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings
E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 161, 2020
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language
Y Yasuda, X Wang, S Takaki, J Yamagishi
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 97, 2019
ESPnet2-TTS: Extending the edge of TTS research
T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...
arXiv preprint arXiv:2110.07840, 2021
Cited by 31, 2021
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis
Y Yasuda, X Wang, J Yamagishi
Computer Speech & Language 67, 101183, 2021
Cited by 26, 2021
Can speaker augmentation improve multi-speaker end-to-end TTS?
E Cooper, CI Lai, Y Yasuda, J Yamagishi
arXiv preprint arXiv:2005.01245, 2020
Cited by 22, 2020
End-to-end text-to-speech using latent duration based on VQ-VAE
Y Yasuda, X Wang, J Yamagishi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 18, 2021
Initial investigation of encoder-decoder end-to-end TTS using marginalization of monotonic hard alignments
Y Yasuda, X Wang, J Yamagishi
Proc. Speech Synthesis Workshop, 211-216, 2019
Cited by 14, 2019
Modeling of Rakugo speech and its limitations: Toward speech synthesis that entertains audiences
S Kato, Y Yasuda, X Wang, E Cooper, S Takaki, J Yamagishi
IEEE Access 8, 138149-138161, 2020
Cited by 9, 2020
Rakugo speech synthesis using segment-to-segment neural transduction and style tokens—toward speech synthesis for entertaining audiences
S Kato, Y Yasuda, X Wang, E Cooper, S Takaki, J Yamagishi
Proc. 10th ISCA Speech Synth. Workshop, 111-116, 2019
Cited by 8, 2019
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments
Y Yasuda, X Wang, J Yamagishi
arXiv preprint arXiv:1908.11535, 2019
Cited by 8, 2019
The Singing Voice Conversion Challenge 2023
WC Huang, LP Violeta, S Liu, J Shi, Y Yasuda, T Toda
arXiv preprint arXiv:2306.14422, 2023
Cited by 6, 2023
Pretraining strategies, waveform model choice, and acoustic configurations for multi-speaker end-to-end speech synthesis
E Cooper, X Wang, Y Zhao, Y Yasuda, J Yamagishi
arXiv preprint arXiv:2011.04839, 2020
Cited by 6, 2020
Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language
Y Yasuda, T Toda
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1319-1328, 2022
Cited by 4, 2022
TTS tutorial at IEICE SP workshop
X Wang, Y Yasuda
Cited by 4, 2019
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment
Y Yasuda, X Wang, J Yamagishi
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 3, 2020
Use and evaluation of Tacotron and context features in Rakugo speech synthesis
S Kato, S Takaki, J Yamagishi, Y Yasuda
IEICE Technical Report 118 (495 …, 2019
Cited by 1, 2019
Preference-based training framework for automatic speech quality assessment using deep neural network
CH Hu, Y Yasuda, T Toda
arXiv preprint arXiv:2308.15203, 2023
2023
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder
Y Yasuda, T Toda
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
Spoken-Text-Style Transfer with Conditional Variational Autoencoder and Content Word Storage
D Yoshioka, Y Yasuda, N Matsunaga, Y Ohtani, T Toda
Proc. Interspeech 2022, 4576-4580, 2022
2022
How Similar or Different is Rakugo Speech Synthesizer to Professional Performers?
S Kato, Y Yasuda, X Wang, E Cooper, J Yamagishi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2021