Follow
Takaaki Saeki
Takaaki Saeki
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
T Saeki, D Xin, W Nakata, T Koriyama, S Takamichi, H Saruwatari
arXiv preprint arXiv:2204.02152, 2022
1232022
Espnet2-tts: Extending the edge of tts research
T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...
arXiv preprint arXiv:2110.07840, 2021
592021
SpeechLMScore: Evaluating speech generation using speech language model
S Maiti, Y Peng, T Saeki, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
272023
JTubeSpeech: corpus of Japanese speech collected from YouTube for speech recognition and speaker verification
S Takamichi, L Kürzinger, T Saeki, S Shiota, S Watanabe
arXiv preprint arXiv:2112.09323, 2021
232021
Incremental text-to-speech synthesis using pseudo lookahead with large pretrained language model
T Saeki, S Takamichi, H Saruwatari
IEEE Signal Processing Letters 28, 857-861, 2021
202021
Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU.
T Saeki, Y Saito, S Takamichi, H Saruwatari
INTERSPEECH, 1021-1022, 2020
152020
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech
D Yang, T Koriyama, Y Saito, T Saeki, D Xin, H Saruwatari
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
122023
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech
T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
122023
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining
T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari
arXiv preprint arXiv:2301.12596, 2023
122023
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
T Saeki, K Tachibana, R Yamamoto
arXiv preprint arXiv:2203.15683, 2022
122022
SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics
T Saeki, S Maiti, S Takamichi, S Watanabe, H Saruwatari
arXiv preprint arXiv:2401.16812, 2024
82024
Text-to-speech synthesis from dark data with evaluation-in-the-loop data selection
K Seki, S Takamichi, T Saeki, H Saruwatari
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
82023
End-to-End Deep Learning Speech Recognition Model for Silent Speech Challenge.
N Kimura, Z Su, T Saeki
INTERSPEECH, 1025-1026, 2020
82020
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
T Saeki, G Wang, N Morioka, I Elias, K Kastner, A Rosenberg, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
72024
Yodas: Youtube-Oriented Dataset for Audio and Speech
X Li, S Takamichi, T Saeki, W Chen, S Shiota, S Watanabe
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
72023
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
T Saeki, S Takamichi, T Nakamura, N Tanji, H Saruwatari
arXiv preprint arXiv:2203.12937, 2022
62022
Diversity-based core-set selection for text-to-speech with linguistic and acoustic features
K Seki, S Takamichi, T Saeki, H Saruwatari
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network
T Saeki, S Takamichi, H Saruwatari
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
42021
Lifter training and sub-band modeling for computationally efficient and high-quality voice conversion using spectral differentials
T Saeki, Y Saito, S Takamichi, H Saruwatari
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
42020
SSR7000: A Synchronized Corpus of Ultrasound Tongue Imaging for End-to-End Silent Speech Recognition
N Kimura, Z Su, T Saeki, J Rekimoto
Proceedings of the Thirteenth Language Resources and Evaluation Conference …, 2022
32022
The system can't perform the operation now. Try again later.
Articles 1–20