Takaaki Saeki

Cited by

	All	Since 2019
Citations	231	231
h-index	8	8
i10-index	6	6

120

202020212022202320241 11 44 111 63

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Shinnosuke TakamichiKeio UniversityVerified email at keio.jp
Hiroshi SaruwatariProfessor, The University of TokyoVerified email at ipc.i.u-tokyo.ac.jp
Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Yuki SaitoLecturer, The University of TokyoVerified email at ipc.i.u-tokyo.ac.jp
Soumi MaitiCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Detai XinThe University of TokyoVerified email at ipc.i.u-tokyo.ac.jp
Ryuichi YamamotoLY CorporationVerified email at lycorp.co.jp
Wataru NakataThe University of TokyoVerified email at g.ecc.u-tokyo.ac.jp
Jiatong Shi (史嘉彤)Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Yusuke YasdaNagoya universityVerified email at g.sp.m.is.nagoya-u.ac.jp
Tomoki HayashiHuman Dataware Lab. Co., Ltd., Nagoya UniversityVerified email at g.sp.m.is.nagoya-u.ac.jp
Yooncheol JuSpeech synthesis AI researcher, 42dot.Inc, Hyundai Motor GroupVerified email at 42dot.ai
Peter WuSchool of Computer Science, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Takenori YoshimuraNagoya Institute of TechnologyVerified email at nitech.ac.jp
Sayaka ShiotaTokyo Metropolitan UniversityVerified email at tmu.ac.jp
Yuta Matsunaga東京大学Verified email at g.ecc.u-tokyo.ac.jp
Xinjian LiGoogleVerified email at google.com
Heiga ZenPrincipal Scientist (Director), Google DeepMindVerified email at google.com
Zixiong SuPh.D. Candidate at the University of TokyoVerified email at g.ecc.u-tokyo.ac.jp
Bhuvana RamabhadranManager, GoogleVerified email at google.com

Takaaki Saeki

Google

Verified email at google.com - Homepage

Speech Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022 T Saeki, D Xin, W Nakata, T Koriyama, S Takamichi, H Saruwatari arXiv preprint arXiv:2204.02152, 2022	53	2022
Espnet2-tts: Extending the edge of tts research T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ... arXiv preprint arXiv:2110.07840, 2021	42	2021
Incremental text-to-speech synthesis using pseudo lookahead with large pretrained language model T Saeki, S Takamichi, H Saruwatari IEEE Signal Processing Letters 28, 857-861, 2021	19	2021
JTubeSpeech: corpus of Japanese speech collected from YouTube for speech recognition and speaker verification S Takamichi, L Kürzinger, T Saeki, S Shiota, S Watanabe arXiv preprint arXiv:2112.09323, 2021	17	2021
SpeechLMScore: Evaluating speech generation using speech language model S Maiti, Y Peng, T Saeki, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	16	2023
Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU. T Saeki, Y Saito, S Takamichi, H Saruwatari INTERSPEECH, 1021-1022, 2020	14	2020
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech D Yang, T Koriyama, Y Saito, T Saeki, D Xin, H Saruwatari ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	9	2023
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning T Saeki, K Tachibana, R Yamamoto arXiv preprint arXiv:2203.15683, 2022	8	2022
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	7	2023
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari arXiv preprint arXiv:2301.12596, 2023	6	2023
Text-to-speech synthesis from dark data with evaluation-in-the-loop data selection K Seki, S Takamichi, T Saeki, H Saruwatari arXiv preprint arXiv:2210.14850, 2022	6	2022
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling T Saeki, S Takamichi, T Nakamura, N Tanji, H Saruwatari arXiv preprint arXiv:2203.12937, 2022	6	2022
End-to-End Deep Learning Speech Recognition Model for Silent Speech Challenge. N Kimura, Z Su, T Saeki INTERSPEECH, 1025-1026, 2020	6	2020
Lifter training and sub-band modeling for computationally efficient and high-quality voice conversion using spectral differentials T Saeki, Y Saito, S Takamichi, H Saruwatari ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	4	2020
Personalized filled-pause generation with group-wise prediction models Y Matsunaga, T Saeki, S Takamichi, H Saruwatari arXiv preprint arXiv:2203.09961, 2022	3	2022
Real-Time Full-Band Voice Conversion with Sub-Band Modeling and Data-Driven Phase Estimation of Spectral Differentials T Saeki, Y Saito, S Takamichi, H Saruwatari IEICE TRANSACTIONS on Information and Systems 104 (7), 1002-1016, 2021	3	2021
vTTS: visual-text to speech Y Nakano, T Saeki, S Takamichi, K Sudoh, H Saruwatari 2022 IEEE Spoken Language Technology Workshop (SLT), 936-942, 2023	2	2023
Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis Y Matsunaga, T Saeki, S Takamichi, H Saruwatari 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022	2	2022
Improving robustness of spontaneous speech synthesis with linguistic speech regularization and pseudo-filled-pause insertion Y Matsunaga, T Saeki, S Takamichi, H Saruwatari arXiv preprint arXiv:2210.09815, 2022	2	2022
Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network T Saeki, S Takamichi, H Saruwatari 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	2	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors