UTMOS: Utokyo-sarulab system for voiceMOS Challenge 2022 T Saeki, D Xin, W Nakata, T Koriyama, S Takamichi, H Saruwatari arXiv preprint arXiv:2204.02152, 2022 | 30 | 2022 |
Audiobook Speech Synthesis Conditioned by Cross-Sentence Context-Aware Word Embeddings W Nakata, T Koriyama, S Takamichi, N Tanji, Y Ijima, R Masumura, ... Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 211-215, 2021 | 12 | 2021 |
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis S Takamichi, W Nakata, N Tanji, H Saruwatari arXiv preprint arXiv:2201.10896, 2022 | 3 | 2022 |
Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis W Nakata, T Koriyama, S Takamichi, Y Saito, Y Ijima, R Masumura, ... Proc. Interspeech 2022, 4551-4555, 2022 | 3 | 2022 |
J-KAC: 日本語オーディオブック・紙芝居朗読音声コーパス 高道慎之介, 中田亘, 郡山知樹, 丹治尚子, 井島勇祐, 増村亮, 猿渡洋 研究報告音楽情報科学 (MUS) 2021 (14), 1-4, 2021 | 1 | 2021 |
Coco-Nut: Corpus of Japanese Utterance and Voice Characteristics Description for Prompt-based Control A Watanabe, S Takamichi, Y Saito, W Nakata, D Xin, H Saruwatari arXiv preprint arXiv:2309.13509, 2023 | | 2023 |
Audiobook Speech Synthesis based on Character embedding for Distinguishable Character Acting W NAKATA, T KOORIYAMA, S TAKAMICHI, Y SAITO, Y IJIMA, ... 日本音響学会研究発表会講演論文集 (CD-ROM) 2022, 3-3, 2022 | | 2022 |
VQVAE によって獲得されたキャラクター演技スタイルに基づく多話者オーディオブック音声合成 中田亘, 郡山知樹, 高道慎之介, 齋藤佑樹, 井島勇祐, 増村亮, 猿渡洋 電子情報通信学会技術研究報告; 信学技報 121 (282), 42-47, 2021 | | 2021 |