The voice conversion challenge 2018: Promoting development of parallel and nonparallel methods J Lorenzo-Trueba, J Yamagishi, T Toda, D Saito, F Villavicencio, ... arXiv preprint arXiv:1804.04262, 2018 | 385 | 2018 |
High-quality nonparallel voice conversion based on cycle-consistent adversarial network F Fang, J Yamagishi, I Echizen, J Lorenzo-Trueba 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 169 | 2018 |
Towards achieving robust universal neural vocoding J Lorenzo-Trueba, T Drugman, J Latorre, T Merritt, B Putrycz, ... arXiv preprint arXiv:1811.06292v2, 2019 | 123 | 2019 |
Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis J Lorenzo-Trueba, GE Henter, S Takaki, J Yamagishi, Y Morino, Y Ochiai Speech Communication 99, 135-143, 2018 | 104 | 2018 |
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data J Lorenzo-Trueba, F Fang, X Wang, I Echizen, J Yamagishi, T Kinnunen arXiv preprint arXiv:1803.00860, 2018 | 88 | 2018 |
Effect of data reduction on sequence-to-sequence neural TTS J Latorre, J Lachowicz, J Lorenzo-Trueba, T Merritt, T Drugman, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 80 | 2019 |
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis X Wang, J Lorenzo-Trueba, S Takaki, L Juvela, J Yamagishi 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 79 | 2018 |
Low-resource expressive text-to-speech using data augmentation G Huybrechts, T Merritt, G Comini, B Perz, R Shah, J Lorenzo-Trueba ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 71 | 2021 |
Deep encoder-decoder models for unsupervised learning of controllable speech synthesis GE Henter, J Lorenzo-Trueba, X Wang, J Yamagishi arXiv preprint arXiv:1807.11470, 2018 | 68 | 2018 |
Using VAEs and normalizing flows for one-shot text-to-speech synthesis of expressive speech V Aggarwal, M Cotescu, N Prateek, J Lorenzo-Trueba, R Barra-Chicote ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 50 | 2020 |
Principles for learning controllable TTS from annotated and latent variation GE Henter, J Lorenzo-Trueba, X Wang, J Yamagishi Interspeech 2017, 3956-3960, 2017 | 49 | 2017 |
Segmenting human activities based on HMMs using smartphone inertial sensors R San-Segundo, J Lorenzo-Trueba, B Martínez-González, JM Pardo Pervasive and Mobile Computing 30, 84-96, 2016 | 48 | 2016 |
Dynamic prosody generation for speech synthesis using linguistics-driven acoustic embedding selection S Tyagi, M Nicolis, J Rohnke, T Drugman, J Lorenzo-Trueba arXiv preprint arXiv:1912.00955, 2019 | 45 | 2019 |
Emotion transplantation through adaptation in HMM-based speech synthesis J Lorenzo-Trueba, R Barra-Chicote, R San-Segundo, J Ferreiros, ... Computer Speech & Language 34 (1), 292-307, 2015 | 44 | 2015 |
Computer-assisted pronunciation training—Speech synthesis is almost all you need D Korzekwa, J Lorenzo-Trueba, T Drugman, B Kostek Speech Communication 142, 22-33, 2022 | 36 | 2022 |
Voice conversion for whispered speech synthesis M Cotescu, T Drugman, G Huybrechts, J Lorenzo-Trueba, A Moinet IEEE Signal Processing Letters 27, 186-190, 2019 | 36 | 2019 |
CAMP: a two-stage approach to modelling prosody in context Z Hodari, A Moinet, S Karlapati, J Lorenzo-Trueba, T Merritt, A Joly, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 35 | 2021 |
In other news: A bi-style text-to-speech model for synthesizing newscaster voice with limited data N Prateek, M Łajszczak, R Barra-Chicote, T Drugman, J Lorenzo-Trueba, ... arXiv preprint arXiv:1904.02790, 2019 | 34 | 2019 |
A spoofing benchmark for the 2018 voice conversion challenge: Leveraging from spoofing countermeasures for speech artifact assessment T Kinnunen, J Lorenzo-Trueba, J Yamagishi, T Toda, D Saito, ... arXiv preprint arXiv:1804.08438, 2018 | 29 | 2018 |
Cross-speaker style transfer for text-to-speech using data augmentation MS Ribeiro, J Roth, G Comini, G Huybrechts, A Gabryś, J Lorenzo-Trueba ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 27 | 2022 |