Follow
Xin Wang
Title
Cited by
Cited by
Year
Asvspoof 2019: Future horizons in spoofed and fake audio detection
M Todisco, X Wang, V Vestman, M Sahidullah, H Delgado, A Nautsch, ...
Proc. Interspeech, 1008-1012, 2019
3742019
Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech.
C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi
SSW, 146-152, 2016
2402016
ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language, 101114, 2020
1562020
Neural source-filter-based waveform model for statistical parametric speech synthesis
X Wang, S Takaki, J Yamagishi
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1182019
Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings
E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1152020
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
J Yamagishi, X Wang, M Todisco, M Sahidullah, J Patino, A Nautsch, ...
Proc. 2021 Edition of the Automatic Speaker Verification and Spoofing …, 2021
1032021
Neural source-filter waveform models for statistical parametric speech synthesis
X Wang, S Takaki, J Yamagishi
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 402-415, 2019
992019
Introducing the VoicePrivacy initiative
N Tomashenko, BML Srivastava, X Wang, E Vincent, A Nautsch, ...
Proc. Interspeech, 1693--1697, 2020
882020
Speaker anonymization using x-vector and neural waveform models
F Fang, X Wang, J Yamagishi, I Echizen, M Todisco, N Evans, ...
Proc. SSW, 155-160, 2019
852019
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language
Y Yasuda, X Wang, S Takaki, J Yamagishi
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
822019
Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks.
C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi
Interspeech, 352-356, 2016
752016
A comparative study on recent neural spoofing countermeasures for synthetic speech detection
X Wang, J Yamagishi
Proc. Interspeech, 4259--4263, 2021
732021
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis
X Wang, J Lorenzo-Trueba, S Takaki, L Juvela, J Yamagishi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
702018
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data
J Lorenzo-Trueba, F Fang, X Wang, I Echizen, J Yamagishi, T Kinnunen
Proc. Speaker Odyssey, 240-247, 2018
702018
Speech waveform synthesis from MFCC sequences with generative adversarial networks
L Juvela, B Bollepalli, X Wang, H Kameoka, M Airaksinen, J Yamagishi, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
582018
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech
A Nautsch, X Wang, N Evans, TH Kinnunen, V Vestman, M Todisco, ...
IEEE Transactions on Biometrics, Behavior, and Identity Science 3 (2), 252-265, 2021
552021
Tandem assessment of spoofing countermeasures and automatic speaker verification: Fundamentals
T Kinnunen, H Delgado, N Evans, KA Lee, V Vestman, A Nautsch, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2195-2210, 2020
542020
Joint training framework for text-to-speech and voice conversion using multi-source tacotron and wavenet
M Zhang, X Wang, F Fang, H Li, J Yamagishi
Proc. Interspeech, 1298-1302, 2019
512019
An autoregressive recurrent mixture density network for parametric speech synthesis
X Wang, S Takaki, J Yamagishi
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
482017
The VoicePrivacy 2020 Challenge: Results and findings
N Tomashenko, X Wang, E Vincent, J Patino, BML Srivastava, PG Noé, ...
Computer Speech & Language 74, 101362, 2022
402022
The system can't perform the operation now. Try again later.
Articles 1–20