Follow
Yu-An Chung
Yu-An Chung
Facebook AI Research (FAIR)
Verified email at fb.com - Homepage
Title
Cited by
Cited by
Year
Ast: Audio spectrogram transformer
Y Gong, YA Chung, J Glass
arXiv preprint arXiv:2104.01778, 2021
6112021
An unsupervised autoregressive model for speech representation learning
YA Chung, WN Hsu, H Tang, J Glass
arXiv preprint arXiv:1904.03240, 2019
4222019
W2v-bert: Combining contrastive learning and masked language modeling for self-supervised speech pre-training
YA Chung, Y Zhang, W Han, CC Chiu, J Qin, R Pang, Y Wu
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
2662021
Speech2vec: A sequence-to-sequence framework for learning word embeddings from speech
YA Chung, J Glass
arXiv preprint arXiv:1803.08976, 2018
2062018
Audio word2vec: Unsupervised learning of audio segment representations using sequence-to-sequence autoencoder
YA Chung, CC Wu, CH Shen, HY Lee, LS Lee
arXiv preprint arXiv:1603.00982, 2016
2052016
Generative pre-training for speech with autoregressive predictive coding
YA Chung, J Glass
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1892020
Ssast: Self-supervised audio spectrogram transformer
Y Gong, CI Lai, YA Chung, J Glass
Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 10699 …, 2022
1852022
Semi-supervised training for improving data efficiency in end-to-end speech synthesis
YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1362019
Psla: Improving audio tagging with pretraining, sampling, labeling, and aggregation
Y Gong, YA Chung, J Glass
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3292-3306, 2021
1352021
Disentangling correlated speaker and noise for speech synthesis via data augmentation and adversarial factorization
WN Hsu, Y Zhang, RJ Weiss, YA Chung, Y Wang, Y Wu, J Glass
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1222019
Unsupervised cross-modal alignment of speech and text embedding spaces
YA Chung, WH Weng, S Tong, J Glass
Advances in neural information processing systems 31, 2018
1092018
Vector-quantized autoregressive predictive coding
YA Chung, H Tang, J Glass
arXiv preprint arXiv:2005.08392, 2020
1072020
Cost-aware pre-training for multiclass cost-sensitive deep learning
YA Chung, HT Lin, SW Yang
arXiv preprint arXiv:1511.09337, 2015
1042015
Supervised and unsupervised transfer learning for question answering
YA Chung, HY Lee, J Glass
arXiv preprint arXiv:1711.05345, 2017
1012017
Learning deep representations of medical images using siamese CNNs with application to content-based image retrieval
YA Chung, WH Weng
arXiv preprint arXiv:1711.08490, 2017
902017
Non-autoregressive predictive coding for learning speech representations from local dependencies
AH Liu, YA Chung, J Glass
arXiv preprint arXiv:2011.00406, 2020
792020
SLAM: A unified encoder for speech and language modeling via speech-text joint pre-training
A Bapna, Y Chung, N Wu, A Gulati, Y Jia, JH Clark, M Johnson, J Riesa, ...
arXiv preprint arXiv:2110.10329, 2021
722021
Splat: Speech-language joint pre-training for spoken language understanding
YA Chung, C Zhu, M Zeng
arXiv preprint arXiv:2010.02295, 2020
652020
libact: Pool-based active learning in python
YY Yang, SC Lee, YA Chung, TE Wu, SA Chen, HT Lin
arXiv preprint arXiv:1710.00379, 2017
602017
Improved speech representations with multi-target autoregressive predictive coding
YA Chung, J Glass
arXiv preprint arXiv:2004.05274, 2020
542020
The system can't perform the operation now. Try again later.
Articles 1–20