Yu-An Chung

Cited by

	All	Since 2019
Citations	4250	4129
h-index	29	29
i10-index	34	34

1300

650

325

975

2017201820192020202120222023202431 83 231 366 520 932 1238 817

Co-authors

James GlassMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
Yuan GongResearch Scientist, MIT CSAILVerified email at mit.edu
Wei-Ning HsuFacebook AI Research (FAIR)Verified email at csail.mit.edu
Yu ZhangOpenAIVerified email at csail.mit.edu
Wei-Hung WengGoogle ResearchVerified email at mit.edu
Hung-yi LeeNational Taiwan UniversityVerified email at ntu.edu.tw
Hsuan-Tien LinProfessor of Computer Science and Information Engineering, National Taiwan UniversityVerified email at csie.ntu.edu.tw
Schrasing TongMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
Yuxuan WangByteDanceVerified email at cse.ohio-state.edu
Shao-Wen YangSr. Applied Scientist at AmazonVerified email at amazon.com
Sravya PopuriResearch Engineer, Facebook AI ResearchVerified email at fb.com
Chenguang ZhuHead of Zoom GenAI ScienceVerified email at zoom.us
RJ Skerry-RyanGoogle, Inc.Verified email at alum.mit.edu
Peng-Jen ChenFacebookVerified email at fb.com
Juan PinoMetaVerified email at fb.com
Hirofumi InagumaFundamental AI Research (FAIR) at MetaVerified email at meta.com
Ann LeeMeta AIVerified email at csail.mit.edu
Alexander H. LiuMassachusetts Institute of TechnologyVerified email at mit.edu
Anmol GulatiResearcher, Google DeepmindVerified email at google.com
Alexis ConneauOpenAIVerified email at openai.com

Yu-An Chung

Facebook AI Research (FAIR)

Verified email at fb.com - Homepage

Machine Learning Speech Processing Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Ast: Audio spectrogram transformer Y Gong, YA Chung, J Glass arXiv preprint arXiv:2104.01778, 2021	813	2021
An unsupervised autoregressive model for speech representation learning YA Chung, WN Hsu, H Tang, J Glass arXiv preprint arXiv:1904.03240, 2019	443	2019
W2v-bert: Combining contrastive learning and masked language modeling for self-supervised speech pre-training YA Chung, Y Zhang, W Han, CC Chiu, J Qin, R Pang, Y Wu 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	344	2021
Ssast: Self-supervised audio spectrogram transformer Y Gong, CI Lai, YA Chung, J Glass Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 10699 …, 2022	243	2022
Audio word2vec: Unsupervised learning of audio segment representations using sequence-to-sequence autoencoder YA Chung, CC Wu, CH Shen, HY Lee, LS Lee arXiv preprint arXiv:1603.00982, 2016	216	2016
Speech2vec: A sequence-to-sequence framework for learning word embeddings from speech YA Chung, J Glass arXiv preprint arXiv:1803.08976, 2018	212	2018
Generative pre-training for speech with autoregressive predictive coding YA Chung, J Glass ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	198	2020
Psla: Improving audio tagging with pretraining, sampling, labeling, and aggregation Y Gong, YA Chung, J Glass IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3292-3306, 2021	159	2021
Semi-supervised training for improving data efficiency in end-to-end speech synthesis YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	139	2019
Disentangling correlated speaker and noise for speech synthesis via data augmentation and adversarial factorization WN Hsu, Y Zhang, RJ Weiss, YA Chung, Y Wang, Y Wu, J Glass ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	124	2019
Vector-quantized autoregressive predictive coding YA Chung, H Tang, J Glass arXiv preprint arXiv:2005.08392, 2020	114	2020
Unsupervised cross-modal alignment of speech and text embedding spaces YA Chung, WH Weng, S Tong, J Glass Advances in neural information processing systems 31, 2018	110	2018
Supervised and unsupervised transfer learning for question answering YA Chung, HY Lee, J Glass arXiv preprint arXiv:1711.05345, 2017	107	2017
Cost-aware pre-training for multiclass cost-sensitive deep learning YA Chung, HT Lin, SW Yang arXiv preprint arXiv:1511.09337, 2015	107	2015
Learning deep representations of medical images using siamese cnns with application to content-based image retrieval YA Chung, WH Weng arXiv preprint arXiv:1711.08490, 2017	91	2017
Non-autoregressive predictive coding for learning speech representations from local dependencies AH Liu, YA Chung, J Glass arXiv preprint arXiv:2011.00406, 2020	89	2020
SLAM: A unified encoder for speech and language modeling via speech-text joint pre-training A Bapna, Y Chung, N Wu, A Gulati, Y Jia, JH Clark, M Johnson, J Riesa, ... arXiv preprint arXiv:2110.10329, 2021	79	2021
Splat: Speech-language joint pre-training for spoken language understanding YA Chung, C Zhu, M Zeng arXiv preprint arXiv:2010.02295, 2020	71	2020
libact: Pool-based active learning in python YY Yang, SC Lee, YA Chung, TE Wu, SA Chen, HT Lin arXiv preprint arXiv:1710.00379, 2017	61	2017
Improved speech representations with multi-target autoregressive predictive coding YA Chung, J Glass arXiv preprint arXiv:2004.05274, 2020	58	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors