Yongqiang Wang

Cited by

	All	Since 2019
Citations	3851	2718
h-index	25	21
i10-index	37	28

640

320

160

480

2011201220132014201520162017201820192020202120222023202410 22 52 123 170 212 197 315 328 355 496 460 628 449

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Mike SeltzerFacebookVerified email at fb.com
Frank Qiaochu ZhangFacebookVerified email at fb.com
Mark GalesCambridge UniversityVerified email at eng.cam.ac.uk
Ching-Feng YehResearch Scientist, FAIRVerified email at fb.com
Duc LeMeta AIVerified email at meta.com
Yangyang ShiMetaVerified email at fb.com
Xiaohui ZhangFacebookVerified email at fb.com
Philip WoodlandProfessor of Information Engineering, Cambridge University Engineering DepartmentVerified email at eng.cam.ac.uk
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Xie ChenShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Xunying LiuChinese University of Hong KongVerified email at se.cuhk.edu.hk
Penny KaranasouSpeech Scientist, AmazonVerified email at amazon.com

Yongqiang Wang

Research Scientist, Google

Verified email at google.com - Homepage

foundation model deep learning speech processing multimodal


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
An investigation of deep neural networks for noise robust speech recognition ML Seltzer, D Yu, Y Wang 2013 IEEE international conference on acoustics, speech and signal …, 2013	803	2013
CNTK: Microsoft's open-source deep-learning toolkit F Seide, A Agarwal Proceedings of the 22nd ACM SIGKDD international conference on knowledge …, 2016	635	2016
Towards end-to-end spoken language understanding D Serdyuk, Y Wang, C Fuegen, A Kumar, B Liu, Y Bengio 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	257	2018
Transformer-based acoustic modeling for hybrid speech recognition Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	254	2020
Google usm: Scaling automatic speech recognition beyond 100 languages Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ... arXiv preprint arXiv:2303.01037, 2023	178	2023
Transformer-transducer: End-to-end speech recognition with self-attention CF Yeh, J Mahadeokar, K Kalgaonkar, Y Wang, D Le, M Jain, K Schubert, ... arXiv preprint arXiv:1910.12977, 2019	165	2019
Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition Y Shi, Y Wang, C Wu, CF Yeh, J Chan, F Zhang, D Le, M Seltzer ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	162	2021
Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022	160	2022
Efficient lattice rescoring using recurrent neural network language models X Liu, Y Wang, X Chen, MJF Gales, PC Woodland 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014	113	2014
Audiopalm: A large language model that can speak and listen PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... arXiv preprint arXiv:2306.12925, 2023	101	2023
Simplifying long short-term memory acoustic models for fast training and decoding Y Miao, J Li, Y Wang, SX Zhang, Y Gong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	100	2016
Adaptation of deep neural network acoustic models using factorised i-vectors. P Karanasou, Y Wang, MJF Gales, PC Woodland Interspeech 2014, 2180-2184, 2014	83	2014
Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch. X Chen, Y Wang, X Liu, MJF Gales, PC Woodland Interspeech 14, 641-645, 2014	82	2014
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory C Wu, Y Wang, Y Shi, CF Yeh, F Zhang arXiv preprint arXiv:2005.08042, 2020	67	2020
Speaker and noise factorization for robust speech recognition Y Wang, MJF Gales IEEE Transactions on Audio, Speech, and Language Processing 20 (7), 2149-2158, 2012	61	2012
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR. Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen Interspeech, 3490-3494, 2019	55	2019
Investigations on speaker adaptation of LSTM RNN models for speech recognition C Liu, Y Wang, K Kumar, Y Gong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	54	2016
End-to-end contextual speech recognition using class language models and a token passing decoder Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	53	2019
Small-footprint high-performance deep neural network-based speech recognition using split-VQ Y Wang, J Li, Y Gong 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	47	2015
Deja-vu: Double feature presentation and iterated loss in deep transformer networks A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	46	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors