Xiaohui Zhang

Cited by

	All	Since 2019
Citations	1723	1129
h-index	16	15
i10-index	22	21

Citations per year
Year	Citations
2014	37
2015	106
2016	131
2017	143
2018	166
2019	148
2020	157
2021	244
2022	206
2023	255
2024	113

Public access (based on funding mandates)

2 articles available
0 articles not available

Co-authors

Daniel Povey | Chief Speech Scientist, Xiaomi Corp. | Verified email at xiaomi.com
Sanjeev Khudanpur | The Johns Hopkins University | Verified email at jhu.edu
Chunxi Liu | Two Sigma | Verified email at twosigma.com
Frank Qiaochu Zhang | Facebook | Verified email at fb.com
Andros Tjandra | Facebook AI (research scientist) | Verified email at fb.com
Mike Seltzer | Facebook | Verified email at fb.com
Jan "Yenda" Trmal | Associate Research Scientist at Johns Hopkins University | Verified email at jhu.edu
Yongqiang Wang | Research Scientist, Google | Verified email at google.com
Vimal Manohar | Meta Platforms Inc. | Verified email at meta.com
Duc Le | Senior Staff Research Scientist, Meta AI | Verified email at meta.com
Christian Fuegen | Facebook Inc. | Verified email at fb.com
Pegah Ghahremani | Amazon | Verified email at jhu.edu
Yiming Wang | Microsoft | Verified email at microsoft.com
Hainan Xu | NVIDIA | Verified email at nvidia.com
Daniel Garcia-Romero | Principal Applied Scientist, AWS AI | Verified email at amazon.com
Guoguo Chen | Seasalt.ai, Vobil.com, Baidu, KITT.AI | Verified email at seasalt.ai
Aren Jansen | Google Research | Verified email at google.com
Haomiao Liu | Alibaba | Verified email at vipl.ict.ac.cn

Xiaohui Zhang

Facebook

Verified email at fb.com

Speech Synthesis, Speech Recognition, Optimization, Lexicon Learning


Title | Authors | Venue	Cited by	Year
Improving deep neural network acoustic models using generalized maxout networks | X Zhang, J Trmal, D Povey, S Khudanpur | 2014 IEEE international conference on acoustics, speech and signal …, 2014	400	2014
Parallel training of deep neural networks with natural gradient and parameter averaging | D Povey, X Zhang, S Khudanpur | arXiv preprint arXiv:1410.7455, 124, 2014	394	2014
Transformer-based acoustic modeling for hybrid speech recognition | Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ... | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	242	2020
Improving Speaker Recognition Performance in the Domain Adaptation Challenge using Deep Neural Networks | D Garcia-Romero, X Zhang, A McCree, D Povey | Proc. SLT, 2014	108	2014
Scaling speech technology to 1,000+ languages | V Pratap, A Tjandra, B Shi, P Tomasello, A Babu, S Kundu, A Elkahky, ... | Journal of Machine Learning Research (JMLR), 2024	106	2024
From senones to chenones: Tied context-dependent graphemes for hybrid speech recognition | D Le, X Zhang, W Zheng, C Fügen, G Zweig, ML Seltzer | 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	65	2019
A KEYWORD SEARCH SYSTEM USING OPEN SOURCE SOFTWARE | J Trmal, G Chen, D Povey, S Khudanpur, P Ghahremani, X Zhang, ... | Proc. SLT, 2014	50	2014
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. | J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ... | Interspeech, 3597-3601, 2017	47	2017
Deja-vu: Double feature presentation and iterated loss in deep transformer networks | A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ... | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	43	2020
Backstitch: Counteracting Finite-Sample Bias via Negative Steps. | Y Wang, V Peddinti, H Xu, X Zhang, D Povey, S Khudanpur | Interspeech, 1631-1635, 2017	32	2017
Towards measuring fairness in speech recognition: Casual conversations dataset transcriptions | C Liu, M Picheny, L Sarı, P Chitkara, A Xiao, X Zhang, M Chou, ... | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	29	2022
Faster, simpler and more accurate hybrid asr systems using wordpieces | F Zhang, Y Wang, X Zhang, C Liu, Y Saraf, G Zweig | Interspeech 2020, 2020	29	2020
Multilingual graphemic hybrid ASR with massive data augmentation | C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig | arXiv preprint arXiv:1909.06522, 2019	26	2019
A diversity-penalizing ensemble training method for deep learning | X Zhang, D Povey, S Khudanpur | Sixteenth Annual Conference of the International Speech Communication …, 2015	26	2015
Benchmarking lf-mmi, ctc and rnn-t criteria for streaming asr | X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ... | 2021 IEEE spoken language technology workshop (SLT), 46-51, 2021	21	2021
Accent-robust automatic speech recognition using supervised and unsupervised wav2vec embeddings | J Li, V Manohar, P Chitkara, A Tjandra, M Picheny, F Zhang, X Zhang, ... | arXiv preprint arXiv:2110.03520, 2021	17	2021
Torchaudio-squim: Reference-less speech quality and intelligibility measures in torchaudio | A Kumar, K Tan, Z Ni, P Manocha, X Zhang, E Henderson, B Xu | ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	15	2023
Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework | X Zhang, V Manohar, D Povey, S Khudanpur | Interspeech 2017, 2017	13	2017
Omni-sparsity dnn: Fast sparsity optimization for on-device streaming e2e asr via supernet | H Yang, Y Shangguan, D Wang, M Li, P Chuang, X Zhang, G Venkatesh, ... | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	10	2022
Streaming transformer transducer based speech recognition using non-causal convolution | Y Shi, C Wu, D Wang, A Xiao, J Mahadeokar, X Zhang, C Liu, K Li, ... | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	10	2022

Articles 1–20
