Follow
Frank Zhang
Title
Cited by
Cited by
Year
Transformer-based acoustic modeling for hybrid speech recognition
Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2652020
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
2152024
Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition
Y Shi, Y Wang, C Wu, CF Yeh, J Chan, F Zhang, D Le, M Seltzer
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1732021
Streaming transformer-based acoustic models using self-attention with augmented memory
C Wu, Y Wang, Y Shi, CF Yeh, F Zhang
arXiv preprint arXiv:2005.08042, 2020
682020
Improving RNN transducer based ASR with auxiliary tasks
C Liu, F Zhang, D Le, S Kim, Y Saraf, G Zweig
2021 IEEE Spoken Language Technology Workshop (SLT), 172-179, 2021
502021
Deja-vu: Double feature presentation and iterated loss in deep transformer networks
A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
482020
Improved language identification through cross-lingual self-supervised learning
A Tjandra, DG Choudhury, F Zhang, K Singh, A Conneau, A Baevski, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
462022
Weak-attention suppression for transformer based speech recognition
Y Shi, Y Wang, C Wu, C Fuegen, F Zhang, D Le, CF Yeh, ML Seltzer
arXiv preprint arXiv:2005.09137, 2020
312020
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
F Zhang, Y Wang, X Zhang, C Liu, Y Saraf, G Zweig
Interspeech 2020, 2020
302020
Multilingual graphemic hybrid ASR with massive data augmentation
C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig
arXiv preprint arXiv:1909.06522, 2019
302019
Contextualizing ASR lattice rescoring with hybrid pointer network language model
DR Liu, C Liu, F Zhang, G Synnaeve, Y Saraf, G Zweig
arXiv preprint arXiv:2005.07394, 2020
252020
Benchmarking lf-mmi, ctc and rnn-t criteria for streaming asr
X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ...
2021 IEEE spoken language technology workshop (SLT), 46-51, 2021
232021
Scaling asr improves zero and few shot learning
A Xiao, W Zheng, G Keren, D Le, F Zhang, C Fuegen, O Kalinli, Y Saraf, ...
arXiv preprint arXiv:2111.05948, 2021
222021
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications
Y Wang, Y Shi, F Zhang, C Wu, J Chan, CF Yeh, A Xiao
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
192021
Accent-robust automatic speech recognition using supervised and unsupervised wav2vec embeddings
J Li, V Manohar, P Chitkara, A Tjandra, M Picheny, F Zhang, X Zhang, ...
arXiv preprint arXiv:2110.03520, 2021
172021
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
X Zhang, V Manohar, D Zhang, F Zhang, Y Shi, N Singhal, J Chan, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
132021
Streaming attention-based models with augmented memory for end-to-end speech recognition
CF Yeh, Y Wang, Y Shi, C Wu, F Zhang, J Chan, ML Seltzer
2021 IEEE Spoken Language Technology Workshop (SLT), 8-14, 2021
122021
Training asr models by generation of contextual information
K Singh, D Okhonko, J Liu, Y Wang, F Zhang, R Girshick, S Edunov, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
62020
Multilingual ASR with massive data augmentation
C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig
arXiv preprint arXiv:1909.06522, 2019
32019
Deja-vu: Double feature presentation in deep transformer networks
A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ...
arXiv preprint, 2019
32019
The system can't perform the operation now. Try again later.
Articles 1–20