gao zhifu
Speech Lab, Alibaba Group
Verified email at alibaba-inc.com
Title
Cited by
Year
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System
Z Gao, Y Song, IV McLoughlin, P Li, Y Jiang, LR Dai
INTERSPEECH 2019, 361-365, 2019
Cited by: 78; Year: 2019
An Effective Deep Embedding Learning Architecture for Speaker Verification
Y Jiang, Y Song, IV McLoughlin, Z Gao, LR Dai
INTERSPEECH 2019, 4040-4044, 2019
Cited by: 34; Year: 2019
Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition
Z Gao, S Zhang, I McLoughlin, Z Yan
arXiv preprint arXiv:2206.08317, 2022
Cited by: 33; Year: 2022
Streaming chunk-aware multihead attention for online end-to-end speech recognition
S Zhang, Z Gao, H Luo, M Lei, J Gao, Z Yan, L Xie
INTERSPEECH 2020, 2142-2146, 2020
Cited by: 30; Year: 2020
An improved deep embedding learning method for short duration speaker verification
Z Gao, Y Song, IV McLoughlin, W Guo, LR Dai
INTERSPEECH 2018, 3578-3582, 2018
Cited by: 30; Year: 2018
San-m: Memory equipped self-attention for end-to-end speech recognition
Z Gao, S Zhang, M Lei, I McLoughlin
INTERSPEECH 2020, 6-10, 2020
Cited by: 28; Year: 2020
Extremely Low Footprint End-to-End ASR System for Smart Device
Z Gao, Y Yao, S Zhang, J Yang, M Lei, I McLoughlin
INTERSPEECH 2021, 4548-4552, 2021
Cited by: 15; Year: 2021
Universal ASR: Unifying streaming and non-streaming ASR using a single encoder-decoder model
Z Gao, S Zhang, M Lei, I McLoughlin
arXiv preprint arXiv:2010.14099, 2020
Cited by: 14; Year: 2020
Lauragpt: Listen, attend, understand, and regenerate audio with gpt
Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, W Wang, S Zheng, ...
arXiv preprint arXiv:2310.04673, 2023
Cited by: 11; Year: 2023
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Z Gao, Z Li, J Wang, H Luo, X Shi, M Chen, Y Li, L Zuo, Z Du, Z Xiao, ...
INTERSPEECH 2023, 2023
Cited by: 11; Year: 2023
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen
arXiv preprint arXiv:2312.15185, 2023
Cited by: 2; Year: 2023
SeACo-Paraformer: A non-autoregressive ASR system with flexible and effective hotword customization ability
X Shi, Y Yang, Z Li, Y Chen, Z Gao, S Zhang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 1; Year: 2024
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity
Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ...
arXiv preprint arXiv:2402.08846, 2024
2024
Wav2vec‐MoE: An unsupervised pre‐training and adaptation method for multi‐accent ASR
Y Lin, S Zhang, Z Gao, L Wang, Y Yang, J Dang
Electronics Letters 59 (11), e12823, 2023
2023
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
X Shi, H Luo, Z Gao, S Zhang, Z Yan
INTERSPEECH 2023, 2023
2023
Streaming End-to-End Speech Recognition Method, Apparatus and Electronic Device
S Zhang, GAO Zhifu
US Patent App. 17/976,464, 2023
2023
Speech Processing method, Speech Encoder, Speech Decoder and Speech Recognition System
S Zhang, GAO Zhifu, M Lei
US Patent App. 17/951,569, 2023
2023