MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets Z Ma, Z Zheng, C Tang, Y Wang, X Chen INTERSPEECH 2023, 2022 | 18 | 2022 |
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition Z Ma, W Wu, Z Zheng, Y Guo, Q Chen, S Zhang, X Chen ICASSP 2024, 2023 | 4 | 2023 |
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation Z Ma, Z Zheng, G Yang, Y Wang, C Zhang, X Chen INTERSPEECH 2023, 2023 | 4 | 2023 |
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer W Chen, Y Liang, Z Ma, Z Zheng, X Chen IJCAI 2024, 2024 | 3 | 2024 |
Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning G Yang, Z Ma, Z Zheng, Y Song, Z Niu, X Chen ASRU 2023, 2023 | 3 | 2023 |
Front-End Adapter: Adapting Front-End Input of Speech Based Self-Supervised Learning for Speech Recognition X Chen, Z Ma, C Tang, Y Wang, Z Zheng ICASSP 2023, 1-5, 2023 | 3 | 2023 |
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition Z Zheng, Z Ma, Y Wang, X Chen INTERSPEECH 2023, 2023 | 2 | 2023 |
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen arXiv preprint arXiv:2312.15185, 2023 | 1 | 2023 |
Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition Y Wang, C Tang, Z Ma, Z Zheng, X Chen, WQ Zhang ASRU 2023, 2022 | 1 | 2022 |
BAT: Learning to Reason about Spatial Sounds with Large Language Models Z Zheng, P Peng, Z Ma, X Chen, E Choi, D Harwath arXiv preprint arXiv:2402.01591, 2024 | | 2024 |