Follow
Yifan Peng
Title
Cited by
Cited by
Year
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding
Y Peng, S Dalmia, I Lane, S Watanabe
International Conference on Machine Learning, 17627-17643, 2022
892022
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
662022
E-Branchformer: Branchformer with Enhanced merging for speech recognition
K Kim, F Wu, Y Peng, J Pan, P Sridhar, KJ Han, S Watanabe
2022 IEEE Spoken Language Technology Workshop (SLT), 84-91, 2023
502023
Improving Massively Multilingual ASR with Auxiliary CTC Objectives
W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
222023
DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models
Y Peng, Y Sudo, S Muhammad, S Watanabe
Proc. Interspeech, 2023, 2023
202023
SpeechLMScore: Evaluating speech generation using speech language model
S Maiti, Y Peng, T Saeki, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
162023
A study on the integration of pre-trained ssl, asr, LM and SLU models for spoken language understanding
Y Peng, S Arora, Y Higuchi, Y Ueda, S Kumar, K Ganesan, S Dalmia, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 406-413, 2023
162023
Anomaly Detection of Calcifications in Mammography Based on 11,000 Negative Cases
R Hou, Y Peng, LJ Grimm, Y Ren, MA Mazurowski, JR Marks, LM King, ...
IEEE Transactions on Biomedical Engineering 69 (5), 1639-1650, 2021
162021
Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
W Chen, X Chang, Y Peng, Z Ni, S Maiti, S Watanabe
Proc. Interspeech, 2023, 2023
142023
Structured Pruning of Self-Supervised Pre-Trained Models for Speech Recognition and Understanding
Y Peng, K Kim, F Wu, P Sridhar, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
122023
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
112023
CMU’s IWSLT 2022 Dialect Speech Translation System
B Yan, P Fernandes, S Dalmia, J Shi, Y Peng, D Berrebbi, X Wang, ...
Proceedings of the 19th International Conference on Spoken Language …, 2022
112022
VoxtLM: Unified Decoder-Only Models for Consolidating Speech Recognition, Synthesis and Speech, Text Continuation Tasks
S Maiti, Y Peng, S Choi, J Jung, X Chang, S Watanabe
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
102024
I3D: Transformer architectures with input-dependent dynamic depth for speech recognition
Y Peng, J Lee, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
82023
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
Y Peng, K Kim, F Wu, B Yan, S Arora, W Chen, J Tang, S Shon, P Sridhar, ...
Proc. Interspeech, 2023, 2023
82023
Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR
T Maekaku, Y Fujita, Y Peng, S Watanabe
Proc. Interspeech 2022, 1071-1075, 2022
62022
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
B Yan, J Shi, Y Tang, H Inaguma, Y Peng, S Dalmia, P Polák, ...
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
52023
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech
C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
CMU’s IWSLT 2023 Simultaneous Speech Translation System
B Yan, J Shi, S Maiti, W Chen, X Li, Y Peng, S Arora, S Watanabe
Proceedings of the 20th International Conference on Spoken Language …, 2023
42023
A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward Stop Quality Challenge
S Arora, H Futami, SL Wu, J Huynh, Y Peng, Y Kashiwagi, E Tsunoo, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
42023
The system can't perform the operation now. Try again later.
Articles 1–20