Yifan Peng

Cited by

	All	Since 2019
Citations	407	406
h-index	11	11
i10-index	13	13

240

120

180

20212022202320243 40 233 130

Public access

View all

4 articles

2 articles

available

not available

Based on funding mandates

Co-authors

Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Brian YanCarnegie Mellon UniversityVerified email at cs.cmu.edu
Siddhant AroraGraduate Student, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Soumi MaitiCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Siddharth DalmiaResearch Scientist, Google DeepMindVerified email at google.com
Jiatong Shi (史嘉彤)Carnegie Mellon UniversityVerified email at andrew.cmu.edu
William ChenCarnegie Mellon UniversityVerified email at cmu.edu
Xuankai ChangCarnegie Mellon University, StudentVerified email at andrew.cmu.edu
Yui SudoHonda Research Institute JapanVerified email at jp.honda-ri.com
Felix WuCharacter AIVerified email at character.ai
Prashant SridharASAPPVerified email at asapp.com
Dan BerrebbiApple - Carnegie Mellon University - Ecole PolytechniqueVerified email at andrew.cmu.edu
Jee-weon JungCarnegie Mellon UniversityVerified email at ieee.org
Ian LaneCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Hayato FutamiSony Group CorporationVerified email at sony.com
Alan W BlackProfessor, Language Technologies Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Wangyou ZhangPh.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Kyu Jeong HanAmazon Web Services (AWS)Verified email at amazon.com
Jing PanVerified email at microsoft.com
Zhaoheng NiMeta Reality LabsVerified email at meta.com

Yifan Peng

Carnegie Mellon University

Verified email at andrew.cmu.edu - Homepage

Speech Processing Speech Recognition Spoken Language Processing Foundation Models


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding Y Peng, S Dalmia, I Lane, S Watanabe International Conference on Machine Learning, 17627-17643, 2022	89	2022
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	66	2022
E-Branchformer: Branchformer with Enhanced merging for speech recognition K Kim, F Wu, Y Peng, J Pan, P Sridhar, KJ Han, S Watanabe 2022 IEEE Spoken Language Technology Workshop (SLT), 84-91, 2023	50	2023
Improving Massively Multilingual ASR with Auxiliary CTC Objectives W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	22	2023
DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models Y Peng, Y Sudo, S Muhammad, S Watanabe Proc. Interspeech, 2023, 2023	20	2023
SpeechLMScore: Evaluating speech generation using speech language model S Maiti, Y Peng, T Saeki, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	16	2023
A study on the integration of pre-trained ssl, asr, LM and SLU models for spoken language understanding Y Peng, S Arora, Y Higuchi, Y Ueda, S Kumar, K Ganesan, S Dalmia, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 406-413, 2023	16	2023
Anomaly Detection of Calcifications in Mammography Based on 11,000 Negative Cases R Hou, Y Peng, LJ Grimm, Y Ren, MA Mazurowski, JR Marks, LM King, ... IEEE Transactions on Biomedical Engineering 69 (5), 1639-1650, 2021	16	2021
Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute W Chen, X Chang, Y Peng, Z Ni, S Maiti, S Watanabe Proc. Interspeech, 2023, 2023	14	2023
Structured Pruning of Self-Supervised Pre-Trained Models for Speech Recognition and Understanding Y Peng, K Kim, F Wu, P Sridhar, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	12	2023
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	11	2023
CMU’s IWSLT 2022 Dialect Speech Translation System B Yan, P Fernandes, S Dalmia, J Shi, Y Peng, D Berrebbi, X Wang, ... Proceedings of the 19th International Conference on Spoken Language …, 2022	11	2022
VoxtLM: Unified Decoder-Only Models for Consolidating Speech Recognition, Synthesis and Speech, Text Continuation Tasks S Maiti, Y Peng, S Choi, J Jung, X Chang, S Watanabe ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	10	2024
I3D: Transformer architectures with input-dependent dynamic depth for speech recognition Y Peng, J Lee, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	8	2023
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks Y Peng, K Kim, F Wu, B Yan, S Arora, W Chen, J Tang, S Shon, P Sridhar, ... Proc. Interspeech, 2023, 2023	8	2023
Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR T Maekaku, Y Fujita, Y Peng, S Watanabe Proc. Interspeech 2022, 1071-1075, 2022	6	2022
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit B Yan, J Shi, Y Tang, H Inaguma, Y Peng, S Dalmia, P Polák, ... Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	5	2023
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	4	2024
CMU’s IWSLT 2023 Simultaneous Speech Translation System B Yan, J Shi, S Maiti, W Chen, X Li, Y Peng, S Arora, S Watanabe Proceedings of the 20th International Conference on Spoken Language …, 2023	4	2023
A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward Stop Quality Challenge S Arora, H Futami, SL Wu, J Huynh, Y Peng, Y Kashiwagi, E Tsunoo, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	4	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors