Jiatong Shi (史嘉彤)
Verified email at andrew.cmu.edu
Title
Cited by
Year
SUPERB: Speech processing Universal PERformance Benchmark
S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ...
Proceedings of the Interspeech, 1194-1198, 2021
Cited by: 863; Year: 2021
Recent developments on ESPnet toolkit boosted by Conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 293; Year: 2021
AudioGPT: Understanding and generating speech, music, sound, and talking head
R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024
Cited by: 131; Year: 2024
Findings of the IWSLT 2022 Evaluation Campaign.
A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, ...
Proceedings of the 19th International Conference on Spoken Language …, 2022
Cited by: 105; Year: 2022
SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative capabilities
HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ...
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
Cited by: 93; Year: 2022
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning
W Hou, Y Dong, B Zhuang, L Yang, J Shi, T Shinozaki
Proceedings of the Interspeech, 1037-1041, 2020
Cited by: 78; Year: 2020
UniAudio: Towards Universal Audio Generation with Large Language Models
D Yang, J Tian, X Tan, R Huang, S Liu, H Guo, X Chang, J Shi, J Bian, ...
Forty-first International Conference on Machine Learning, 2024
Cited by: 68*; Year: 2024
Context-aware Goodness of Pronunciation for Computer-Assisted Pronunciation Training
J Shi, N Huo, Q Jin
Proceedings of the Interspeech, 3057-3061, 2020
Cited by: 62; Year: 2020
ESPnet2-TTS: Extending the edge of TTS research
T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...
arXiv preprint arXiv:2110.07840, 2021
Cited by: 59; Year: 2021
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
J Shi, D Berrebbi, W Chen, HL Chung, EP Hu, WP Huang, X Chang, ...
Proceedings of the Interspeech, 884-888, 2023
Cited by: 47; Year: 2023
The singing voice conversion challenge 2023
WC Huang, LP Violeta, S Liu, J Shi, T Toda
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by: 45; Year: 2023
Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec
J Shi, JD Amith, RC García, EG Sierra, K Duh, S Watanabe
Proceedings of the 16th Conference of the European Chapter of the …, 2021
Cited by: 39; Year: 2021
Findings of the IWSLT 2023 Evaluation Campaign
M Agarwal, S Agarwal, A Anastasopoulos, L Bentivogli, O Bojar, C Borg, ...
Association for Computational Linguistics, 2023
Cited by: 38; Year: 2023
Improving massively multilingual ASR with auxiliary CTC objectives
W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 34; Year: 2023
SUPERB @ SLT 2022: Challenge on generalization and efficiency of self-supervised speech representation learning
T Feng, A Dong, CF Yeh, S Yang, TQ Lin, J Shi, KW Chang, Z Huang, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 1096-1103, 2023
Cited by: 33; Year: 2023
Reproducing Whisper-style training using an open-source toolkit and publicly available data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by: 30; Year: 2023
Sequence-to-sequence singing voice synthesis with perceptual entropy loss
J Shi, S Guo, N Huo, Y Zhang, Q Jin
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 28; Year: 2021
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study
X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 27; Year: 2024
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners
R Huang, C Zhang, Y Wang, D Yang, J Tian, Z Ye, L Liu, Z Wang, Z Jiang, ...
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
Cited by: 25*; Year: 2024
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation
D Berrebbi, J Shi, B Yan, O Lopez-Francisco, JD Amith, S Watanabe
Proceedings of the Interspeech, 3533-3537, 2022
Cited by: 24; Year: 2022