Soumi Maiti
Verified email at andrew.cmu.edu
Title
Cited by
Year
Generating multilingual voices using speaker space translation based on bilingual speaker data
S Maiti, E Marchi, A Conkie
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 20, Year: 2020
Parametric resynthesis with neural vocoders
S Maiti, MI Mandel
2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019
Cited by: 18, Year: 2019
Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement
S Maiti, MI Mandel
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 17, Year: 2020
End-to-end diarization for variable number of speakers with local-global networks and discriminative speaker embeddings
S Maiti, H Erdogan, K Wilson, S Wisdom, S Watanabe, JR Hershey
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 15, Year: 2021
Speech denoising by parametric resynthesis
S Maiti, MI Mandel
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 11, Year: 2019
Predicting interaction quality in customer service dialogs
S Stoyanchev, S Maiti, S Bangalore
Advanced Social Interaction with Agents: 8th International Workshop on …, 2018
Cited by: 7, Year: 2018
Improving massively multilingual asr with auxiliary ctc objectives
W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 4, Year: 2023
TriniTTS: Pitch-controllable end-to-end TTS without external aligner
Y Ju, I Kim, H Yang, JH Kim, B Kim, S Maiti, S Watanabe
Proc. Interspeech, 16-20, 2022
Cited by: 4, Year: 2022
Large Vocabulary Concatenative Resynthesis.
S Maiti, J Ching, MI Mandel
INTERSPEECH, 1190-1194, 2018
Cited by: 4, Year: 2018
Concatenative Resynthesis Using Twin Networks.
S Maiti, MI Mandel
INTERSPEECH, 3647-3651, 2017
Cited by: 4, Year: 2017
EEND-SS: Joint end-to-end neural speaker diarization and speech separation for flexible number of speakers
S Maiti, Y Ueda, S Watanabe, C Zhang, M Yu, SX Zhang, Y Xu
2022 IEEE Spoken Language Technology Workshop (SLT), 480-487, 2023
Cited by: 3, Year: 2023
SpeechLMScore: Evaluating speech generation using speech language model
S Maiti, Y Peng, T Saeki, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 1, Year: 2023
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study
M Baali, T Hayashi, H Mubarak, S Maiti, S Watanabe, W El-Hajj, A Ali
arXiv preprint arXiv:2301.09099, 2023
Cited by: 1, Year: 2023
Speech Enhancement Using Speech Synthesis Techniques
S Maiti
City University of New York, 2021
Cited by: 1, Year: 2021
FindAdaptNet: Find and Insert Adapters by Learned Layer Importance
J Huang, K Ganesan, S Maiti, YM Kim, X Chang, P Liang, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Year: 2023
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
B Yan, J Shi, Y Tang, H Inaguma, Y Peng, S Dalmia, P Polák, ...
arXiv preprint arXiv:2304.04596, 2023
Year: 2023
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining
T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari
arXiv preprint arXiv:2301.12596, 2023
Year: 2023
Method for extracting speech from degraded signals by predicting the inputs to a speech vocoder
M Mandel, S Maiti
US Patent App. 17/441,063, 2022
Year: 2022