Siddharth Dalmia
Siddharth Dalmia
Research Scientist, Google Research
Verified email at - Homepage
Cited by
Cited by
Epitran: Precision G2P for Many Languages
DR Mortensen, S Dalmia, P Littell
LREC 2018, 2018
Sequence-based Multi-lingual Low Resource Speech Recognition
S Dalmia, R Sanabria, F Metze, AW Black
ICASSP 2018, 2018
Universal phone recognition with a multilingual allophone system
X Li, S Dalmia, J Li, M Lee, P Littell, J Yao, A Anastasopoulos, ...
ICASSP 2020, 2020
Robust ASR using neural network based speech enhancement and feature simulation
S Sivasankaran, AA Nugraha, E Vincent, JA Morales-Cordovilla, S Dalmia, ...
ASRU 2015, 2015
Towards Zero-shot Learning for Automatic Phonemic Transcription
X Li, S Dalmia, DR Mortensen, J Li, AW Black, F Metze
AAAI 2020, 2020
An approach for self-training audio event detectors using web data
B Elizalde, A Shah, S Dalmia, MH Lee, R Badlani, A Kumar, B Raj, I Lane
EUSIPCO 2017, 2017
Espnet-slu: Advancing spoken language understanding through espnet
S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...
ICASSP 2022, 7167-7171, 2022
On Long-Tailed Phenomena in Neural Machine Translation
V Raunak, S Dalmia, V Gupta, F Metze
EMNLP 2020 Findings, 2020
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
S Kim, S Dalmia, F Metze
ACL 2019, 2019
Transformer-Transducers for Code-Switched Speech Recognition
S Dalmia, Y Liu, S Ronanki, K Kirchhoff
ICASSP 2021, 2021
NoiseQA: Challenge set evaluation for user-centric question answering
A Ravichander, S Dalmia, M Ryskina, F Metze, E Hovy, AW Black
EACL 2021, 2021
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
S Dalmia, B Yan, V Raunak, F Metze, S Watanabe
NAACL 2021, arXiv: 2105.00573, 2021
Domain Robust Feature Extraction for Rapid Low Resource ASR Development
S Dalmia, X Li, F Metze, AW Black
SLT 2018, 2018
Multilingual Speech Recognition with Corpus Relatedness Sampling
X Li, S Dalmia, AW Black, F Metze
InterSpeech 2019, 2019
Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding
Y Peng, S Dalmia, I Lane, S Watanabe
ICML 2022, 17627-17643, 2022
Cross-Attention End-to-End ASR for Two-Party Conversations
S Kim, S Dalmia, F Metze
InterSpeech 2019, 2019
ESPnet-ST IWSLT 2021 offline speech translation system
H Inaguma, B Yan, S Dalmia, P Guo, J Shi, K Duh, S Watanabe
IWSLT 2021, 2021
Enforcing encoder-decoder modularity in sequence-to-sequence models
S Dalmia, A Mohamed, M Lewis, F Metze, L Zettlemoyer
arXiv preprint arXiv:1911.03782, 2019
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
A Conneau, M Ma, S Khanuja, Y Zhang, V Axelrod, S Dalmia, J Riesa, ...
SLT 2022, 2022
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding
S Arora, A Ostapenko, V Viswanathan, S Dalmia, F Metze, S Watanabe, ...
InterSpeech 2021, 2021
The system can't perform the operation now. Try again later.
Articles 1–20