Ramon Sanabria
Verified email at ed.ac.uk
Title
Cited by
Year
How2: A Large-scale Dataset for Multimodal Language Understanding
R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ...
NIPS 2018 Workshop, 2018
Cited by: 191, Year: 2018
Sequence-based Multi-lingual Low Resource Speech Recognition
S Dalmia, R Sanabria, F Metze, AW Black
IEEE ICASSP 2018, 2018
Cited by: 90, Year: 2018
The IWSLT 2019 evaluation campaign
J Niehues, R Cattoni, S Stüker, M Negri, M Turchi, E Salesky, R Sanabria, ...
Proceedings of the 16th International Workshop on Spoken Language …, 2019
Cited by: 87, Year: 2019
Hierarchical Multi Task Learning With CTC
R Sanabria, F Metze
IEEE SLT 2018, 2018
Cited by: 59, Year: 2018
Comparison of Decoding Strategies for CTC Acoustic Models
T Zenkel, R Sanabria, F Metze, J Niehues, M Sperber, S Stüker, A Waibel
INTERSPEECH 2017, 2017
Cited by: 49, Year: 2017
End-to-End Multimodal Speech Recognition
S Palaskar, R Sanabria, F Metze
IEEE ICASSP 2018, 2018
Cited by: 44, Year: 2018
CMU Sinbad's Submission for the DSTC7 AVSD Challenge
R Sanabria, S Palaskar, F Metze
DSTC7, AAAI 2019 6, 2019
Cited by: 36, Year: 2019
Subword and Crossword Units for CTC Acoustic Models
T Zenkel, R Sanabria, F Metze, A Waibel
INTERSPEECH 2018, 2017
Cited by: 33, Year: 2017
Multimodal Grounding for Sequence-to-sequence Speech Recognition
O Caglayan, R Sanabria, S Palaskar, L Barrault, F Metze
IEEE ICASSP 2019, 2019
Cited by: 32, Year: 2019
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval
R Sanabria, A Waters, J Baldridge
Interspeech 2021, 2021
Cited by: 13, Year: 2021
Looking Enhances Listening: Recovering Missing Speech Using Images
T Srinivasan, R Sanabria, F Metze
IEEE ICASSP 2020, 2020
Cited by: 13, Year: 2020
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions
T Srinivasan, R Sanabria, F Metze
ICML 2019 Workshop, 2019
Cited by: 13, Year: 2019
Multimodal Speech Recognition with Unstructured Audio Masking
T Srinivasan, R Sanabria, F Metze, D Elliott
EMNLP 2020 Workshop, 2020
Cited by: 11, Year: 2020
Fine-Grained Grounding for Multimodal Speech Recognition
T Srinivasan, R Sanabria, F Metze, D Elliott
Findings of EMNLP 2020, 2020
Cited by: 10, Year: 2020
Transfer Learning for Multimodal Dialog
S Palaskar, R Sanabria, F Metze
Computer Speech & Language 64, 101093, 2020
Cited by: 8, Year: 2020
Robust end-to-end deep audiovisual speech recognition
R Sanabria, F Metze, F De La Torre
arXiv preprint arXiv:1611.06986, 2016
Cited by: 8, Year: 2016
Measuring the impact of individual domain factors in self-supervised pre-training
R Sanabria, WN Hsu, A Baevski, M Auli
arXiv preprint arXiv:2203.00648, 2022
Cited by: 5, Year: 2022
Grounded sequence to sequence transduction
L Specia, L Barrault, O Caglayan, A Duarte, D Elliott, S Gella, ...
IEEE Journal of Selected Topics in Signal Processing 14 (3), 577-591, 2020
Cited by: 5, Year: 2020
Grounding Object Detections With Transcriptions
Y Moriya, R Sanabria, F Metze, GJF Jones
ICML 2019 Workshop, 2019
Cited by: 5, Year: 2019
On the Difficulty of Segmenting Words with Attention
R Sanabria, H Tang, S Goldwater
EMNLP 2021 Workshop, 2021
Cited by: 4, Year: 2021