Duration modeling of neural tts for automatic dubbing J Effendi, Y Virkar, R Barra-Chicote, M Federico ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 16 | 2022 |
End-to-end image-to-speech generation for untranscribed unknown languages J Effendi, S Sakti, S Nakamura IEEE Access 9, 55144-55154, 2021 | 16 | 2021 |
Neural Speech Completion. K Tsunematsu, J Effendi, S Sakti, S Nakamura INTERSPEECH, 2742-2746, 2020 | 5 | 2020 |
Listening while speaking and visualizing: Improving ASR through multimodal chain J Effendi, A Tjandra, S Sakti, S Nakamura 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 5 | 2019 |
A two-stage emotion detection on Indonesian tweets JE The, AF Wicaksono, M Adriani 2015 International Conference on Advanced Computer Science and Information …, 2015 | 5 | 2015 |
Augmenting images for asr and tts through single-loop and dual-loop multimodal chain framework J Effendi, A Tjandra, S Sakti, S Nakamura arXiv preprint arXiv:2011.02099, 2020 | 3 | 2020 |
Rakutenai-7b: Extending large language models for japanese A Levine, C Huang, C Wang, E Batista, E Szymanska, H Ding, HW Chou, ... arXiv e-prints, arXiv: 2403.15484, 2024 | 2 | 2024 |
Multimodal chain: Cross-modal collaboration through listening, speaking, and visualizing J Effendi, A Tjandra, S Sakti, S Nakamura IEEE Access 9, 70286-70299, 2021 | 2 | 2021 |
Leveraging neural caption translation with visually grounded paraphrase augmentation J Effendi, S Sakti, K Sudoh, S Nakamura IEICE TRANSACTIONS on Information and Systems 103 (3), 674-683, 2020 | 2 | 2020 |
Multi-paraphrase Augmentation to Leverage Neural Caption Translation J Effendi, S Sakti, K Sudoh, S Nakamura Proceedings of the 15th International Conference on Spoken Language …, 2018 | 2 | 2018 |
Benefiting from Language Similarity in the Multilingual MT Training: Case Study of Indonesian and Malaysian A Poncelas, J Effendi Proceedings of the Fifth Workshop on Technologies for Machine Translation of …, 2022 | 1 | 2022 |
From speech chain to multimodal chain: Leveraging cross-modal data augmentation for semi-supervised learning J Effendi, A Tjandra, S Sakti, S Nakamura CoRR, abs/1906.00579, 2019 | 1 | 2019 |
Creation of a multi-paraphrase corpus based on various elementary operations J Effendi, S Sakti, S Nakamura 2017 20th Conference of the Oriental Chapter of the International …, 2017 | 1 | 2017 |
RakutenAI-7B: Extending Large Language Models for Japanese R Group, A Levine, C Huang, C Wang, E Batista, E Szymanska, H Ding, ... arXiv preprint arXiv:2403.15484, 2024 | | 2024 |
Rakuten’s Participation in WAT 2022: Parallel Dataset Filtering by Leveraging Vocabulary Heterogeneity A Poncelas, J Effendi, O Htun, S Yadav, D Wang, S Jain Proceedings of the 9th Workshop on Asian Translation, 68-72, 2022 | | 2022 |
Weakly-Supervised Speech-to-Text Mapping with Visually Connected Non-Parallel Speech-Text Data Using Cyclic Partially-Aligned Transformer. J Effendi, S Sakti, S Nakamura Interspeech, 2257-2261, 2021 | | 2021 |
Enhancing Neural Machine Translation with Image-based Paraphrase Augmentation J Effendi, S Sakti, K Sudoh, S Nakamura | | 2019 |
Corpus Construction and Semantic Analysis of Indonesian Image Description. K Nur'Aini, J Effendi, S Sakti, M Adriani, S Nakamura SLTU, 42-46, 2018 | | 2018 |
Improving ASR with Multimodal Machine Chain J Effendi, A Tjandra, S Sakti, S Nakamura | | |