Sebastian Ruder
Research Scientist, Cohere
Verified email at cohere.com
Title
Cited by
Year
An overview of gradient descent optimization algorithms
S Ruder
arXiv preprint arXiv:1609.04747, 2016
Cited by: 11080; Year: 2016
Universal Language Model Fine-tuning for Text Classification
J Howard*, S Ruder*
Proceedings of ACL 2018, 2018
Cited by: 5334; Year: 2018
An Overview of Multi-Task Learning in Deep Neural Networks
S Ruder
arXiv preprint arXiv:1706.05098, 2017
Cited by: 3821; Year: 2017
Gemini: a family of highly capable multimodal models
Gemini Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 2155; Year: 2023
PaLM 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
Cited by: 1417; Year: 2023
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
S Cahyawijaya, H Lovenia, AF Aji, GI Winata, B Wilie, R Mahendra, ...
Findings of ACL 2023, 2023
Cited by: 1139; Year: 2023
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization
J Hu*, S Ruder*, A Siddhant, G Neubig, O Firat, M Johnson
Proceedings of ICML 2020, 2020
Cited by: 931; Year: 2020
A Survey of Cross-lingual Word Embedding Models
S Ruder, I Vulić, A Søgaard
Journal of Artificial Intelligence Research 65, 569-631, 2019
Cited by: 790; Year: 2019
Transfer learning in natural language processing
S Ruder, ME Peters, S Swayamdipta, T Wolf
Proceedings of the 2019 conference of the North American chapter of the …, 2019
Cited by: 788; Year: 2019
On the cross-lingual transferability of monolingual representations
M Artetxe, S Ruder, D Yogatama
Proceedings of ACL 2020, 2020
Cited by: 755; Year: 2020
Long Range Arena: A Benchmark for Efficient Transformers
Y Tay, M Dehghani, S Abnar, Y Shen, D Bahri, P Pham, J Rao, L Yang, ...
Proceedings of ICLR 2021, 2021
Cited by: 658; Year: 2021
AdapterHub: A Framework for Adapting Transformers
J Pfeiffer, A Rücklé, C Poth, A Kamath, I Vulić, S Ruder, K Cho, I Gurevych
Proceedings of EMNLP 2020: System demonstrations, 2020
Cited by: 647; Year: 2020
MAD-X: An Adapter-based Framework for Multi-task Cross-lingual Transfer
J Pfeiffer, I Vulić, I Gurevych, S Ruder
Proceedings of EMNLP 2020, 2020
Cited by: 583; Year: 2020
Latent Multi-task Architecture Learning
S Ruder, J Bingel, I Augenstein, A Søgaard
Proceedings of AAAI 2019, 2019
Cited by: 556*; Year: 2019
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks
ME Peters*, S Ruder*, NA Smith
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019
Cited by: 533; Year: 2019
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
RK Mahabadi, J Henderson, S Ruder
Proceedings of NeurIPS 2021, 2021
Cited by: 427*; Year: 2021
Neural Transfer Learning for Natural Language Processing
S Ruder
National University of Ireland, Galway, 2019
Cited by: 413; Year: 2019
A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis
S Ruder, P Ghaffari, JG Breslin
Proceedings of the 2016 Conference on Empirical Methods in Natural Language …, 2016
Cited by: 359; Year: 2016
On the Limitations of Unsupervised Bilingual Dictionary Induction
A Søgaard, S Ruder, I Vulić
Proceedings of ACL 2018, 2018
Cited by: 309; Year: 2018
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks
RK Mahabadi, S Ruder, M Dehghani, J Henderson
Proceedings of ACL 2021, 2021
Cited by: 292; Year: 2021