Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021 | 228* | 2021 |
Flamingo: a visual language model for few-shot learning JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ... arXiv preprint arXiv:2204.14198, 2022 | 199 | 2022 |
Perceiver io: A general architecture for structured inputs & outputs A Jaegle, S Borgeaud, JB Alayrac, C Doersch, C Ionescu, D Ding, ... arXiv preprint arXiv:2107.14795, 2021 | 174 | 2021 |
OpenSpiel: A framework for reinforcement learning in games M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019 | 161 | 2019 |
Training compute-optimal large language models J Hoffmann, S Borgeaud, A Mensch, E Buchatskaya, T Cai, E Rutherford, ... arXiv preprint arXiv:2203.15556, 2022 | 160* | 2022 |
Improving language models by retrieving from trillions of tokens S Borgeaud, A Mensch, J Hoffmann, T Cai, E Rutherford, K Millican, ... arXiv preprint arXiv:2112.04426, 2021 | 144 | 2021 |
Unsupervised learning of object keypoints for perception and control TD Kulkarni, A Gupta, C Ionescu, S Borgeaud, M Reynolds, A Zisserman, ... Advances in neural information processing systems 32, 2019 | 137 | 2019 |
Emergent abilities of large language models J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ... arXiv preprint arXiv:2206.07682, 2022 | 76* | 2022 |
General-purpose, long-context autoregressive modeling with perceiver ar C Hawthorne, A Jaegle, C Cangea, S Borgeaud, C Nash, M Malinowski, ... International Conference on Machine Learning, 8535-8558, 2022 | 11 | 2022 |
Spriteworld: A flexible, configurable reinforcement learning environment N Watters, L Matthey, S Borgeaud, R Kabra, A Lerchner | 7 | 2019 |
Leveraging Sentence Similarity in Natural Language Generation: Improving Beam Search using Range Voting S Borgeaud, G Emerson arXiv preprint arXiv:1908.06288, 2019 | 6 | 2019 |
Human-agent cooperation in bridge bidding E Lockhart, N Burch, N Bard, S Borgeaud, T Eccles, L Smaira, R Smith arXiv preprint arXiv:2011.14124, 2020 | 5 | 2020 |
Unified scaling laws for routed language models A Clark, D De Las Casas, A Guy, A Mensch, M Paganini, J Hoffmann, ... International Conference on Machine Learning, 4057-4086, 2022 | 4 | 2022 |
Emergent abilities of large language models B Zoph, C Raffel, D Schuurmans, D Yogatama, D Zhou, D Metzler, EH Chi, ... | | 2022 |