Nathan Lambert

Cited by

	All	Since 2019
Citations	2037	2030
h-index	22	22
i10-index	30	30

1200

600

300

900

20192020202120222023202416 44 121 200 532 1109

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Roberto CalandraProfessor, TU Dresden, Centre for Tactile Internet with Human-in-the-Loop (CeTI)Verified email at tu-dresden.de
Kristofer PISTERUC BerkeleyVerified email at berkeley.edu
Tom ZickHarvardVerified email at berkeley.edu
Daniel S. DrewUniversity of UtahVerified email at utah.edu
Thomas Krendl GilbertNew York Academy of SciencesVerified email at nyas.org
Brandon AmosMetaVerified email at fb.com
Sarah DeanCornellVerified email at cornell.edu
Luis PinedaResearch Engineer, Facebook AI ResearchVerified email at fb.com
Craig B. SchindlerUniversity of California, BerkeleyVerified email at berkeley.edu
Lydia LeeSandia National LaboratoriesVerified email at sandia.gov

Nathan Lambert

Research Scientist, Allen AI

Verified email at allenai.org - Homepage

Reinforcement Learning Machine Learning Robotics Responsible AI


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
[Github] Diffusers: State-of-the-art diffusion models P von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ... https://github.com/huggingface/diffusers, 2022	292*	2022
Zephyr: Direct distillation of lm alignment L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	245	2023
Open LLM Leaderboard E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ... URL https://huggingface. co/spaces/HuggingFaceH4/open_llm_leaderboard, 2023	196	2023
Low Level Control of a Quadrotor with Deep Model-Based Reinforcement Learning N Lambert, DS Drew, J Yaconelli, R Calandra, S Levine, KSJ Pister IEEE Robotics and Automation Letters 4 (4), 4224-4230, 2019	171	2019
[Github] Trl: Transformer reinforcement learning L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert https://github.com/lvwerra/trl, 2020	112*	2020
On the importance of hyperparameter optimization for model-based reinforcement learning B Zhang, R Rajan, L Pineda, N Lambert, A Biedenkapp, K Chua, F Hutter, ... International Conference on Artificial Intelligence and Statistics, 4015-4023, 2021	107	2021
Objective Mismatch in Model-based Reinforcement Learning N Lambert, B Amos, O Yadan, R Calandra Learning for Dynamics and Control (L4DC), 2020	94	2020
Camels in a changing climate: Enhancing lm adaptation with tulu 2 H Ivison, Y Wang, V Pyatkin, N Lambert, M Peters, P Dasigi, J Jang, ... arXiv preprint arXiv:2311.10702, 2023	88	2023
[Blog] Illustrating reinforcement learning from human feedback (RLHF) N Lambert, L Castricato, L von Werra, A Havrilla https://hf.co/blog/rlhf, 2022	87*	2022
Toward controlled flight of the ionocraft: a flying microrobot using electrohydrodynamic thrust with onboard sensing and no moving parts D Drew, N Lambert, C Schindler, K Pister IEEE Robotics and Automation Letters 3 (4), 2807-2813, 2018	76	2018
Learning generalizable locomotion skills with hierarchical reinforcement learning T Li, N Lambert, R Calandra, F Meier, A Rai IEEE International Conference on Robotics and Automation (ICRA), 413-419, 2020	48	2020
Mbrl-lib: A modular library for model-based reinforcement learning L Pineda, B Amos, A Zhang, NO Lambert, R Calandra arXiv preprint arXiv:2104.10159, 2021	45	2021
Dolma: An open corpus of three trillion tokens for language model pretraining research L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ... arXiv preprint arXiv:2402.00159, 2024	38	2024
The challenges of exploration for offline reinforcement learning N Lambert, M Wulfmeier, W Whitney, A Byravan, M Bloesch, V Dasagi, ... arXiv preprint arXiv:2201.11861, 2022	38	2022
Olmo: Accelerating the science of language models D Groeneveld, I Beltagy, P Walsh, A Bhagia, R Kinney, O Tafjord, AH Jha, ... arXiv preprint arXiv:2402.00838, 2024	35	2024
Reward reports for reinforcement learning TK Gilbert, N Lambert, S Dean, T Zick, A Snoswell, S Mehta Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 84-130, 2023	34	2023
Rewardbench: Evaluating reward models for language modeling N Lambert, V Pyatkin, J Morrison, LJ Miranda, BY Lin, K Chandu, N Dziri, ... arXiv preprint arXiv:2403.13787, 2024	32	2024
The alignment handbook L Tunstall, E Beeching, N Lambert, N Rajani, S Huang, K Rasul, AM Rush, ...	27	2023
A survey on data selection for language models A Albalak, Y Elazar, SM Xie, S Longpre, N Lambert, X Wang, ... arXiv preprint arXiv:2402.16827, 2024	24	2024
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning N Lambert, A Wilcox, H Zhang, K Pister, R Calandra IEEE Conference on Decision and Control (CDC), 2880-2887, 2021	24	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors