Follow
Runlong Zhou
Runlong Zhou
Paul G. Allen School of Computer Science & Engineering, University of Washington
Verified email at cs.washington.edu - Homepage
Title
Cited by
Cited by
Year
Stochastic shortest path: Minimax, parameter-free and towards horizon-free regret
J Tarbouriech, R Zhou, SS Du, M Pirotta, M Valko, A Lazaric
Advances in neural information processing systems 34, 6843-6855, 2021
352021
Sharp variance-dependent bounds in reinforcement learning: Best of both worlds in stochastic and deterministic environments
R Zhou, Z Zhang, SS Du
International Conference on Machine Learning, 42878-42914, 2023
102023
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes
R Zhou, R Wang, SS Du
International Conference on Machine Learning, 42698-42723, 2023
8*2023
Understanding curriculum learning in policy optimization for online combinatorial optimization
R Zhou, Z He, Y Tian, Y Wu, SS Du
Transactions on Machine Learning Research, 2023
5*2023
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
Z Zhou, C Zhu, R Zhou, Q Cui, A Gupta, SS Du
arXiv preprint arXiv:2310.19308, 2023
32023
Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques
N Zhang, X Wang, Q Cui, R Zhou, SM Kakade, SS Du
arXiv preprint arXiv:2409.00717, 2024
2024
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
R Zhou, SS Du, B Li
Annual Meeting of the Association for Computational Linguistics (ACL) 2024 …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–7