Provably Efficient CVaR RL in Low-rank MDPs | Y Zhao, W Zhan, X Hu, H Leung, F Farnia, W Sun, JD Lee | arXiv preprint arXiv:2311.11965, 2023 | 2 | 2023 |
A Tighter Problem-Dependent Regret Bound for Risk-Sensitive Reinforcement Learning | X Hu, HF Leung | International Conference on Artificial Intelligence and Statistics, 5411-5437, 2023 | 1 | 2023 |
Provably (More) Sample-Efficient Offline RL with Options | X Hu, H Leung | Advances in Neural Information Processing Systems 36, 2024 | | 2024 |
An Information Theoretic Approach to Interaction-Grounded Learning | X Hu, F Farnia, H Leung | arXiv preprint arXiv:2401.05015, 2024 | | 2024 |
Provably Efficient Offline RL with Options | X Hu, H Leung | Proceedings of the 2023 International Conference on Autonomous Agents and …, 2023 | | 2023 |
A Fast-Convergence Method of Monte Carlo Counterfactual Regret Minimization for Imperfect Information Dynamic Games | X Hu, L Xia, J Yang, Q Zhao | 2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS …, 2020 | | 2020 |