Title | Authors | Venue | Cited by | Year |
Batch policy learning under constraints | H Le, C Voloshin, Y Yue | International Conference on Machine Learning, 3703-3712, 2019 | 364 | 2019 |
Empirical study of off-policy policy evaluation for reinforcement learning | C Voloshin, HM Le, N Jiang, Y Yue | arXiv preprint arXiv:1911.06854, 2019 | 162 | 2019 |
Policy Optimization with Linear Temporal Logic Constraints | C Voloshin, H Le, S Chaudhuri, Y Yue | Advances in Neural Information Processing Systems 35, 17690-17702, 2022 | 21 | 2022 |
Minimax model learning | C Voloshin, N Jiang, Y Yue | International Conference on Artificial Intelligence and Statistics, 1612-1620, 2021 | 17 | 2021 |
Eventual Discounting Temporal Logic Counterfactual Experience Replay | C Voloshin, A Verma, Y Yue | arXiv preprint arXiv:2303.02135, 2023 | 8 | 2023 |
Empirical analysis of off-policy policy evaluation for reinforcement learning | C Voloshin, HM Le, Y Yue | Real-world Sequential Decision Making Workshop at ICML 2019, 2019 | 7 | 2019 |