Mitigating covariate shift in imitation learning via offline data with partial coverage J Chang, M Uehara, D Sreenivas, R Kidambi, W Sun Advances in Neural Information Processing Systems 34, 965-979, 2021 | 100 | 2021 |
MobILE: Model-based imitation learning from observation alone R Kidambi, J Chang, W Sun Advances in Neural Information Processing Systems 34, 28598-28611, 2021 | 44 | 2021 |
Learning to generate better than your LLM JD Chang, K Brantley, R Ramamurthy, D Misra, W Sun arXiv preprint arXiv:2306.11816, 2023 | 28 | 2023 |
Dataset reset policy optimization for RLHF JD Chang, W Zhan, O Oertell, K Brantley, D Misra, JD Lee, W Sun arXiv preprint arXiv:2404.08495, 2024 | 18 | 2024 |
Learning deep parameterized skills from demonstration for re-targetable visuomotor control J Chang, N Kumar, S Hastings, A Gokaslan, D Romeres, D Jha, ... arXiv preprint arXiv:1910.10628, 2019 | 15 | 2019 |
Learning Bellman complete representations for offline policy evaluation J Chang, K Wang, N Kallus, W Sun International Conference on Machine Learning, 2938-2971, 2022 | 14 | 2022 |
REBEL: Reinforcement learning via regressing relative rewards Z Gao, JD Chang, W Zhan, O Oertell, G Swamy, K Brantley, T Joachims, ... arXiv preprint arXiv:2404.16767, 2024 | 12 | 2024 |
Using unsupervised clustering to identify pregnancy co-morbidities J Chang, IN Sarkar AMIA Summits on Translational Science Proceedings 2019, 305, 2019 | 7 | 2019 |
Critique-out-loud reward models Z Ankner, M Paul, B Cui, JD Chang, P Ammanabrolu arXiv preprint arXiv:2408.11791, 2024 | 6 | 2024 |
Using self organizing maps to compare sepsis patients from the neonatal and adult intensive care unit B Goddard, J Chang, IN Sarkar AMIA Summits on Translational Science Proceedings 2019, 127, 2019 | 3 | 2019 |
RL for consistency models: Faster reward guided text-to-image generation O Oertell, JD Chang, Y Zhang, K Brantley, W Sun arXiv preprint arXiv:2404.03673, 2024 | 2 | 2024 |
Policy-Gradient Training of Language Models for Ranking G Gao, JD Chang, C Cardie, K Brantley, T Joachims arXiv preprint arXiv:2310.04407, 2023 | 2 | 2023 |
Adversarial Imitation Learning via Boosting J Chang, D Sreenivas, Y Huang, K Brantley, W Sun International Conference on Learning Representations, 2024 | 1 | 2024 |
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF Z Gao, W Zhan, JD Chang, G Swamy, K Brantley, JD Lee, W Sun arXiv preprint arXiv:2410.04612, 2024 | | 2024 |
Mitigating covariate shift in imitation learning via offline data without great coverage JD Chang, M Uehara, D Sreenivas, R Kidambi, W Sun arXiv preprint arXiv:2106.03207, 2021 | | 2021 |