Follow
Jonathan D. Chang
Jonathan D. Chang
Research Scientist, Databricks Mosaic
Verified email at cornell.edu
Title
Cited by
Cited by
Year
Mitigating covariate shift in imitation learning via offline data with partial coverage
J Chang, M Uehara, D Sreenivas, R Kidambi, W Sun
Advances in Neural Information Processing Systems 34, 965-979, 2021
1002021
Mobile: Model-based imitation learning from observation alone
R Kidambi, J Chang, W Sun
Advances in Neural Information Processing Systems 34, 28598-28611, 2021
442021
Learning to generate better than your llm
JD Chang, K Brantley, R Ramamurthy, D Misra, W Sun
arXiv preprint arXiv:2306.11816, 2023
282023
Dataset reset policy optimization for rlhf
JD Chang, W Zhan, O Oertell, K Brantley, D Misra, JD Lee, W Sun
arXiv preprint arXiv:2404.08495, 2024
182024
Learning deep parameterized skills from demonstration for re-targetable visuomotor control
J Chang, N Kumar, S Hastings, A Gokaslan, D Romeres, D Jha, ...
arXiv preprint arXiv:1910.10628, 2019
152019
Learning bellman complete representations for offline policy evaluation
J Chang, K Wang, N Kallus, W Sun
International Conference on Machine Learning, 2938-2971, 2022
142022
Rebel: Reinforcement learning via regressing relative rewards
Z Gao, JD Chang, W Zhan, O Oertell, G Swamy, K Brantley, T Joachims, ...
arXiv preprint arXiv:2404.16767, 2024
122024
Using unsupervised clustering to identify pregnancy co-morbidities
J Chang, IN Sarkar
AMIA Summits on Translational Science Proceedings 2019, 305, 2019
72019
Critique-out-loud reward models
Z Ankner, M Paul, B Cui, JD Chang, P Ammanabrolu
arXiv preprint arXiv:2408.11791, 2024
62024
Using self organizing maps to compare sepsis patients from the neonatal and adult intensive care unit
B Goddard, J Chang, IN Sarkar
AMIA Summits on Translational Science Proceedings 2019, 127, 2019
32019
Rl for consistency models: Faster reward guided text-to-image generation
O Oertell, JD Chang, Y Zhang, K Brantley, W Sun
arXiv preprint arXiv:2404.03673, 2024
22024
Policy-Gradient Training of Language Models for Ranking
G Gao, JD Chang, C Cardie, K Brantley, T Joachim
arXiv preprint arXiv:2310.04407, 2023
22023
Adversarial Imitation Learning via Boosting
J Chang, D Sreenivas, Y Huang, K Brantley, W Sun
International Conference on Learning Representations, 2024
12024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Z Gao, W Zhan, JD Chang, G Swamy, K Brantley, JD Lee, W Sun
arXiv preprint arXiv:2410.04612, 2024
2024
Mitigating covariate shift in imitation learning via offline data without great coverage
JD Chang, M Uehara, D Sreenivas, R Kidambi, W Sun
arXiv preprint arXiv:2106.03207, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–15