Doubly robust off-policy evaluation with shrinkage Y Su, M Dimakopoulou, A Krishnamurthy, M Dudík International Conference on Machine Learning, 2020, 2019 | 92 | 2019 |
Cab: Continuous adaptive blending for policy evaluation and learning Y Su, L Wang, M Santacatterina, T Joachims International Conference on Machine Learning, 6005-6014, 2019 | 79 | 2019 |
Off-policy bandits with deficient support N Sachdeva, Y Su, T Joachims Proceedings of the 26th ACM SIGKDD International Conference on Knowledge …, 2020 | 70 | 2020 |
Offline rl for natural language generation with implicit language q learning C Snell, I Kostrikov, Y Su, M Yang, S Levine arXiv preprint arXiv:2206.11871, 2022 | 63 | 2022 |
Online adaptation to label distribution shift R Wu, C Guo, Y Su, KQ Weinberger Advances in Neural Information Processing Systems 34, 11340-11351, 2021 | 48 | 2021 |
Adaptive Estimator Selection for Off-Policy Evaluation Y Su, P Srinath, A Krishnamurthy International Conference on Machine Learning, 2020, 2020 | 36 | 2020 |
Optimizing Rankings for Recommendation in Matching Markets Y Su, M Bayoumi, T Joachims Proceedings of the ACM Web Conference 2022, 328-338, 2022 | 20 | 2022 |
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems C Snell, S Yang, J Fu, Y Su, S Levine NAACL, 2022, 2022 | 19 | 2022 |
Recommendations as treatments T Joachims, B London, Y Su, A Swaminathan, L Wang AI Magazine 42 (3), 19-30, 2021 | 19 | 2021 |
Data-driven offline decision-making via invariant representation learning H Qi, Y Su, A Kumar, S Levine Advances in Neural Information Processing Systems 35, 13226-13237, 2022 | 12 | 2022 |
Learning from logged bandit feedback of multiple loggers Y Su, A Agarwal, T Joachims ICML Workshop on Machine Learning for Causal Inference, Counterfactual …, 2018 | 3 | 2018 |
Unified off-policy learning to rank: a reinforcement learning perspective Z Zhang, Y Su, H Yuan, Y Wu, R Balasubramanian, Q Wu, H Wang, ... Advances in Neural Information Processing Systems 36, 2024 | 2 | 2024 |
Value of exploration: Measurements, findings and algorithms Y Su, X Wang, EY Le, L Liu, Y Li, H Lu, B Lipshitz, S Badam, L Heldt, S Bi, ... arXiv preprint arXiv:2305.07764, 2023 | 1 | 2023 |
Long-Term Value of Exploration: Measurements, Findings and Algorithms Y Su, X Wang, EY Le, L Liu, Y Li, H Lu, B Lipshitz, S Badam, L Heldt, S Bi, ... Proceedings of the 17th ACM International Conference on Web Search and Data …, 2024 | | 2024 |
Online Feature Updates Improve Online (Generalized) Label Shift Adaptation R Wu, S Datta, Y Su, D Baby, YX Wang, KQ Weinberger arXiv preprint arXiv:2402.03545, 2024 | | 2024 |
System for effective use of data for personalization M Dudik, A Krishnamurthy, M Dimakopoulou, Y Su US Patent App. 18/368,400, 2024 | | 2024 |
System for effective use of data for personalization M Dudik, A Krishnamurthy, M Dimakopoulou, Y Su US Patent 11,798,029, 2023 | | 2023 |
Nonlinear Bandits Exploration for Recommendations Y Su, M Chen Proceedings of the 17th ACM Conference on Recommender Systems, 1054-1057, 2023 | | 2023 |
2nd Workshop on Online and Adaptive Recommender Systems (OARS) X Cui, V Dave, Y Su, K Al-Jadda, S Kumar, J McAuley, T Ye, K Aryafar, ... Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022 | | 2022 |
Off-Policy Evaluation and Learning for Interactive Systems Y Su Cornell University, 2021 | | 2021 |