Follow
Yi Su
Yi Su
Google Deepmind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
1962024
Doubly robust off-policy evaluation with shrinkage
Y Su, M Dimakopoulou, A Krishnamurthy, M Dudík
International Conference on Machine Learning, 2020, 2019
982019
Cab: Continuous adaptive blending for policy evaluation and learning
Y Su, L Wang, M Santacatterina, T Joachims
International Conference on Machine Learning, 6005-6014, 2019
792019
Off-policy bandits with deficient support
N Sachdeva, Y Su, T Joachims
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge …, 2020
732020
Offline rl for natural language generation with implicit language q learning
C Snell, I Kostrikov, Y Su, M Yang, S Levine
arXiv preprint arXiv:2206.11871, 2022
672022
Online adaptation to label distribution shift
R Wu, C Guo, Y Su, KQ Weinberger
Advances in Neural Information Processing Systems 34, 11340-11351, 2021
532021
Adaptive Estimator Selection for Off-Policy Evaluation
Y Su, P Srinath, A Krishnamurthy
International Conference on Machine Learning, 2020, 2020
392020
Optimizing Rankings for Recommendation in Matching Markets
Y Su, M Bayoumi, T Joachims
Proceedings of the ACM Web Conference 2022, 328-338, 2022
222022
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
C Snell, S Yang, J Fu, Y Su, S Levine
NAACL, 2022, 2022
202022
Recommendations as treatments
T Joachims, B London, Y Su, A Swaminathan, L Wang
AI Magazine 42 (3), 19-30, 2021
192021
Data-driven offline decision-making via invariant representation learning
H Qi, Y Su, A Kumar, S Levine
Advances in Neural Information Processing Systems 35, 13226-13237, 2022
142022
Learning from logged bandit feedback of multiple loggers
Y Su, A Agarwal, T Joachims
ICML Workshop on Machine Learning for Causal Inference, Counterfactual …, 2018
32018
Unified off-policy learning to rank: a reinforcement learning perspective
Z Zhang, Y Su, H Yuan, Y Wu, R Balasubramanian, Q Wu, H Wang, ...
Advances in Neural Information Processing Systems 36, 2024
22024
International Conference on Machine Learning
Y Su, L Wang, M Santacatterina, T Joachims
22019
Long-Term Value of Exploration: Measurements, Findings and Algorithms
Y Su, X Wang, EY Le, L Liu, Y Li, H Lu, B Lipshitz, S Badam, L Heldt, S Bi, ...
Proceedings of the 17th ACM International Conference on Web Search and Data …, 2024
12024
Value of exploration: Measurements, findings and algorithms
Y Su, X Wang, EY Le, L Liu, Y Li, H Lu, B Lipshitz, S Badam, L Heldt, S Bi, ...
arXiv preprint arXiv:2305.07764, 2023
12023
Online Feature Updates Improve Online (Generalized) Label Shift Adaptation
R Wu, S Datta, Y Su, D Baby, YX Wang, KQ Weinberger
arXiv preprint arXiv:2402.03545, 2024
2024
System for effective use of data for personalization
M Dudik, A Krishnamurthy, M Dimakopoulou, Y Su
US Patent App. 18/368,400, 2024
2024
System for effective use of data for personalization
M Dudik, A Krishnamurthy, M Dimakopoulou, Y Su
US Patent 11,798,029, 2023
2023
Nonlinear Bandits Exploration for Recommendations
Y Su, M Chen
Proceedings of the 17th ACM Conference on Recommender Systems, 1054-1057, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–20