Follow
Shreyas Chaudhari
Title
Cited by
Cited by
Year
Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers
B Eysenbach, S Chaudhari, S Asawa, S Levine, R Salakhutdinov
International Conference on Learning Representations (ICLR), 2021, 2021
902021
Multi-armed bandits with correlated arms
S Gupta, S Chaudhari, G Joshi, O Yağan
IEEE Transactions on Information Theory, 2021, 2021
632021
A unified approach to translate classical bandit algorithms to the structured bandit setting
S Gupta, S Chaudhari, S Mukherjee, G Joshi, O Yağan
IEEE Journal on Selected Areas in Information Theory 1 (3), 840-853, 2020
41*2020
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
S Chaudhari, P Aggarwal, V Murahari, T Rajpurohit, A Kalyan, ...
arXiv preprint arXiv:2404.08555, 2024
192024
Personagym: Evaluating persona agents and llms
V Samuel, HP Zou, Y Zhou, S Chaudhari, A Kalyan, T Rajpurohit, ...
arXiv preprint arXiv:2407.18416, 2024
52024
From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries
H Wadhwa, R Seetharaman, S Aggarwal, R Ghosh, S Basu, S Srinivasan, ...
arXiv preprint arXiv:2406.12824, 2024
52024
RAGs to Style: Personalizing LLMs with Style Embeddings
A Neelakanteswara, S Chaudhari, H Zamani
Proceedings of the 1st Workshop on Personalization of Generative AI Systems …, 2024
32024
Energy-delay-distortion problem
R Vaze, S Chaudhari, A Choube, N Aggarwal
2018 Twenty Fourth National Conference on Communications (NCC), 1-6, 2018
12018
Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation
S Chaudhari, A Deshpande, BC da Silva, PS Thomas
Advances in Neural Information Processing Systems 38, 2024
2024
From Past to Future: Rethinking Eligibility Traces
D Gupta, SM Jordan, S Chaudhari, B Liu, PS Thomas, BC da Silva
Proceedings of the AAAI Conference on Artificial Intelligence 38 (11), 12253 …, 2024
2024
Learning Models and Evaluating Policies with Offline Off-Policy Data under Partial Observability
S Chaudhari, PS Thomas, BC da Silva
NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in …, 2023
2023
Distributional Off-Policy Evaluation for Slate Recommendations
S Chaudhari, D Arbour, G Theocharous, N Vlassis
Proceedings of the AAAI Conference on Artificial Intelligence, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–12