Follow
Baoxiang Wang
Baoxiang Wang
Assistant Professor, The Chinese University of Hong Kong Shenzhen
Verified email at cse.cuhk.edu.hk - Homepage
Title
Cited by
Cited by
Year
Contextual combinatorial cascading bandits
S Li, B Wang, S Zhang, W Chen
International conference on machine learning, 1245-1253, 2016
1342016
Privacy-preserving q-learning with functional noise in continuous spaces
B Wang, N Hegde
Advances in Neural Information Processing Systems 32, 2019
492019
Paid: Prioritizing app issues for developers by tracking user reviews over versions
C Gao, B Wang, P He, J Zhu, Y Zhou, MR Lyu
2015 IEEE 26th international symposium on software reliability engineering …, 2015
422015
Shapley counterfactual credits for multi-agent reinforcement learning
J Li, K Kuang, B Wang, F Liu, L Chen, F Wu, J Xiao
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021
362021
Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
K Young, B Wang, ME Taylor
International Joint Conference on Artificial Intelligence (IJCAI) 2019, 2018
26*2018
Multilinear extension of -submodular functions
B Wang, H Zhou
arXiv preprint arXiv:2107.07103, 2021
142021
Beyond winning and losing: modeling human motivations and behaviors using inverse reinforcement learning
B Wang, T Sun, SX Zheng
Artificial Intelligence and Interactive Digital Entertainment (AIIDE) 2019., 2018
13*2018
Deconfounded value decomposition for multi-agent reinforcement learning
J Li, K Kuang, B Wang, F Liu, L Chen, C Fan, F Wu, J Xiao
International Conference on Machine Learning, 12843-12856, 2022
92022
Improved regret bounds for linear adversarial mdps via linear optimization
F Kong, X Zhang, B Wang, S Li
arXiv preprint arXiv:2302.06834, 2023
52023
Online policy optimization for robust MDP
J Dong, J Li, B Wang, J Zhang
arXiv preprint arXiv:2209.13841, 2022
52022
Combinatorial bandits under strategic manipulations
J Dong, K Li, S Li, B Wang
Proceedings of the Fifteenth ACM International Conference on Web Search and …, 2022
52022
Policy optimization with second-order advantage information
J Li, B Wang
International Joint Conference on Artificial Intelligence (IJCAI) 2018 …, 2018
42018
Learning from good trajectories in offline multi-agent reinforcement learning
Q Tian, K Kuang, F Liu, B Wang
Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11672 …, 2023
32023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
W Li, D Qiao, B Wang, X Wang, B Jin, H Zha
arXiv preprint arXiv:2305.10865, 2023
32023
Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition
C Zhao, R Yang, B Wang, S Li
The Eleventh International Conference on Learning Representations, 2022
32022
Learning Fair Representations via Distance Correlation Minimization
D Guo, C Wang, B Wang, H Zha
IEEE Transactions on Neural Networks and Learning Systems, 2022
32022
Algorithms and theory for supervised gradual domain adaptation
J Dong, S Zhou, B Wang, H Zhao
arXiv preprint arXiv:2204.11644, 2022
32022
Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation
J Dong, L Shen, Y Xu, B Wang
arXiv preprint arXiv:2202.13863, 2022
22022
Private Q-Learning with Functional Noise in Continuous Spaces
B Wang, N Hegde
The Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2019
22019
Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback
C Zhao, R Yang, B Wang, X Zhang, S Li
arXiv preprint arXiv:2311.07876, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–20