Zongzhang Zhang

Cited by

	All	Since 2019
Citations	1231	1123
h-index	17	16
i10-index	30	27

300

150

225

2013201420152016201720182019202020212022202320247 9 8 23 6 46 86 155 183 258 300 141

Public access

View all

40 articles

1 article

available

not available

Based on funding mandates

Co-authors

Yan Zheng (郑岩)Tianjin UniversityVerified email at tju.edu.cn
Yingfeng Chen(陈赢峰)Fuxi AI Lab in NeteaseVerified email at mail.ustc.edu.cn
Tianpei YangUniversity of AlbertaVerified email at ualberta.ca
Wulong LiuHuawei Noah's Ark LabVerified email at huawei.com
Mykel J. KochenderferAssociate Professor, Stanford UniversityVerified email at stanford.edu
David HsuProfessor of Computer Science, National University of SingaporeVerified email at comp.nus.edu.sg
Wee Sun LeeProfessor, Department of Computer Science, National University of SingaporeVerified email at comp.nus.edu.sg
Aijun BaiGoogle ResearchVerified email at google.com
Yuzheng ZhuangSenior Researcher @ Huawei Noah's Ark LabVerified email at huawei.com
Feng WuAssociate Professor, University of Science and Technology of ChinaVerified email at ustc.edu.cn
Michael LittmanBrown UniversityVerified email at brown.edu
Zhan Wei LimNational University of SingaporeVerified email at comp.nus.edu.sg
Jianye HaoTianjin University

Zongzhang Zhang

Nanjing University

Verified email at nju.edu.cn - Homepage

Artificial Intelligence Reinforcement Learning Probabilistic Planning Imitation Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A survey on deep reinforcement learning Q Liu, JW Zhai, ZZ Zhang, S Zhong, Q Zhou, P Zhang, J Xu Chinese Journal of Computers 41 (1), 1-27, 2018	182	2018
深度强化学习综述刘全，翟建伟，章宗长，钟珊，周倩，章鹏，徐进计算机学报 41 (1), 1-27, 2018	103	2018
Weighted double Q-learning Z Zhang, Z Pan, MJ Kochenderfer IJCAI-2017, 3455-3461, 2017	98	2017
A deep Bayesian policy reuse approach against non-stationary agents Y Zheng, Z Meng, J Hao, Z Zhang, T Yang, C Fan NeurIPS-2018, 954-964, 2018	82	2018
Hierarchical deep multiagent reinforcement learning with temporal abstraction H Tang, J Hao, T Lv, Y Chen, Z Zhang, H Jia, C Ren, Y Zheng, Z Meng, ... arXiv preprint arXiv:1809.09332, 2018	73	2018
A survey on deep reinforcement learning L Quan, Z Jianwei, Z Zongchang, Z Shan, Z Qian Chinese Journal of Computers 41 (01), 1-27, 2018	53	2018
Weighted double deep multiagent reinforcement learning in stochastic cooperative environments Y Zheng, Z Meng, J Hao, Z Zhang PRICAI-2018, 421-429, 2018	45	2018
Multi-Agent Incentive Communication via Decentralized Teammate Modeling L Yuan, J Wang, F Zhang, C Wang, Z Zhang, Y Yu, C Zhang AAAI-2022, 9466-9474, 2022	33	2022
Deep Q-learning with prioritized sampling J Zhai, Q Liu, Z Zhang, S Zhong, H Zhu, P Zhang, C Sun ICONIP-2016, 13-22, 2016	33	2016
Efficient deep reinforcement learning via adaptive policy transfer T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, W Liu, ... IJCAI-2020, 3094-3100, 2020	31	2020
Triple-GAIL: A multi-modal imitation learning framework with generative adversarial Nets C Fei, B Wang, Y Zhuang, Z Zhang, J Hao, H Zhang, X Ji, W Liu IJCAI-2020, 2929-2935, 2020	28	2020
Thompson sampling based Monte-Carlo planning in POMDPs A Bai, F Wu, Z Zhang, X Chen ICAPS-2014, 28-36, 2014	25	2014
Covering number for efficient heuristic-based POMDP planning Z Zhang, D Hsu, WS Lee ICML-2014, 28-36, 2014	25	2014
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy FM Luo, S Jiang, Y Yu, Z Zhang, YF Zhang AAAI-2022, 7637-7646, 2022	22	2022
Covering number as a complexity measure for POMDP planning and learning Z Zhang, M Littman, X Chen AAAI-2012, 1853-1859, 2012	21	2012
Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data F Zhang, C Jia, YC Li, L Yuan, Y Yu, Z Zhang ICLR-2023, 2023	19	2023
Multi-agent Dynamic Algorithm Configuration K Xue, J Xu, L Yuan, M Li, C Qian, Z Zhang, Y Yu NeurIPS-2022, 20147-20161, 2022	19	2022
Efficient Multi-agent Communication via Self-supervised Information Aggregation C Guan, F Chen, L Yuan, C Wang, H Yin, Z Zhang, Y Yu NeurIPS-2022, 1020-1033, 2022	17	2022
Efficient policy detecting and reusing for non-stationarity in markov games Y Zheng, J Hao, Z Zhang, Z Meng, T Yang, Y Li, C Fan Autonomous Agents and Multi-Agent Systems 35, 1-29, 2021	16	2021
Adaptive Online Packing-guided Search for POMDPs C Wu, G Yang, Z Zhang, Y Yu, D Li, W Liu NeurIPS-2021, 28419-28430, 2021	14	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors