Reward shaping based federated reinforcement learning Y Hu, Y Hua, W Liu, J Zhu IEEE Access 9, 67259-67267, 2021 | 17 | 2021 |
HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem Y Hua, X Wang, B Jin, W Li, J Yan, X He, H Zha Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021 | 9* | 2021 |
VMAgent: A Practical Virtual Machine Scheduling Platform. J Sheng, S Cai, H Cui, W Li, Y Hua, B Jin, W Zhou, Y Hu, L Zhu, Q Peng, ... IJCAI, 5944-5947, 2022 | 7* | 2022 |
Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning W Li, X Wang, B Jin, J Sheng, Y Hua, H Zha Proceedings of the 20th International Conference on Autonomous Agents and …, 2021 | 7 | 2021 |
Learning Optimal" Pigovian Tax" in Sequential Social Dilemmas Y Hua, S Gao, W Li, B Jin, X Wang, H Zha arXiv preprint arXiv:2305.06227, 2023 | 5 | 2023 |
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym J Sheng, Z Huang, C Shen, W Li, Y Hua, B Jin, H Zha, X Wang arXiv preprint arXiv:2312.03290, 2023 | 2 | 2023 |
面向城市交通信号优化的多智能体强化学习综述. 华贇, 王祥丰, 金博 Operations Research Transactions/Yunchouxue Xuebao 27 (2), 2023 | 2 | 2023 |
Learning Optimal “Pigovian Tax” in Sequential Social Dilemmas Yun Hua, Shang Gao, Wenhao Li, Bo Jin, Xiangfeng Wang, Hongyuan Zha AAMAS 2023, 2023 | | 2023 |
Sequential Viewpoint Selection and Grasping with Partial Observability Reinforcement Learning W Chen, Y Hua, B Jin, J Zhu, Q Ge, X Wang 2022 37th Youth Academic Annual Conference of Chinese Association of …, 2022 | | 2022 |
Reward Translation via Reward Machine in Semi-Alignable MDPs Y Hua, W Li, B Jin, B Wang, X He, H Zha, X Wang | | |
Can Language Agents Approach the Performance of RL? An Empirical Study On OpenAI Gym J Sheng, Z Huang, C Shen, W Li, Y Hua, B Jin, H Zha, X Wang | | |