强化学习
计算机科学
钢筋
混合(物理)
内在动机
人工智能
心理学
社会心理学
量子力学
物理
作者
Wei Li,Weiyan Liu,Shitong Shao,Shiyi Huang,Aiguo Song
出处
期刊:IEEE transactions on games
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:: 1-13
被引量:1
标识
DOI:10.1109/tg.2023.3263013
摘要
Credit assignment is a critical problem in cooperative Multi-Agent Reinforcement Learning (MARL). To address this problem, current studies mainly rely on the intrinsic reward, which is directly summed with the global reward to generate a total reward. However, such kinds of intrinsic reward functions ignore the dependence among agents and inevitably limit the adaptivity and effectiveness of MARL methods. In this paper, we propose a novel method, Attention-based Intrinsic Reward Mixing Network (AIRMN), for credit assignment in MARL. Specifically, we design a new intrinsic reward network on the basis of the attention mechanism, in order to enhance the effectiveness of teamwork. Besides, we devise a new mixing network that combines the intrinsic and extrinsic rewards in a nonlinear and dynamic manner, so as to adapt the total reward to the variation of the environment. Experimental results on the battle games of StarCraft II demonstrate that AIRMN outperforms the state-of-the-art methods in terms of the average test win rate, and also validate that AIRMN can dynamically return the precise intrinsic reward to each agent based on their contributions to the team cooperation, thereby better dealing with the credit assignment problem.
科研通智能强力驱动
Strongly Powered by AbleSci AI