后悔
计算机科学
人口
任务(项目管理)
缩小
微分方程
数学优化
多智能体系统
动力学(音乐)
人工智能
应用数学
机器学习
数学
物理
数学分析
人口学
管理
社会学
声学
经济
作者
Zhen Wang,Chunjiang Mu,Shuyue Hu,Chen Chu,Xuelong Li
标识
DOI:10.24963/ijcai.2022/76
摘要
Understanding the learning dynamics in multiagent systems is an important and challenging task. Past research on multi-agent learning mostly focuses on two-agent settings. In this paper, we consider the scenario in which a population of infinitely many agents apply regret minimization in repeated symmetric games. We propose a new formal model based on the master equation approach in statistical physics to describe the evolutionary dynamics in the agent population. Our model takes the form of a partial differential equation, which describes how the probability distribution of regret evolves over time. Through experiments, we show that our theoretical results are consistent with the agent-based simulation results.
科研通智能强力驱动
Strongly Powered by AbleSci AI