Reputation-Based Interaction Promotes Cooperation With Reinforcement Learning

强化学习 计算机科学 声誉 人工智能 钢筋 工程类 社会科学 结构工程 社会学
作者
Tianyu Ren,Xiao‐Jun Zeng
出处
期刊:IEEE Transactions on Evolutionary Computation [Institute of Electrical and Electronics Engineers]
卷期号:28 (4): 1177-1188 被引量:26
标识
DOI:10.1109/tevc.2023.3304911
摘要

Dynamical interaction represents a fundamental coevolutionary rule that addresses the intricacies of cooperation in social dilemmas. It provides a normative account for the changes in ties within interaction networks in response to the behaviour of social partners. While considerable efforts have explored the role of partner selection in fostering cooperation, there remains a limited understanding of how agents learn to establish effective interaction patterns and adapt their connections accordingly. To bridge this knowledge gap, we leverage recent advancements in reinforcement learning and propose an adaptive interaction mechanism to investigate self-organization behaviour in the iterated prisoner’s dilemma game. Within this framework, artificial agents are trained using a self-regarding Roth-Erev algorithm, utilizing reputation as a dynamic signal to update their willingness to engage with neighbours. Additionally, these agents are endowed with the capability to sever inactive connections. Simulation results demonstrate the effectiveness of utilizing reinforcement learning and local information from reputation to capture the dynamics of interactions. Notably, we discover that the entangled coevolution of strategy and interaction network can facilitate the emergence and maintenance of cooperation, despite the optimal tolerance threshold for ineffective neighbours varying depending on the strength of the social dilemma. Furthermore, the emerging network topology presented in this work accurately captures the assortative mixing pattern observed in previous experiments and realistic evidence. Finally, we validate the simulation results through theoretical analysis and confirm the robustness of the proposed mechanism across populations of varying sizes and initial structures.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
刚刚
Ava应助Chen采纳,获得10
1秒前
zzj1904发布了新的文献求助10
1秒前
简单山水完成签到,获得积分10
1秒前
下载论文完成签到,获得积分10
2秒前
TigerOvO完成签到,获得积分10
3秒前
大嘻发布了新的文献求助10
4秒前
Conley发布了新的文献求助10
4秒前
无极微光应助zyqy采纳,获得20
5秒前
adydcm完成签到 ,获得积分10
7秒前
8秒前
爱笑凤凰完成签到,获得积分10
9秒前
11秒前
黎黎完成签到 ,获得积分10
11秒前
科研通AI6.4应助陈瑞鸥采纳,获得10
11秒前
小蘑菇应助大嘻采纳,获得10
11秒前
11秒前
chenjian发布了新的文献求助10
12秒前
14秒前
15秒前
15秒前
阿锐发布了新的文献求助10
16秒前
17秒前
胖瓶儿发布了新的文献求助10
17秒前
科研通AI2S应助云淡风轻采纳,获得10
18秒前
虹虹发布了新的文献求助10
19秒前
MX120251336发布了新的文献求助10
19秒前
华仔应助lili采纳,获得10
19秒前
aaaa应助科研包采纳,获得10
20秒前
20秒前
YMH完成签到,获得积分10
21秒前
hsu完成签到 ,获得积分10
22秒前
23秒前
23秒前
23秒前
超级鸡车完成签到,获得积分10
24秒前
所所应助安康采纳,获得10
25秒前
柳冷亦发布了新的文献求助10
26秒前
26秒前
高分求助中
Principles of Economics, 11th Edition 10000
Prescott's Microbiology: 2026 Release ISE 10000
University Physics with Modern Physics, 16th edition 10000
Cronologia da história de Macau 5000
Environmental Leverage in Times of Climate Crisis: Product Standards, Carbon Border Measures and Preferential Trade Agreements 1000
Interactions of Vowel Quality and Prosody in East Slavic 1000
Matrix Methods in Data Mining and Pattern Recognition 510
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 内科学 物理 复合材料 催化作用 细胞生物学 无机化学 光电子学 物理化学 电极 基因
热门帖子
关注 科研通微信公众号,转发送积分 7157994
求助须知:如何正确求助?哪些是违规求助? 8802180
关于积分的说明 18601158
捐赠科研通 6760036
什么是DOI,文献DOI怎么找? 3162161
关于科研通互助平台的介绍 2297528
邀请新用户注册赠送积分活动 2136831