Reinforcement learning
Computer science
Modular design
Software deployment
Convergence
Artificial intelligence
Theory (learning stability)
Incentive
Distributed computing
Resource allocation
Resource
Decentralized system
Shared resource
Adaptive system
Multi-agent system
Machine learning
Training
Resource management (computing)
Protocol (science)
Peer learning
Identifier
DOI:10.1142/s021800142551022x
Abstract
This paper proposes a modular multi-agent reinforcement learning framework that integrates Centralized Training with Decentralized Execution (CTDE), attention-based communication, and adaptive reward shaping. Built upon an extended Soft Actor–Critic algorithm, the system enables decentralized agents to learn robust policies under partial observability. A shared critic computes value estimates from the full global state during training, while decentralized actors condition their policies on peer messages selectively aggregated by a task-driven attention mechanism. Adaptive reward shaping dynamically aligns agent incentives with global objectives, accelerating convergence. The system is evaluated on three benchmarks: Multi-Agent Particle Environment (MPE), StarCraft II Micromanagement Challenge (SMAC), and a custom Resource Allocation Simulator (RAS). Compared to MAPPO, MADDPG, and ISAC baselines, the proposed method improves average episodic reward by 15–25%, reduces the number of steps to convergence by up to 40%, and significantly improves coordination scores. Results also show superior stability across random seeds and reduced wall-clock training time, highlighting the method's effectiveness for real-world deployment in dynamic multi-agent settings.
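The abstract does not include implementation details, so the following is only a minimal PyTorch sketch of how the described CTDE structure could be organized: each decentralized actor aggregates peer messages with scaled dot-product attention before choosing an action, and a shared critic scores the global state together with the joint action during training. All class names, layer sizes, and the module layout are illustrative assumptions, not the authors' code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class AttentionCommActor(nn.Module):
    """Decentralized actor: aggregates peer messages via attention (illustrative sketch)."""

    def __init__(self, obs_dim, msg_dim, act_dim, hidden_dim=64):
        super().__init__()
        self.query = nn.Linear(obs_dim, hidden_dim)   # query from the agent's own observation
        self.key = nn.Linear(msg_dim, hidden_dim)     # keys from incoming peer messages
        self.value = nn.Linear(msg_dim, hidden_dim)   # values from incoming peer messages
        self.policy = nn.Sequential(
            nn.Linear(obs_dim + hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, act_dim),           # logits (Gaussian params for continuous SAC)
        )

    def forward(self, obs, peer_msgs):
        # obs: (batch, obs_dim); peer_msgs: (batch, n_peers, msg_dim)
        q = self.query(obs).unsqueeze(1)                          # (batch, 1, hidden)
        k = self.key(peer_msgs)                                   # (batch, n_peers, hidden)
        v = self.value(peer_msgs)                                 # (batch, n_peers, hidden)
        scores = (q @ k.transpose(1, 2)) / k.shape[-1] ** 0.5     # scaled dot-product scores
        attn = F.softmax(scores, dim=-1)                          # attention over peers
        context = (attn @ v).squeeze(1)                           # aggregated peer information
        logits = self.policy(torch.cat([obs, context], dim=-1))
        return logits, attn


class CentralizedCritic(nn.Module):
    """Shared critic: sees the full global state and joint action, used only during training."""

    def __init__(self, state_dim, joint_act_dim, hidden_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + joint_act_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, global_state, joint_action):
        return self.net(torch.cat([global_state, joint_action], dim=-1))


if __name__ == "__main__":
    # Toy usage: one agent's actor with messages from three peers, plus the shared critic.
    actor = AttentionCommActor(obs_dim=10, msg_dim=8, act_dim=5)
    critic = CentralizedCritic(state_dim=20, joint_act_dim=10)
    logits, attn_weights = actor(torch.randn(4, 10), torch.randn(4, 3, 8))
    q_value = critic(torch.randn(4, 20), torch.randn(4, 10))
```

In this sketch only the actor is needed at execution time (local observation plus received messages), while the centralized critic is discarded after training, which is the defining property of the CTDE setup described in the abstract.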