DeepCQ+: Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks

计算机科学 强化学习 可扩展性 路由协议 分布式计算 稳健性(进化) 计算机网络 链路状态路由协议 无线路由协议 移动自组网 布线(电子设计自动化) 人工智能 数据库 基因 生物化学 网络数据包 化学
作者
Saeed Kaviani,Bo Ryu,Ejaz Ahmed,Kevin A. Larson,Le Anh Ngoc,Alex Yahja,Jae H. Kim
标识
DOI:10.1109/milcom52596.2021.9652948
摘要

Highly dynamic mobile ad-hoc networks (MANETs) remain as one of the most challenging environments to develop and deploy robust, efficient, and scalable routing protocols. In this paper, we present DeepCQ+ routing protocol which, in a novel manner, integrates emerging multi-agent deep reinforcement learning (MADRL) techniques into existing Q-learning-based routing protocols and their variants, and achieves persistently higher performance across a wide range of topology and mobility configurations. While keeping the overall protocol structure of the Q-learning-based routing protocols, DeepCQ+ replaces statically configured parameterized thresholds and hand-written rules with carefully designed MADRL agents such that no configuration of such parameters is required a priori. Extensive simulation shows that DeepCQ+ yields significantly increased end-to-end throughput with lower overhead and no apparent degradation of end-to-end delays (hop counts) compared to its Q-learning-based counterparts. Qualitatively, and perhaps more significantly, DeepCQ+ maintains remarkably similar performance gains under many scenarios that it was not trained for in terms of network sizes, mobility conditions, and traffic dynamics. To the best of our knowledge, this is the first successful application of the MADRL framework for the MANET routing problem that demonstrates a high degree of scalability and robustness even under the environments that are outside the trained range of scenarios. This implies that our MARL-based DeepCQ+ design solution significantly improves the performance of Q-learning-based CQ+ baseline approach for comparison and increases its practicality and explainability because the real-world MANET environment will likely vary outside the trained range of MANET scenarios. Additional techniques to further increase the gains in performance and scalability are discussed.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
宋瓜完成签到,获得积分10
1秒前
1秒前
狂野悟空完成签到,获得积分10
2秒前
Cc完成签到 ,获得积分10
2秒前
waswas完成签到,获得积分10
2秒前
碧蓝的灵安完成签到,获得积分10
3秒前
xyp我不想说完成签到,获得积分10
3秒前
清风完成签到 ,获得积分10
4秒前
lp完成签到 ,获得积分10
4秒前
demi2333完成签到,获得积分10
6秒前
网线发布了新的文献求助10
7秒前
糖豆子发布了新的文献求助10
8秒前
zszzzsss完成签到,获得积分10
8秒前
9秒前
真人完成签到 ,获得积分10
9秒前
9秒前
ykk完成签到 ,获得积分10
10秒前
JIAO完成签到,获得积分10
10秒前
陶兜兜发布了新的文献求助10
10秒前
11秒前
诗韵完成签到 ,获得积分10
11秒前
爱是无限大完成签到,获得积分0
12秒前
小小瑶瑶完成签到,获得积分10
12秒前
YiYi完成签到 ,获得积分10
13秒前
冷酷曼卉发布了新的文献求助10
14秒前
CZY完成签到,获得积分10
15秒前
BBB完成签到,获得积分10
16秒前
万能图书馆应助银点采纳,获得10
16秒前
fluttershy完成签到 ,获得积分10
16秒前
16秒前
wh完成签到,获得积分10
16秒前
haoran_man完成签到,获得积分10
17秒前
actor2006完成签到,获得积分10
17秒前
17秒前
LYH完成签到,获得积分10
17秒前
cd发布了新的文献求助10
17秒前
高高的咖啡豆完成签到 ,获得积分10
19秒前
嘟嘟大魔王完成签到,获得积分10
19秒前
那儿完成签到,获得积分10
19秒前
龙卷风发布了新的文献求助10
19秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Developing Genetic Editing Tools for Lysobacter 2000
Adhesion Science: Principles & Practice 800
The Graphene Handbook (2019 Edition) 700
Signals, Systems, and Signal Processing 610
IEST-RP-CC018: Cleanroom Cleaning and Sanitization: Operating and Monitoring Procedures 600
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6530556
求助须知:如何正确求助?哪些是违规求助? 8323303
关于积分的说明 17818648
捐赠科研通 5631906
什么是DOI,文献DOI怎么找? 2932283
邀请新用户注册赠送积分活动 1908910
关于科研通互助平台的介绍 1768209