Deep Reinforcement Learning-Based Multichannel Access for Industrial Wireless Networks With Dynamic Multiuser Priority

强化学习 计算机科学 马尔可夫决策过程 无线 无线网络 趋同(经济学) 马尔可夫过程 分布式计算 机器学习 电信 统计 数学 经济 经济增长
作者
Xiaoyu Liu,Chi Xu,Haibin Yu,Peng Zeng
出处
期刊:IEEE Transactions on Industrial Informatics [Institute of Electrical and Electronics Engineers]
卷期号:18 (10): 7048-7058 被引量:21
标识
DOI:10.1109/tii.2021.3139349
摘要

In Industry 4.0, massive heterogeneous industrial devices generate a great deal of data with different quality of service requirements, and communicate via industrial wireless networks (IWNs). However, the limited time-frequency resources of IWNs cannot well support the high concurrent access of massive industrial devices with strict real-time and reliable communication requirements. To address this problem, a deep reinforcement learning-based dynamic priority multichannel access (DRL-DPMCA) algorithm is proposed in this article. Firstly, according to the time-sensitivity of industrial data, industrial devices are assigned with different priorities, based on which their channel access probabilities are dynamically adjusted. Then, the Markov decision process is utilized to model the dynamic priority multichannel access problem. To cope with the explosion of state space caused by the multichannel access of massive industrial devices with dynamic priorities, DRL is used to establish the mapping from states to actions. Next, the long-term cumulative reward is maximized to obtain an effective policy. Especially, with joint consideration of the access reward and priority reward, a compound reward for multichannel access and dynamic priority is designed. For breaking the time correlation of training data while accelerating the convergence of DRL-DPMCA, an experience replay with experience-weight is proposed to store and sample experiences categorically. Besides, the gated recurrent unit, dueling architecture and step-by-step $\varepsilon$ -greedy method are employed to make states more comprehensive and reduce model oscillation. Extensive experiments show that, compared with slotted-Aloha and deep Q network algorithms, DRL-DPMCA converges quickly, and guarantees the highest channel access probability and the minimum queuing delay for high-priority industrial devices in the context of minimum access conflict and nearly 100% channel utilization.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
jian94完成签到,获得积分10
2秒前
QYY完成签到,获得积分10
2秒前
xfy完成签到,获得积分10
3秒前
xurui_s完成签到 ,获得积分10
3秒前
波西米亚发布了新的文献求助10
3秒前
zhaosiqi完成签到 ,获得积分10
3秒前
3秒前
初景发布了新的文献求助10
6秒前
gougou完成签到,获得积分10
7秒前
活力蘑菇完成签到 ,获得积分10
7秒前
老实幻姬完成签到,获得积分10
7秒前
sls完成签到,获得积分10
8秒前
许七安完成签到,获得积分10
9秒前
CSX完成签到 ,获得积分10
11秒前
青桔完成签到,获得积分10
11秒前
12秒前
科研女郎完成签到 ,获得积分10
12秒前
xiaoai完成签到 ,获得积分10
13秒前
波西米亚完成签到,获得积分10
13秒前
嬛嬛完成签到,获得积分10
16秒前
walker007发布了新的文献求助10
17秒前
Lzp完成签到 ,获得积分10
19秒前
20秒前
熊二完成签到,获得积分10
21秒前
俭朴的世界完成签到 ,获得积分0
22秒前
wujia发布了新的文献求助10
24秒前
没头脑和不高兴完成签到,获得积分10
24秒前
走心君完成签到,获得积分10
24秒前
tangzl完成签到 ,获得积分10
25秒前
drslytherin完成签到,获得积分10
26秒前
郑征完成签到,获得积分10
26秒前
姜勇完成签到,获得积分10
26秒前
虚拟的画板完成签到 ,获得积分10
27秒前
29秒前
Eine发布了新的文献求助10
29秒前
好好完成签到,获得积分10
29秒前
犹豫的若完成签到,获得积分10
29秒前
30秒前
木青完成签到,获得积分10
31秒前
fake完成签到 ,获得积分10
31秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
Signals, Systems, and Signal Processing 610
Research Methods for Business: A Skill Building Approach, 9th Edition 500
Research Methods for Applied Linguistics 500
Picture Books with Same-sex Parented Families Unintentional Censorship 444
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6414035
求助须知:如何正确求助?哪些是违规求助? 8232681
关于积分的说明 17476731
捐赠科研通 5466713
什么是DOI,文献DOI怎么找? 2888499
邀请新用户注册赠送积分活动 1865327
关于科研通互助平台的介绍 1703234