Dynamic path planning via Dueling Double Deep Q-Network (D3QN) with prioritized experience replay

Computer science · Path (computing) · Motion planning · Artificial intelligence · Computer network · Robot
Author
Mehmet Gök
Source
Journal: Applied Soft Computing [Elsevier BV]
Volume: 158, article 111503 · Cited by: 64
Identifier
DOI:10.1016/j.asoc.2024.111503
Abstract

Path planning is a key requirement for mobile robots employed in tasks such as rescue or transport missions. Conventional methods for the path planning problem, such as A* or Dijkstra, need a prior map of the robot's environment. Dynamic path planning, which drives mobile robots without prior static requirements, is now a popular research topic. Deep reinforcement learning (DRL), another popular research area, is being harnessed by researchers to solve the dynamic path planning problem. In this study, Deep Q-Networks, a subdomain of DRL, are chosen to solve the dynamic path planning problem. We first employ the well-known techniques Double Deep Q-Networks (D2QN) and Dueling Double Deep Q-Networks (D3QN) to train a model that can drive a mobile robot in environments with static and dynamic obstacles within 3 different configurations. We then propose D3QN with a Prioritized Experience Replay (PER) extension to further optimize the DRL model. We created a test bed to measure the performance of the DRL models against 99 randomly generated goal locations. According to our experiments, the D3QN-PER method performs better than D2QN and D3QN in terms of path length and travel time to the goal, without any collisions. The Robot Operating System (ROS) and the Gazebo simulation environment are used to realize the training and testing environments; thus, the trained DRL models can be deployed to any ROS-compatible robot seamlessly.
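For readers unfamiliar with the three ingredients named in the abstract, the following is a minimal, framework-free sketch of how they are commonly defined in the general literature: the dueling aggregation of a state value and per-action advantages, the Double DQN bootstrap target, and proportional prioritized replay. It is not the paper's implementation. The function and class names, the list-based buffer, and the values `gamma=0.99`, `alpha=0.6`, `beta=0.4`, and `eps=1e-6` are all illustrative assumptions; the abstract does not state the network architecture, reward design, or hyperparameters.

```python
import random


def dueling_q_values(state_value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a')."""
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + a - mean_adv for a in advantages]


def double_dqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """Double DQN target: the online net selects the next action,
    the target net evaluates it. gamma is an assumed discount factor."""
    if done:
        return reward
    best_action = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    return reward + gamma * q_target_next[best_action]


class PrioritizedReplayBuffer:
    """Proportional prioritized replay on plain lists (O(n) sampling).
    Hypothetical helper for illustration; alpha, beta, eps are assumed values."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha      # how strongly priorities skew sampling
        self.eps = eps          # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0            # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions take the current maximum priority so they are
        # likely to be sampled soon after being stored.
        max_p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(max_p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = max_p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        # Sampling probability P(i) is proportional to priority_i ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        idxs = random.choices(range(len(self.data)), weights=probs, k=batch_size)
        # Importance-sampling weights (N * P(i)) ** -beta, normalized by the
        # largest weight in the batch so that no weight exceeds 1.
        n = len(self.data)
        weights = [(n * probs[i]) ** (-beta) for i in idxs]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        return idxs, [self.data[i] for i in idxs], weights

    def update_priorities(self, idxs, td_errors):
        # After a training step, priority becomes |TD error| + eps.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a training loop built on this sketch, each sampled transition's loss would be scaled by its importance-sampling weight, and the absolute TD error computed from `double_dqn_target` would be passed back through `update_priorities`. A practical implementation would typically replace the lists with a sum-tree to make sampling logarithmic in the buffer size.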