发布文献求助

已入深夜，您辛苦了！由于当前在线用户较少，发布求助请尽量完整的填写文献信息，科研通机器人24小时在线，伴您度过漫漫科研夜！祝你早点完成任务，早点休息，好梦！

Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning

强化学习避障计算机科学模块化设计任务（项目管理）障碍物一般化人工智能建筑网络体系结构实时计算人机交互分布式计算机器人工程类移动机器人计算机网络艺术数学分析数学系统工程法学政治学视觉艺术操作系统

作者

Yuanda Wang,Haibo He,Changyin Sun

出处

期刊：IEEE transactions on games [Institute of Electrical and Electronics Engineers]
日期：2018-06-25 卷期号：10 (4): 400-412 被引量：86

标识

DOI：10.1109/tg.2018.2849942

摘要

In this paper, we propose an end-to-end modular reinforcement learning architecture for a navigation task in complex dynamic environments with rapidly moving obstacles. In this architecture, the main task is divided into two subtasks: local obstacle avoidance and global navigation. For obstacle avoidance, we develop a two-stream Q-network, which processes spatial and temporal information separately and generates action values. The global navigation subtask is resolved by a conventional Q-network framework. An online learning network and an action scheduler are introduced to first combine two pretrained policies, and then continue exploring and optimizing until a stable policy is obtained. The two-stream Q-network obtains better performance than the conventional deep Q-learning approach in the obstacle avoidance subtask. Experiments on the main task demonstrate that the proposed architecture can efficiently avoid moving obstacles and complete the navigation task at a high success rate. The modular architecture enables parallel training and also demonstrates good generalization capability in different environments.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

活动

『应助活动周』获奖名单已公布 🔥 (2025-4-2)

更新

『中科院2025期刊分区』已更新 (2025-3-23)

更新

『即时热点』模块已上线 (2025-2-28)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 舒心的芝麻完成签到，获得积分10

1秒前; 若有光发布了新的文献求助10

1秒前; JamesPei的应助被端庄的钢铁侠采纳，获得10

11秒前; 老虎皮发布了新的文献求助10

17秒前; 所所上传了应助文件

18秒前; hahaha完成签到，获得积分10

19秒前; JamesPei上传了应助文件

20秒前; 上官若男的应助被xiao_J采纳，获得10

21秒前; SciGPT的应助被若有光采纳，获得10

22秒前; 动听的晓博发布了新的文献求助10

23秒前; 林志伟完成签到，获得积分10

25秒前; 晓晴完成签到，获得积分20

26秒前; 端庄的钢铁侠发布了新的文献求助10

27秒前; 调皮小凡完成签到，获得积分10

28秒前; 万能图书馆的应助被Ryy采纳，获得10

31秒前; 科研通AI5上传了应助文件

33秒前; 楚阔发布了新的文献求助10

39秒前; bkagyin的应助被动听的晓博采纳，获得10

40秒前; Xiaoxiao上传了应助文件

42秒前; 月报月报完成签到，获得积分10

43秒前; 芒果你真甜完成签到，获得积分10

43秒前; 许许发布了新的文献求助10

44秒前; 碧蓝香芦完成签到，获得积分10

46秒前; fr发布了新的文献求助10

49秒前; 情怀上传了应助文件

50秒前; 动听的晓博完成签到，获得积分10

50秒前; zmnzmnzmn的应助被芒果你真甜采纳，获得10

51秒前; 上官若男上传了应助文件

52秒前; 端庄的钢铁侠完成签到，获得积分20

53秒前; 文迪发布了新的文献求助10

53秒前; 爆米花的应助被小杨采纳，获得10

54秒前; 冷先森EPC完成签到，获得积分10

55秒前; 马华化完成签到，获得积分0

56秒前; wzs222关注了科研通微信公众号

56秒前; kai9712发布了新的文献求助10

57秒前; 无花果的应助被务实道罡采纳，获得10

1分钟前; 慕青上传了应助文件

1分钟前; PangSir完成签到，获得积分10

1分钟前; 文迪完成签到，获得积分10

1分钟前; DRAZ发布了新的文献求助10

1分钟前

高分求助中: 【此为提示信息，请勿应助】请按要求发布求助，避免被关 20000; ISCN 2024 – An International System for Human Cytogenomic Nomenclature (2024) 3000; Continuum Thermodynamics and Material Modelling 2000; Encyclopedia of Geology (2nd Edition) 2000; 105th Edition CRC Handbook of Chemistry and Physics 1600; Maneuvering of a Damaged Navy Combatant 650; the MD Anderson Surgical Oncology Manual, Seventh Edition 300

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3777501; 求助须知：如何正确求助？哪些是违规求助？ 3322845; 关于积分的说明 10212016; 捐赠科研通 3038215; 什么是DOI，文献DOI怎么找？ 1667229; 邀请新用户注册赠送积分活动 798030; 科研通“疑难数据库（出版商）”最低求助积分说明 758193

今日热心研友

昏睡的蟠桃

文献看不懂

科研小民工

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通