Multi-Object Navigation Using Potential Target Position Policy Function

计算机科学 对象(语法) 人工智能 任务(项目管理) 强化学习 计算机视觉 功能(生物学) 职位(财务) 关系(数据库) 人机交互 数据挖掘 工程类 生物 进化生物学 经济 系统工程 财务
作者
Haitao Zeng,Xinhang Song,Shuqiang Jiang
出处
期刊:IEEE transactions on image processing [Institute of Electrical and Electronics Engineers]
卷期号:32: 2608-2619 被引量:7
标识
DOI:10.1109/tip.2023.3263110
摘要

Visual object navigation is an essential task of embodied AI, which is letting the agent navigate to the goal object under the user's demand. Previous methods often focus on single-object navigation. However, in real life, human demands are generally continuous and multiple, requiring the agent to implement multiple tasks in sequence. These demands can be addressed by repeatedly performing previous single task methods. However, by dividing multiple tasks into several independent tasks to perform, without the global optimization between different tasks, the agents' trajectories may overlap, reducing the efficiency of navigation. In this paper, we propose an efficient reinforcement learning framework with a hybrid policy for multi-object navigation, aiming to maximally eliminate noneffective actions. First, the visual observations are embedded to detect the semantic entities (such as objects). And the detected objects are memorized and projected into semantic maps, which can also be regarded as a long-term memory of the observed environment. Then a hybrid policy consisting of exploration and long-term planning strategies is proposed to predict the potential target position. In particular, when the target is directly oriented, the policy function makes long-term planning for the target based on the semantic map, which is implemented by a sequence of motion actions. In the alternative, when the target is not oriented, the policy function estimates an object's potential position toward exploring the most possible objects (positions) that have close relations to the target. The relation between different objects is obtained with prior knowledge, which is used to predict the potential target position by integrating with the memorized semantic map. And then a path to the potential target is planned by the policy function. We evaluate our proposed method on two large-scale 3D realistic environment datasets, Gibson and Matterport3D, and the experimental results demonstrate the effectiveness and generalization of the proposed method.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
丘比特应助wenduoxu采纳,获得10
刚刚
刚刚
欣xin发布了新的文献求助10
刚刚
完美世界应助xm采纳,获得10
1秒前
畔畔应助汤汤采纳,获得50
1秒前
安静听芹发布了新的文献求助10
1秒前
2秒前
2秒前
手术发布了新的文献求助10
2秒前
yankui_sun发布了新的文献求助10
3秒前
3秒前
Function完成签到,获得积分10
3秒前
柒柒完成签到,获得积分20
3秒前
jay_发布了新的文献求助10
4秒前
4秒前
4秒前
小羊发布了新的文献求助20
4秒前
妮妮完成签到 ,获得积分10
4秒前
j1kxm完成签到,获得积分10
5秒前
药学生完成签到,获得积分10
5秒前
SciGPT应助文艺花生采纳,获得10
5秒前
6秒前
李审绥完成签到,获得积分10
6秒前
6秒前
7秒前
7秒前
Alexy发布了新的文献求助20
7秒前
JamesPei应助超帅孱采纳,获得10
7秒前
笨笨烨华完成签到 ,获得积分10
7秒前
思源应助清爽的芹菜采纳,获得10
7秒前
文献查找完成签到,获得积分10
8秒前
佚名发布了新的文献求助10
8秒前
VERITAS完成签到,获得积分10
8秒前
Cecilia完成签到,获得积分10
9秒前
9秒前
9秒前
hyy完成签到,获得积分10
9秒前
9秒前
Elsa发布了新的文献求助10
9秒前
niuniu发布了新的文献求助10
9秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Metallurgy at high pressures and high temperatures 2000
Tier 1 Checklists for Seismic Evaluation and Retrofit of Existing Buildings 1000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 1000
The Organic Chemistry of Biological Pathways Second Edition 1000
Free parameter models in liquid scintillation counting 1000
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6331751
求助须知:如何正确求助?哪些是违规求助? 8148336
关于积分的说明 17101499
捐赠科研通 5387513
什么是DOI,文献DOI怎么找? 2856192
邀请新用户注册赠送积分活动 1833660
关于科研通互助平台的介绍 1684920