计算机科学
融合机制
机器人
图形
人工智能
人机交互
任务(项目管理)
任务分析
可执行文件
可解释性
服务机器人
编码器
认知
融合
理论计算机科学
哲学
神经科学
语言学
管理
脂质双层融合
经济
生物
操作系统
作者
Yongcheng Cui,Guohui Tian,Zhengsong Jiang,Mengyang Zhang,Yu Gu,Y. Wang
标识
DOI:10.1109/tcsvt.2023.3339292
摘要
Active Task Cognition (ATC) requires the robot to comprehend the current scene using the image within the field of view, enabling them to reason about appropriate and executable tasks, thus allowing the robot to achieve service task scene discovery capability similar to humans. This capability is paramount for robots to provide comfort and intelligent service while performing their tasks. To enhance home service robots' ATC capability, a multi-graph fusion mechanism based on Graph Attention Network (GAT) is proposed in this paper to model the semantic feature related to the task. First, a multi-graph fusion encoder is proposed to maximally capture the integrated features of objects, tasks, and scenes from the images, thereby obtaining a semantic representation related to the home service task from the robot's perspective. Next, to enhance the interpretability of the model, we propose a multi-task scene understanding decoder based on the attention mechanism to utilize the integration features of multi-graph fusion efficiently. Lastly, we present a loss function for multi-task scene understanding in the proposed Encoder-Decoder network model for scene comprehension. Furthermore, a new dataset comprising various daily household tasks is constructed in the experiments. Extensive experimental results indicate that the proposed method significantly enhances the robot's active cognitive abilities in service tasks, empowering it with advanced levels of intelligence.
科研通智能强力驱动
Strongly Powered by AbleSci AI