计算机科学
人工智能
RGB颜色模型
突出
模式识别(心理学)
边距(机器学习)
对象(语法)
滤波器(信号处理)
模态(人机交互)
深度图
判别式
嵌入
计算机视觉
图像(数学)
机器学习
作者
Fasheng Wang,Ruimin Wang,Fuming Sun
标识
DOI:10.1016/j.eswa.2022.119047
摘要
It is well-acknowledged that depth maps contain affluent spatial information which is crucial to explicitly distinguish the foreground and background in Salient Object Detection (SOD). With the help of depth maps, the performance has been pushed to the peak in SOD. Nevertheless, some depth maps with low quality are not potent for capturing accurate spatial information. Hence, it is not desirable to utilize depth maps indiscriminately. To this end, we propose a Discriminant and Cross-Modality Network (DCMNet) for RGB-D salient object detection. In DCMNet, we integrate a module named Depth Decomposition and Recomposition Module (DDRM) to filter depth maps with low quality. Thereafter, we conduct a quality enhancement procedure towards these detrimental depth maps. Meanwhile, we propose a Multi-Cross Attention Module (MCAM), which combines spatial attention with channel attention in a multi-cross way for better exploiting rich details about the salient object from RGB-stream and depth-stream. In addition, we employ Res2Net model to efficiently excavate foreground information and it is named as Image Pretraining Model (IPM). By embedding DDRM, MCAM and IPM, the accuracy has increased by a large margin. Extensive experiments manifest our proposed approach (DCMNet) outperforms the other 14 state-of-the-art methods on five challenging public datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI