突出
计算机科学
人工智能
模式识别(心理学)
先验概率
特征(语言学)
发电机(电路理论)
编码(集合论)
像素
源代码
目标检测
计算机视觉
集合(抽象数据类型)
贝叶斯概率
哲学
语言学
功率(物理)
物理
量子力学
程序设计语言
操作系统
作者
Yi Wang,Ruili Wang,Xiangjian He,Chi Lin,Tianzhu Wang,Qi Jia,Xin Fan
标识
DOI:10.1016/j.patcog.2024.110579
摘要
Weakly supervised salient object detection (WSOD) methods endeavor to boost sparse labels to get more salient cues in various ways. Among them, an effective approach is using pseudo labels from multiple unsupervised self-learning methods, but inaccurate and inconsistent pseudo labels could ultimately lead to detection performance degradation. To tackle this problem, we develop a new multi-source WSOD framework, WBNet, that can effectively utilize pseudo-background (non-salient region) labels combined with scribble labels to obtain more accurate salient features. We first design a comprehensive salient pseudo-mask generator from multiple self-learning features. Then, we pioneer the exploration of generating salient pseudo-labels via point-prompted and box-prompted Segment-Anything Models (SAM). Then, WBNet leverages a pixel-level Feature Aggregation Module (FAM), a mask-level Transformer-decoder (TFD), and an auxiliary Boundary Prediction Module (EPM) with a hybrid loss function to handle complex saliency detection tasks. Comprehensively evaluated with state-of-the-art methods on five widely used datasets, the proposed method significantly improves saliency detection performance. The code and results are publicly available at https://github.com/yiwangtz/WBNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI