Weakly-Supervised Camouflaged Object Detection via SAM-Guided Resolution Iteration Learning

计算机科学特征（语言学）特征学习人工智能背景（考古学）目标检测对象（语法）特征提取上下文模型计算机视觉视觉对象识别的认知神经科学模式识别（心理学）边距（机器学习）融合机制深度学习迭代法机器学习先验概率骨干网空间语境意识建筑可视化监督学习数据挖掘特征检测（计算机视觉）

作者

Yanliang Ge,Yuxi Zhong,Qiao Zhang,Hongbo Bi,Tian-Zhu Xiang

出处

期刊：IEEE Transactions on Big Data [IEEE Computer Society]
日期：2025-10-23 卷期号：12 (2): 403-415

标识

DOI：10.1109/tbdata.2025.3624975

摘要

Weakly supervised camouflaged object detection (WS-COD) aims to address the critical task of identifying visually assimilated objects concealed within heterogeneous backgrounds under sparse supervisory signals. However, current WS-COD frameworks suffer from compromised structural integrity, stemming from cross-hierarchical feature discrepancy and constrained cross-level information flow, which induces structural misalignment and context fragmentation in multi-granularity feature fusion. To overcome the limitation, we propose a novel SAM-guided Resolution Iteration Learning Network (SAM-RNet) that synergizes foundation model priors with multi-resolution feature refinement. Our technical contributions are threefold: (1) We utilize the Segment Anything Model (SAM) to produce high-quality masks, effectively mitigating supervision insufficiency through large-scale visual knowledge distillation. (2) We design a resolution iteration mechanism where high-resolution features progressively refine low-resolution counterparts through an Interactive Refinement Module (IRM) - a dual-branch architecture enabling hierarchical feature interaction and enhancement through branch collaboration and attention mechanism, complemented by an iterative feedback loss to enforce multi-scale feature learning. (3) We develop a Decoder with cross-layer fusion operations, enabling the aggregation of features from object and background contexts for precise object segmentation. Finally, extensive experiments demonstrate that SAM-RNet is superior to existing WS-COD methods across three COD datasets, achieving average improvements of 4.37%, 4.60%, 7.00%, and 24.06% in

$S_{\alpha }$

$E_{\phi }$

$F_{\beta }^{\omega }$

, and

$M$

, respectively.

求助该文献

最长约 10秒，即可获得该文献文件

Weakly-Supervised Camouflaged Object Detection via SAM-Guided Resolution Iteration Learning

今日热心研友