计算机科学
人工智能
光学(聚焦)
判别式
目标检测
计算机视觉
背景(考古学)
编码器
特征提取
模式识别(心理学)
古生物学
物理
光学
生物
操作系统
作者
Song Ze,Xudong Kang,Xiaohui Wei,Haibo Liu,Renwei Dian,Shutao Li
标识
DOI:10.1109/tip.2023.3266659
摘要
Camouflaged object detection (COD) aims to discover objects that blend in with the background due to similar colors or textures, etc. Existing deep learning methods do not systematically illustrate the key tasks in COD, which seriously hinders the improvement of its performance. In this paper, we introduce the concept of focus areas that represent some regions containing discernable colors or textures, and develop a two-stage focus scanning network for camouflaged object detection. Specifically, a novel encoder-decoder module is first designed to determine a region where the focus areas may appear. In this process, a multi-layer Swin transformer is deployed to encode global context information between the object and the background, and a novel cross-connection decoder is proposed to fuse cross-layer textures or semantics. Then, we utilize the multi-scale dilated convolution to obtain discriminative features with different scales in focus areas. Meanwhile, the dynamic difficulty aware loss is designed to guide the network paying more attention to structural details. Extensive experimental results on the benchmarks, including CAMO, CHAMELEON, COD10K, and NC4K, illustrate that the proposed method performs favorably against other state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI