计算机科学
对象(语法)
目标检测
组分(热力学)
特征(语言学)
时频分析
骨干网
人工智能
代表(政治)
计算机视觉
特征提取
人工神经网络
实时计算
软件部署
稳健性(进化)
钥匙(锁)
模式识别(心理学)
视觉对象识别的认知神经科学
干扰(通信)
频率响应
无线电频率
可视化
低频
射频识别
实体造型
信号处理
人类视觉系统模型
作者
Weiyun Liang,Jiesheng Wu,Yanfeng Wu,Xinyue Mu,Jing Xu
标识
DOI:10.1109/lsp.2024.3356416
摘要
Existing camouflaged object detection (COD) methods typically have large model parameters and computations, hindering their deployment in real-world applications. Although using lightweight backbones can help alleviate this problem, their weaker feature representation often leads to performance degradation. To address this issue, we observe that frequency information has shown effective for cumbersome networks, but its effectiveness for lightweight ones has not been thoroughly investigated. Biological studies indicate that the human visual system utilizes distinct neural pathways to respond to different frequency stimuli, contributing to specialization and efficiency. Motivated by this, we propose an efficient frequency injection module (FIM) to aid lightweight backbone features by separately injecting detailed high frequency and object-level low frequency cues at each stage. FIM can be used as a plug-and-play component in existing COD networks to enhance backbone features at a low cost. With FIM, our proposed frequency injection network (FINet) achieves competitive performance against most state-ofthe- art methods with much faster speed (692FPS for the input size of 384 x 384) and fewer parameters (3.74M). Source codes will be released at https://github.com/crrcoo/FINet.
科研通智能强力驱动
Strongly Powered by AbleSci AI