计算机科学
特征(语言学)
人工智能
红外线的
融合
模式识别(心理学)
传感器融合
特征提取
计算机视觉
领域(数学)
遥感
光学
物理
数学
地质学
哲学
语言学
纯数学
作者
Xiaohan Zhang,Xue Zhang,Si-Yuan Cao,Beinan Yu,Chenghao Zhang,Hui‐Liang Shen
标识
DOI:10.1109/tgrs.2024.3416470
摘要
Infrared small target detection (IRSTD) has made remarkable achievements in recent years. However, the core focus of current works lies on the philosophy of “increasing network complexity,” which leaves the crucial strategies behind performance improvement unclear. To handle this, we highlight two strategies of IRSTD: 1) multireceptive field perception and 2) effective feature fusion. Focusing on these strategies, we propose a multireceptive field perception and effective feature fusion network, named MRF3Net. Specifically, for multireceptive field perception, we devise a multiple perception encoder (MPE). For effective feature fusion, we devise a feature fusion encoder (FFE) and a feature fusion decoder (FFD). The former improves the encoding efficiency by reducing the interference information and preserving target details, and the latter fuses the target information of low-level and high-level features while eliminating noise. Experiments demonstrate that MRF3Net achieves state-of-the-art performance on popular public datasets while maintaining a fast inference speed of approximately 0.011 s/frame on a single NVIDIA GeForce 3070Ti GPU and 0.049 s/frame on the NVIDIA Jetson Orin. Notably, it significantly reduces parameter costs compared with the previous state-of-the-art approaches, validating its efficiency. Moreover, our MPE, FFE, and FFD have proved to be effective in enhancing other IRSTD approaches. Our code will be made available at: https://github.com/Temperature-ai/MRF3Net.
科研通智能强力驱动
Strongly Powered by AbleSci AI