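Clutter
Computer science
Block (permutation group theory)
Artificial intelligence
Benchmark (surveying)
Orientation (vector space)
Infrared
Noise (video)
Enhanced Data Rates for GSM Evolution
Computer vision
Pixel
Pattern recognition (psychology)
Object detection
Code (set theory)
Image (mathematics)
Mathematics
Radar
Optics
Telecommunications
Physics
Geometry
Set (abstract data type)
Programming language
Geography
Geodesy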
Authors
Mingjin Zhang,Rui Zhang,Yuxiang Yang,Haichen Bai,Jing Zhang,Jie Guo
Identifier
DOI:10.1109/cvpr52688.2022.00095
Abstract
Infrared small target detection (IRSTD) refers to extracting small and dim targets from blurred backgrounds, and has a wide range of applications such as traffic management and marine rescue. Due to the low signal-to-noise ratio and low contrast, infrared targets are easily submerged in backgrounds of heavy noise and clutter. Detecting the precise shape information of infrared targets remains challenging. In this paper, we propose a novel infrared shape network (ISNet), in which a Taylor finite difference (TFD)-inspired edge block and a two-orientation attention aggregation (TOAA) block are devised to address this problem. Specifically, the TFD-inspired edge block aggregates and enhances comprehensive edge information from different levels, in order to improve the contrast between target and background and to lay a foundation for extracting shape information with mathematical interpretation. The TOAA block processes the low-level information with an attention mechanism in both the row and column directions and fuses it with the high-level information to capture the shape characteristics of targets and suppress noise. In addition, we construct a new benchmark, called IRSTD-1k, consisting of 1,000 realistic images with various target shapes, different target sizes, and rich clutter backgrounds, with accurate pixel-level annotations. Experiments on public datasets and IRSTD-1k demonstrate the superiority of our approach over representative state-of-the-art IRSTD methods. The dataset and code are available at github.com/RuiZhang97/ISNet.
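As background for the "Taylor finite difference" idea mentioned in the abstract, the following is a minimal sketch of the classical second-order central difference, which follows from subtracting the Taylor expansions of f(x+1) and f(x-1). It is not the ISNet edge block and is not taken from the authors' repository; the function name `central_diff_edges` and the 5x5 test frame are hypothetical and used only for illustration.

```python
def central_diff_edges(img):
    """Gradient magnitude from central differences on a 2D list of floats.

    Illustrative only: a classical finite-difference edge map,
    not the TFD-inspired edge block described in the abstract.
    Border pixels are left at 0.0.
    """
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # f'(x) ~ (f(x+1) - f(x-1)) / 2, from the Taylor expansion
            gx = (img[y][x + 1] - img[y][x - 1]) / 2.0
            gy = (img[y + 1][x] - img[y - 1][x]) / 2.0
            out[y][x] = (gx * gx + gy * gy) ** 0.5
    return out


# Hypothetical frame: a single bright pixel on a dark background.
frame = [[0.0] * 5 for _ in range(5)]
frame[2][2] = 1.0
edges = central_diff_edges(frame)
```

In this toy frame the response is zero at the bright pixel itself and non-zero at its four direct neighbours, which illustrates why a one-pixel target yields a ring-shaped edge response under a plain central difference.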