人工智能
计算机科学
目标检测
计算机视觉
合成孔径雷达
探测器
假警报
特征(语言学)
恒虚警率
特征提取
分类器(UML)
变压器
目标捕获
利用
动目标指示
雷达成像
模式识别(心理学)
雷达跟踪器
深度学习
逆合成孔径雷达
单发
异常检测
块(置换群论)
雷达
自动目标识别
卷积神经网络
作者
Yunxuan Hao,Kaicheng Fu,Chi Xu,Qiang Cheng
摘要
Video Synthetic Aperture Radar (ViSAR) moving target detection is a challenging task. Generally speaking, the scale of ViSAR moving targets is extremely small, causing the increase of detection difficulties. Existing methods do not perform well in ViSAR small target detection tasks and suffer from loss of information. Moreover, most of the existing methods failed to exploit the intrinsic temporal feature of ViSAR images which is significant for the moving target detection. To address these issues, YOLO Embrace Transformer (YET), a two-stage object detection network is proposed. The first stage network utilizes a model integrated with the Convolutional Block Attention Module (CBAM) which can extract channelwise and spatial-wise information. Meanwhile, fine-grained multi-scale features are merged by inter-channel feature fusion means, enabling the model to detect potential moving targets with a high rate. In the second stage, a classifier is designed to realize false alarm suppression based on temporal information, equipped with a Transformer module. Experiments on both public and private datasets verify the effectiveness of YET. It outperforms mainstream detectors and achieves better precision, indicating its superiority in ViSAR moving target detection task.
科研通智能强力驱动
Strongly Powered by AbleSci AI