Optimizing Slender Target Detection in Remote Sensing with Adaptive Boundary Perception

计算机科学初始化边界（拓扑）跳跃式监视探测器最小边界框卷积神经网络干扰（通信）人工智能感知计算机视觉图像（数学）数学频道（广播）电信神经科学生物程序设计语言数学分析

作者

Zhu Han,Donglin Jing

出处

期刊：Remote Sensing [Multidisciplinary Digital Publishing Institute]
日期：2024-07-19 卷期号：16 (14): 2643-2643 被引量：2

链接

mdpi.com mdpi.comdoi.org

标识

DOI：10.3390/rs16142643

摘要

Over the past few years, target detectors that utilize Convolutional Neural Networks have gained extensive application in the domain of remote sensing (RS) imagery. Recently, optimizing bounding boxes has consistently been a hot topic in the research field. However, existing methods often fail to take into account the interference caused by the shape and orientation changes of RS targets with high aspect ratios during training, leading to challenges in boundary perception when dealing with RS targets that have large aspect ratios. To deal with this challenge, our study introduces the Adaptive Boundary Perception Network (ABP-Net), a novel two-stage approach consisting of pre-training and training phases, which enhances the boundary perception of CNN-based detectors. In the pre-training phase, involving the initialization of our model’s backbone network and the label assignment, the traditional label assignment with a fixed IoU threshold fails to fully cover the critical information of slender targets, resulting in the detector missing lots of high-quality positive samples. To overcome this drawback, we design a Shape-Sensitive (S-S) label assignment strategy that can improve the boundary shape perception by dynamically adjusting the IoU threshold according to the aspect ratios of the targets so that the high-quality samples with critical features can be divided into positive samples. Moreover, during the training phase, minor angle differences of the slender bounding box may cause a significant change in the value of the loss function, producing unstable gradients. Such drastic gradient changes make it difficult for the model to find a stable update direction when optimizing the bounding box parameters, resulting in difficulty with the model convergence. To this end, we propose the Robust–Refined loss function (R-R), which can enhance the boundary localization perception by focusing on low-error samples and suppressing the gradient amplification of difficult samples, thereby improving the model stability and convergence. Experiments on UCAS-AOD and HRSC2016 datasets validate our specialized detector for high-aspect-ratio targets, improving performance, efficiency, and accuracy with straightforward operation and quick deployment.

求助该文献

Optimizing Slender Target Detection in Remote Sensing with Adaptive Boundary Perception

今日热心研友