计算机科学
初始化
边界(拓扑)
跳跃式监视
探测器
最小边界框
卷积神经网络
干扰(通信)
人工智能
感知
计算机视觉
图像(数学)
数学
频道(广播)
电信
神经科学
生物
程序设计语言
数学分析
出处
期刊:Remote Sensing
[Multidisciplinary Digital Publishing Institute]
日期:2024-07-19
卷期号:16 (14): 2643-2643
被引量:2
摘要
Over the past few years, target detectors that utilize Convolutional Neural Networks have gained extensive application in the domain of remote sensing (RS) imagery. Recently, optimizing bounding boxes has consistently been a hot topic in the research field. However, existing methods often fail to take into account the interference caused by the shape and orientation changes of RS targets with high aspect ratios during training, leading to challenges in boundary perception when dealing with RS targets that have large aspect ratios. To deal with this challenge, our study introduces the Adaptive Boundary Perception Network (ABP-Net), a novel two-stage approach consisting of pre-training and training phases, which enhances the boundary perception of CNN-based detectors. In the pre-training phase, involving the initialization of our model’s backbone network and the label assignment, the traditional label assignment with a fixed IoU threshold fails to fully cover the critical information of slender targets, resulting in the detector missing lots of high-quality positive samples. To overcome this drawback, we design a Shape-Sensitive (S-S) label assignment strategy that can improve the boundary shape perception by dynamically adjusting the IoU threshold according to the aspect ratios of the targets so that the high-quality samples with critical features can be divided into positive samples. Moreover, during the training phase, minor angle differences of the slender bounding box may cause a significant change in the value of the loss function, producing unstable gradients. Such drastic gradient changes make it difficult for the model to find a stable update direction when optimizing the bounding box parameters, resulting in difficulty with the model convergence. To this end, we propose the Robust–Refined loss function (R-R), which can enhance the boundary localization perception by focusing on low-error samples and suppressing the gradient amplification of difficult samples, thereby improving the model stability and convergence. Experiments on UCAS-AOD and HRSC2016 datasets validate our specialized detector for high-aspect-ratio targets, improving performance, efficiency, and accuracy with straightforward operation and quick deployment.
科研通智能强力驱动
Strongly Powered by AbleSci AI