计算机科学
人工智能
卷积(计算机科学)
目标检测
特征提取
背景(考古学)
特征(语言学)
遥感
计算机视觉
适应性
光学(聚焦)
保险丝(电气)
比例(比率)
图像融合
模式识别(心理学)
图像(数学)
人工神经网络
地理
工程类
生态学
语言学
哲学
物理
电气工程
地图学
考古
光学
生物
作者
Xing Rong,Zhihua Zhang,Hao Yuan,Shaobin Zhang
标识
DOI:10.1117/1.jrs.18.016507
摘要
Remote sensing images are characterized by complex feature backgrounds and large target scale differences, so object detection for remote sensing images is a challenging problem. This work proposes a one-stage structure remote sensing image object detection model called GODANet. First, the GODANet incorporates a Global Context Network (GCNet) in the feature extraction structure. The GCNet focuses the model on the image region of interest from a global perspective. Second, the output layer utilizes an omni-dimensional dynamic convolution technique, allowing for more flexible adaptation to targets or edges in specific regions. Finally, an adaptive spatial feature fusion structure, IR-ASFF, which fuses improved-RFB (IRFB) modules is proposed to fuse the critical information of multiple levels of features to realize the adaptability to object detection at different scales. The GODANet efficiently aggregates network performance and possesses two main advantages: adaptability to multi-scale targets and focus on features of interest. The mean average precision (mAP) on the DIOR dataset and the NWPU VHR-10 dataset reached 93.7% and 92.9%, respectively, and compared with YOLOv7, the mAP was improved by 3.1% and 0.3%, respectively. Therefore, we believe the GODANet suits remote sensing image detection tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI