计算机科学
联营
人工智能
棱锥(几何)
目标检测
特征(语言学)
任务(项目管理)
对象(语法)
模式识别(心理学)
比例(比率)
计算机视觉
特征提取
工程类
哲学
物理
光学
系统工程
量子力学
语言学
作者
Guiping Wu,Lidong Liu,Zixiang Liu,Yao Liu,Tao Gao
标识
DOI:10.1109/lgrs.2023.3291505
摘要
Object detection is a fundamental task in the analysis and interpretation of remote sensing images. However, compared to natural images, remote sensing images are characterized by broad diversity in object scales, fuzzy objects, and complex background, which bring great challenges to object detection. For overcoming the above problems, a task alignment interaction and cross-scale guidance enhancement network (TCNet) is proposed in this letter. Firstly, a generalized mean spatial pyramid pooling (GeMSPP) is designed and embedded in the backbone to adapt to changes of complex environment and reduce loss of features. Secondly, cross-scale guided enhancement network (CGEN) is proposed to generate high-quality non-aliasing multi-scale target features for each feature level by guiding the fusion of deep features and enhancing feature expression. Thirdly, Task alignment interactive head (TAIH) is adopted to enhance the classification and regression accuracy of the prediction box, so as to suppress background interference and highlight object features. Experiments conducted on public DIOR and RSOD datasets illustrate that the proposed modules can effectively improve the accuracy of detection and our network has superior performance compared with other state-of-the-art detectors.
科研通智能强力驱动
Strongly Powered by AbleSci AI