人工智能
计算机科学
特征(语言学)
比例(比率)
网(多面体)
图像(数学)
计算机视觉
对象(语法)
基于对象
目标检测
模式识别(心理学)
地图学
数学
地理
哲学
语言学
几何学
作者
Ziyang Lan,Fengyuan Zhuang,Zhijie Lin,Riqing Chen,Lifang Wei,Taotao Lai,Changcai Yang
标识
DOI:10.1109/lgrs.2024.3382090
摘要
Object detection in scenes captured by unmanned aerial vehicles (UAV) is an active research area. However, the performance and efficiency of current small object detection models for UAV images are far from reaching the desired level. The inherent limitations of the features of the small objects themselves and the inconsistency of the contextual information in the feature maps lead to a degradation of the final detection performance. In this letter, to improve the performance of UAV image small object detection, we propose a multi-scale feature optimization network, named MFO-Net. We have designed three crucial modules: feature optimization fusion (FOF) module, multi-scale localized feature aggregation (MLFA) module, and feature enhancement (FE) module. FOF module enhances the fusion of features with inconsistent contexts at different levels by learning pixel-wise displacement, facilitating more effective feature fusion, which further helps focus on and capture critical information about small objects. MLFA module aggregates richer contextual information through multi-branch stripe convolution blocks, while the FE module extracts richer gradient flow information, suppresses incompatible information, and enhances feature representation capability. We conduct extensive experiments on the challenging VisDrone2019 dataset and compare the results against those obtained from the state-of-the-art methods. The experimental results show that MFO-Net performs better than other detectors. Specifically, MFO-Net achieves the best performance with 22.3% AP, 38.9% AP 50 , and 22.5% AP 75 on VisDrone2019. Code: https://github.com/Lanziyang121/MFO-Net.
科研通智能强力驱动
Strongly Powered by AbleSci AI