职位(财务)
关系(数据库)
计算机科学
频域
对象(语法)
目标检测
计算机视觉
人工智能
领域(数学分析)
遥感
模式识别(心理学)
地质学
数据挖掘
数学
数学分析
经济
财务
作者
Lei Hu,Jiwen Yuan,Bailiang Cheng,Qizhi Xu
标识
DOI:10.1109/tgrs.2025.3601828
摘要
Small object detection in UAV images is one of the critical aspects for its widespread application. However, due to limited feature extraction for small object and complex backgrounds, there remain significant issues of missed detections and false alarms. This paper proposes a real-time small object detection network for UAV images based on cross spatial frequency domain and position relation (CSFPR-RTDETR). First, we propose a cross spatial frequency domain hybrid (CSFH) feature extraction network, which incorporates frequency domain processing based on the CSP network to effectively capture global contextual features and enhance the distinction between small objects and backgrounds. Second, we propose a position relation decoder that incorporates the two novel geometric priors: IoU and relative angle. Through rational characterization of spatial correlations, this design significantly strengthens the spatial perception capability of model, thereby improving the detection performance for densely distributed small objects. Finally, we design an efficient small-object high-frequency hybrid encoder, integrating the P2 detection head and proposing a mixed high-frequency enhancement fusion module (MHE-Fusion) to extract fine-grained high-frequency features of small objects, further boosting detection performance. Experimental results demonstrate that CSFPR-RTDETR achieves superior performance on the VisDrone, AI-TOD, and HIT-UAV datasets, with mAP50 metrics reaching 42.3%, 55.4% and 83.1% respectively, which is better than other SOTA models. Compared to RT-DETR, CSFPR-RTDETR reduces the parameters of the network by 29.1% while significantly enhancing detection performance: the mAP50 metrics reach notable improvements of 4.6%, 4.4%, and 1.5% on the three datasets, respectively. The source code is available at https://github.com/HuLei-JXNU/CSFPR-RTDETR.
科研通智能强力驱动
Strongly Powered by AbleSci AI