计算机科学
棱锥(几何)
人工智能
目标检测
任务(项目管理)
特征(语言学)
计算机视觉
比例(比率)
实时计算
机器学习
模式识别(心理学)
语言学
量子力学
光学
物理
哲学
经济
管理
作者
Mian Pan,Weijie Xia,Haibin Yu,Xinzhi Hu,Wenyu Cai,Jianguang Shi
出处
期刊:Remote Sensing
[Multidisciplinary Digital Publishing Institute]
日期:2023-12-12
卷期号:15 (24): 5698-5698
被引量:5
摘要
Vehicle detection based on unmanned aerial vehicle (UAV) aerial images plays a significant role in areas such as traffic monitoring and management, disaster relief, and more, garnering extensive attention from researchers in recent years. However, datasets acquired from UAV platforms inevitably suffer from issues such as imbalanced class distribution, severe background interference, numerous small objects, and significant target scale variance, presenting substantial challenges to practical vehicle detection applications based on this platform. Addressing these challenges, this paper proposes an object detection model grounded in a background suppression pyramid network and multi-scale task adaptive decoupled head. Firstly, the model implements a long-tail feature resampling algorithm (LFRA) to solve the problem of imbalanced class distribution in the dataset. Next, a background suppression pyramid network (BSPN) is integrated into the Neck segment of the model. This network not only reduces the interference of redundant background information but also skillfully extracts features of small target vehicles, enhancing the ability of the model to detect small objects. Lastly, a multi-scale task adaptive decoupled head (MTAD) with varied receptive fields is introduced, enhancing detection accuracy by leveraging multi-scale features and adaptively generating relevant features for classification and detection. Experimental results indicate that the proposed model achieves state-of-the-art performance on lightweight object detection networks. Compared to the baseline model PP-YOLOE-s, our model improves the AP50:95 on the VisDrone-Vehicle dataset by 1.9%.
科研通智能强力驱动
Strongly Powered by AbleSci AI