计算机科学
目标检测
块(置换群论)
骨干网
人工智能
实时计算
特征提取
数据挖掘
变压器
卷积神经网络
模式识别(心理学)
计算机网络
工程类
电压
电气工程
几何学
数学
作者
Y B Liu,Miao He,Bin Hui
出处
期刊:Drones
[Multidisciplinary Digital Publishing Institute]
日期:2025-02-14
卷期号:9 (2): 143-143
被引量:1
标识
DOI:10.3390/drones9020143
摘要
Object detection is a fundamental capability that enables drones to perform various tasks. However, achieving a suitable equilibrium between performance, efficiency, and lightweight design continues to be a significant challenge for current algorithms. To address this issue, we propose an enhanced small object detection transformer model called ESO-DETR. First, we present a gated single-head attention backbone block, known as the GSHA block, which enhances the extraction of local details. Besides, ESO-DETR utilizes the multiscale multihead self-attention mechanism (MMSA) to efficiently manage complex features within its backbone network. We also introduce a novel and efficient feature fusion pyramid network for enhanced small object detection, termed ESO-FPN. This network integrates large convolutional kernels with dual-domain attention mechanisms. Lastly, we introduce the EMASlideVariFocal loss (ESVF Loss), which dynamically adjusts the weights to improve the model’s focus on more challenging samples. In comparison with the baseline model, ESO-DETR demonstrates enhancements of 3.9% and 4.0% in the mAP50 metric on the VisDrone and HIT-UAV datasets, respectively, while also reducing parameters by 25%. These results highlight the capability of ESO-DETR to improve detection accuracy while maintaining a lightweight and efficient structure.
科研通智能强力驱动
Strongly Powered by AbleSci AI