计算机科学
目标检测
无人机
规范化(社会学)
失败
人工智能
背景(考古学)
卷积(计算机科学)
推论
延迟(音频)
计算机视觉
模式识别(心理学)
并行计算
生物
古生物学
社会学
电信
遗传学
人工神经网络
人类学
作者
Bowei Du,Yecheng Huang,Jiaxin Chen,Di Huang
标识
DOI:10.1109/cvpr52729.2023.01291
摘要
Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform. This paper investigates optimizing the detection head based on the sparse convolution, which proves effective in balancing the accuracy and efficiency. Nevertheless, it suffers from inadequate integration of contextual information of tiny objects as well as clumsy control of the mask ratio in the presence of foreground with varying scales. To address the issues above, we propose a novel global context-enhanced adaptive sparse convolutional network (CEASC). It first develops a context-enhanced group normalization (CE-GN) layer, by replacing the statistics based on sparsely sampled features with the global contextual ones, and then designs an adaptive multi-layer masking strategy to generate optimal mask ratios at distinct scales for compact foreground coverage, promoting both the accuracy and efficiency. Extensive experimental results on two major benchmarks, i.e. VisDrone and UAVDT, demonstrate that CEASC remarkably reduces the GFLOPs and accelerates the inference procedure when plugging into the typical state-of-the-art detection frameworks (e.g. RetinaNet and GFL V1) with competitive performance. Code is available at https://github.com/Cuogeihong/CEASC.
科研通智能强力驱动
Strongly Powered by AbleSci AI