Pareto Refocusing for Drone-View Object Detection

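Computer science; Object detection; Background (archaeology); Artificial intelligence; Bottleneck; Drone; Pareto principle; Task (project management); Salient; Object (grammar); Computer vision; Machine learning; Spatial contextual awareness; Pattern recognition (psychology); Geography; Mathematics; Engineering; Systems engineering; Archaeology; Embedded system; Mathematical optimization; Biology; Genetics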

Authors

Jiaxu Leng, Mengjingcheng Mo, Yinghua Zhou, Chenqiang Gao, Weisheng Li, Xinbo Gao

Source

Journal: IEEE Transactions on Circuits and Systems for Video Technology [Institute of Electrical and Electronics Engineers]
Date: 2022-09-28; Volume (issue): 33 (3); Pages: 1320-1334; Citations: 39

Identifiers

DOI: 10.1109/tcsvt.2022.3210207

Abstract

Drone-view Object Detection (DOD) is a meaningful but challenging task. It hits a bottleneck due to two main reasons: (1) The high proportion of difficult objects (e.g., small objects, occluded objects, etc.) makes the detection performance unsatisfactory. (2) The unevenly distributed objects make detection inefficient. These two factors also lead to a phenomenon, obeying the Pareto principle, that some challenging regions occupying a low area proportion of the image have a significant impact on the final detection while the vanilla regions occupying the major area have a negligible impact due to the limited room for performance improvement. Motivated by the human visual system that naturally attempts to invest unequal energies in things of hierarchical difficulty for recognizing objects effectively, this paper presents a novel Pareto Refocusing Detection (PRDet) network that distinguishes the challenging regions from the vanilla regions under reverse-attention guidance and refocuses the challenging regions with the assistance of the region-specific context. Specifically, we first propose a Reverse-attention Exploration Module (REM) that excavates the potential position of difficult objects by suppressing the features which are salient to the commonly used detector. Then, we propose a Region-specific Context Learning Module (RCLM) that learns to generate specific contexts for strengthening the understanding of challenging regions. It is noteworthy that the specific context is not shared globally but unique for each challenging region with the exploration of spatial and appearance cues. Extensive experiments and comprehensive evaluations on the VisDrone2021-DET and UAVDT datasets demonstrate that the proposed PRDet can effectively improve the detection performance, especially for those difficult objects, outperforming state-of-the-art detectors. Furthermore, our method also achieves significant performance improvements on the DTU-Drone dataset for power inspection.
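
The abstract says REM locates difficult objects by suppressing the features that are salient to a commonly used detector. It gives no equations, so what follows is only a minimal sketch of a generic reverse-attention weighting (weight = 1 - sigmoid(saliency)), not the PRDet implementation. The function names, the single-channel grid representation, and the 0.5 threshold are all hypothetical choices made for illustration.

```python
import math

def reverse_attention(saliency_logits):
    """Turn per-cell saliency logits into reverse-attention weights in [0, 1].

    Assumption: a high logit means the base detector already responds
    strongly there, so the reverse weight (1 - sigmoid) is close to 0.
    """
    return [[1.0 - 1.0 / (1.0 + math.exp(-z)) for z in row]
            for row in saliency_logits]

def suppress_salient(feature_map, weights):
    """Scale a single-channel feature map cell by cell with the reverse weights."""
    return [[f * w for f, w in zip(f_row, w_row)]
            for f_row, w_row in zip(feature_map, weights)]

def challenging_cells(weights, threshold=0.5):
    """Return (row, col) cells whose reverse weight exceeds a hypothetical threshold."""
    return [(r, c)
            for r, row in enumerate(weights)
            for c, w in enumerate(row)
            if w > threshold]

# Toy 2x2 grid: one confidently detected cell, one missed cell, two ambiguous cells.
logits = [[10.0, -10.0],
          [0.0, 0.0]]
weights = reverse_attention(logits)
features = suppress_salient([[1.0, 1.0], [1.0, 1.0]], weights)
candidates = challenging_cells(weights)
```

In this toy case only the cell with the strongly negative logit is flagged as a candidate challenging region. A real detector would apply such weights to multi-channel feature tensors rather than nested lists.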
