计算机科学
水准点(测量)
目标检测
稳健性(进化)
人工智能
最小边界框
标杆管理
计算机视觉
跳跃式监视
视频跟踪
机器学习
模式识别(心理学)
对象(语法)
图像(数学)
地理
生物化学
化学
大地测量学
营销
业务
基因
作者
Xinyi Ying,Chao Xiao,Wei An,Ruojing Li,Xu He,Boyang Li,Xu Cao,Zhaoxu Li,Yingqian Wang,Mingyuan Hu,Qingyu Xu,Zaiping Lin,Miao Li,Shilin Zhou,Weidong Sheng,Li Liu
标识
DOI:10.1109/tpami.2025.3544621
摘要
Visible-thermal small object detection (RGBT SOD) is a significant yet challenging task with a wide range of applications, including video surveillance, traffic monitoring, search and rescue. However, existing studies mainly focus on either visible or thermal modality, while RGBT SOD is rarely explored. Although some RGBT datasets have been developed, the insufficient quantity, limited diversity, unitary application, misaligned images and large target size cannot provide an impartial benchmark to evaluate RGBT SOD algorithms. In this paper, we build the first large-scale benchmark with high diversity for RGBT SOD (namely RGBT-Tiny), including 115 paired sequences, 93 K frames and 1.2 M manual annotations. RGBT-Tiny contains abundant objects (7 categories) and high-diversity scenes (8 types that cover different illumination and density variations). Note that, over 81% of objects are smaller than 16×16, and we provide paired bounding box annotations with tracking ID to offer an extremely challenging benchmark with wide-range applications, such as RGBT image fusion, object detection and tracking. In addition, we propose a scale adaptive fitness (SAFit) measure that exhibits high robustness on both small and large objects. The proposed SAFit can provide reasonable performance evaluation and promote detection performance. Based on the proposed RGBT-Tiny dataset, extensive evaluations have been conducted with IoU and SAFit metrics, including 32 recent state-of-the-art algorithms that cover four different types (i.e., visible generic detection, visible SOD, thermal SOD and RGBT object detection). Project is available at https://github.com/XinyiYing/RGBT-Tiny.
科研通智能强力驱动
Strongly Powered by AbleSci AI