行人检测
多光谱图像
计算机科学
人工智能
计算机视觉
探测器
模态(人机交互)
编码(集合论)
行人
领域(数学分析)
图像(数学)
模式识别(心理学)
数学
地理
数学分析
考古
电信
集合(抽象数据类型)
程序设计语言
作者
Jiwon Kim,Hyeongjun Kim,Taejoo Kim,Namil Kim,Yukyung Choi
出处
期刊:IEEE robotics and automation letters
日期:2021-07-26
卷期号:6 (4): 7846-7853
被引量:91
标识
DOI:10.1109/lra.2021.3099870
摘要
Multispectral pedestrian detection has been actively studied as a promising multi-modality solution to handle illumination and weather changes. Most multi-modality approaches carry the assumption that all inputs are fully-overlapped. However, these kinds of data pairs are not common in practical applications due to the complexity of the existing sensor configuration. In this letter, we tackle multispectral pedestrian detection, where all input data are not paired. To this end, we propose a novel single-stage detection framework that leverages multi-label learning to learn input state-aware features by assigning a separate label according to the given state of the input image pair. We also present a novel augmentation strategy by applying geometric transformations to synthesize the unpaired multispectral images. In extensive experiments, we demonstrate the efficacy of the proposed method under various real-world conditions, such as fully-overlapped images and partially-overlapped images, in stereo-vision. Code and a demonstration video are available at https://github.com/sejong-rcv/MLPD-Multi-Label-Pedestrian-Detection.
科研通智能强力驱动
Strongly Powered by AbleSci AI