TFDet: Target-Aware Fusion for RGB-T Pedestrian Detection

行人检测 行人 人工智能 计算机视觉 计算机科学 融合 RGB颜色模型 模式识别(心理学) 工程类 运输工程 语言学 哲学
作者
Xue Zhang,Xiaohan Zhang,Jiangtao Wang,Jiacheng Ying,Zehua Sheng,Heng Yu,Chunguang Li,Hui‐Liang Shen
出处
期刊:IEEE transactions on neural networks and learning systems [Institute of Electrical and Electronics Engineers]
卷期号:36 (7): 13276-13290 被引量:36
标识
DOI:10.1109/tnnls.2024.3443455
摘要

Pedestrian detection plays a critical role in computer vision as it contributes to ensuring traffic safety. Existing methods that rely solely on RGB images suffer from performance degradation under low-light conditions due to the lack of useful information. To address this issue, recent multispectral detection approaches have combined thermal images to provide complementary information and have obtained enhanced performances. Nevertheless, few approaches focus on the negative effects of false positives (FPs) caused by noisy fused feature maps. Different from them, we comprehensively analyze the impacts of FPs on detection performance and find that enhancing feature contrast can significantly reduce these FPs. In this article, we propose a novel target-aware fusion strategy for multispectral pedestrian detection, named TFDet. The target-aware fusion strategy employs a fusion-refinement paradigm. In the fusion phase, we reveal the parallel- and cross-channel similarities in RGB and thermal features and learn an adaptive receptive field to collect useful information from both features. In the refinement phase, we use a segmentation branch to discriminate the pedestrian features from the background features. We propose a correlation-maximum loss function to enhance the contrast between the pedestrian features and background features. As a result, our fusion strategy highlights pedestrian-related features and suppresses unrelated ones, generating more discriminative fused features. TFDet achieves state-of-the-art performance on two multispectral pedestrian benchmarks, KAIST and LLVIP, with absolute gains of 0.65% and 4.1% over the previous best approaches, respectively. TFDet can easily extend to multiclass object detection scenarios. It outperforms the previous best approaches on two multispectral object detection benchmarks, FLIR and M3FD, with absolute gains of 2.2% and 1.9%, respectively. Importantly, TFDet has comparable inference efficiency to the previous approaches and has remarkably good detection performance even under low-light conditions, which is a significant advancement for ensuring road safety. The code will be made publicly available at https://github.com/XueZ-phd/TFDet.git.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
cdercder应助fzd采纳,获得10
刚刚
wxinli应助超级小凝采纳,获得10
刚刚
1秒前
molihuakai应助sss采纳,获得10
1秒前
汉堡包应助姜jiang采纳,获得10
1秒前
大个应助多情的兔子采纳,获得10
2秒前
磊磊磊发布了新的文献求助10
2秒前
mulberry完成签到,获得积分10
2秒前
3秒前
3秒前
3秒前
完美世界应助zzz采纳,获得10
3秒前
NexusExplorer应助郝剑身采纳,获得10
4秒前
田様应助小包子采纳,获得10
4秒前
浮游应助liyiliyi117采纳,获得10
5秒前
5秒前
6秒前
6秒前
田様应助duhdhd采纳,获得10
7秒前
无花果应助一郭红烧肉采纳,获得30
7秒前
JohnLocke发布了新的文献求助10
7秒前
8秒前
8秒前
结实冰蓝完成签到,获得积分10
9秒前
脑洞疼应助Astralys采纳,获得10
9秒前
Fairy完成签到,获得积分10
9秒前
SciGPT应助无限丹珍采纳,获得10
9秒前
英吉利25发布了新的文献求助10
9秒前
ysf发布了新的文献求助10
10秒前
cdercder应助Yangaaa采纳,获得10
10秒前
蒋谷兰完成签到,获得积分10
10秒前
布丁发布了新的文献求助10
10秒前
11秒前
11秒前
11秒前
11秒前
12秒前
12秒前
聪聪冲冲发布了新的文献求助10
13秒前
13秒前
高分求助中
Annie Ernaux: De la perte au corps glorieux 600
类器官构建与应用:从基础到前沿 500
Petrology and Plate Tectonics,2025 500
Optical Coating Design with the Essential Macleod 400
A revision of Limenitis helmanni and its related species (Nymphalidae) from Central and South China 400
Moore's Clinically Oriented Anatomy 10th Edition 400
Direct and Iterative Linear System Solvers 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6789202
求助须知:如何正确求助?哪些是违规求助? 8510600
关于积分的说明 18124207
捐赠科研通 6098230
什么是DOI,文献DOI怎么找? 3021608
邀请新用户注册赠送积分活动 1998386
关于科研通互助平台的介绍 1986608