行人检测
计算机科学
红外线的
人工智能
网(多面体)
计算机视觉
图像融合
行人
融合
图像(数学)
光学
物理
数学
工程类
语言学
哲学
几何学
运输工程
作者
Xinyan Xiong,Jie Yang,Yanxuan Jiang,Zhichao Chen,Zhicheng Feng,Fan Li
标识
DOI:10.1088/1361-6501/adcf41
摘要
Abstract Pedestrian detection is essential in computer vision for enhancing traffic safety. Traditional pedestrian detection methods often struggle with target distortion and detail blurring in complex environments, leading to missed detections and false positives. We propose the Dual-Stream Convolutional Channel Network (DMSC-Net), which leverages a CMFENet backbone integrated with a Squeeze-and-Excitation Block (SEB) to effectively fuse visible and infrared images. In the neck, the Cross-Modal Multiscale Fusion Module (CMFM) is employed to enhance multiscale feature representation. This design effectively addresses challenges such as occlusion, overlap, and the detection of small edge objects. We implement extensive experiments on the public LLVIP and M3FD datasets, and compared with previous methods, our method achieves better performance. On the LLVIP dataset, the proposed algorithm achieves an mAP@50 of 98.3%, with reductions in parameters and computational costs of 51.5% and 35.6%, respectively, compared to the baseline model YOLOv8n. This algorithm improves pedestrian detection under adverse conditions while its lightweight design supports practical deployment.
科研通智能强力驱动
Strongly Powered by AbleSci AI