行人检测
计算机科学
人工智能
特征(语言学)
情态动词
比例(比率)
行人
模式识别(心理学)
计算机视觉
特征提取
工程类
地理
地图学
哲学
运输工程
化学
高分子化学
语言学
作者
Shuai Hao,Jiahao Li,Xizi Sun,Xu Ma,Beiyi An,Tian He
标识
DOI:10.1109/tits.2024.3483892
摘要
To address the problem of traditional pedestrian detection methods being subject to random interference from the external environment and insufficient utilization of pedestrian feature information, a novel multi-modal pedestrian detection network called MDFOaNet is proposed. The proposed detection network consists of two key components: a marine predator-based multi-scale image fusion module and a pedestrian detection module with an enhanced target visual accuracy attention model. In the fusion module, a contrast-based layered enhancement method for infrared images and a sharpness-based enhancement method for visible images are proposed for the problem of blurred pedestrian features in images. Moreover, to control the trade-off between fusion sub-layers, a dynamic image reconstruction model that relies on adaptive optimization based on marine predators is designed. Meanwhile, in the pedestrian detection module, to pay more attention to the main information in image and ignore some irrelevant information, an EVAM attention model is designed under the framework of YOLOv5s detection network, which improves the saliency of pedestrian targets and suppress the background interference. The experimental results show that compared with nine typical algorithms, the proposed algorithm can achieve accurate detection of multi-scale targets in complex environments, and is significantly superior to the compared detection algorithms in both subjective and objective evaluation indicators. The mAP and recall rates of the proposed network can reach 88.9% and 87.5%, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI