计算机科学
人工智能
背景(考古学)
模式识别(心理学)
视觉对象识别的认知神经科学
卷积(计算机科学)
保险丝(电气)
判别式
情态动词
计算机视觉
对象(语法)
人工神经网络
工程类
古生物学
电气工程
化学
高分子化学
生物
标识
DOI:10.1109/cisp-bmei53629.2021.9624318
摘要
In recent years, object recognition has received more and more attention, but how to effectively overcome the visual ambiguity problem in the bad environment such as low illumination is still a very challenging problem. On the other hand, we consider the context information of the object to provide extra discriminative information. Therefore, multi-modal recognition based on the strong correlation between different parts of the target has the potential to solve the above problems. In this work, we propose a novel infrared-visible complementary recognition network (IV-CRN) to solve these problems. The IV-CRN includes a fusion branch and a global branch. The fusion branch uses a brightness-based weighted fusion algorithm to fuse the complementary information in the bimodal state and the convolution gated recurrent unit (ConvGRU) to extract target context information. The global branch extracts global information and complements the fragmented information of the fusion branch. We conducted comparative experiments and ablation experiments on two challenging datasets. The experimental results show that our method has good performance compared with the existing multi-mode object recognition methods. Compared with the method based on single modal image, this method also significantly improves the object accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI