稳健性(进化)
计算机科学
人工智能
卷积神经网络
频道(广播)
光学(聚焦)
模式识别(心理学)
鉴定(生物学)
构造(python库)
对偶(语法数字)
图像(数学)
计算机视觉
艺术
计算机网络
生物化学
化学
物理
植物
文学类
生物
光学
基因
程序设计语言
作者
Baoguang Qi,Yi Chen,Qiang Liu,Xiaohai He,Linbo Qing,Ray E. Sheriff,Honggang Chen
标识
DOI:10.1109/tim.2023.3269117
摘要
Multiple people can have similar appearances and portions of images can be occluded or have viewpoint changes in real scenarios, causing the increased difficulty of person reidentification (Re-ID).To address these problems, we propose a dual-channel person Re-ID algorithm that integrates person Re-ID image-text pairs into the classification network for end-toend learning.We construct an image channel and a text channel, and subsequently extract visual information and text information using a convolutional neural network and a simple recurrent units network, respectively.The text information is used to assist in the learning of visual information, consequently improving the robustness of the visual information.In addition, the visual features are divided into two branches to calculate the global and local features.Global features focus on the overall appearance of a person, whereas local features provide more fine-grained detail.Text information is more accurate and reliable, and it is thus more robust to occlusion and viewpoint changes.Visual information complemented by text information can describe a person more accurately and reliably.Extensive experiments demonstrate our method achieves state-of-the-art performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI