变压器
计算机科学
行人
人工智能
计算机视觉
姿势
特征提取
模式识别(心理学)
工程类
电压
运输工程
电气工程
作者
Yusong Qin,Pu Huang,Yao Zhou,Zengxi Huang
标识
DOI:10.1109/prai59366.2023.10331936
摘要
Person re-identification(Re-ID) is a crucial task in computer vision, which aims to match pedestrian images captured in non-overlapping camera views. It has significant implications for public safety applications. However, current methods primarily rely on global features and thus are vulnerable to background clutters and occlusion. To address these issues, this paper proposes a part-based Re-ID method using the Swin Transformer. In this method, we first utilize human pose estimation method to detect the body keypoints of a pedestrian. We then extract local windows and patches centered at these keypoints, so as to minimize the involvement of background and occlusion information in the down-stream re-identification. These body parts in pose guided windows are fed to the Swin Transformer network. The hierarchical structure of Swin Transformer also enables the proposed method more robust to multi-scale problems. Our experimental results on both holistic and occluded datasets, i.e., Market-1501, DukeMTMC-reID, and Occluded-Duke, demonstrate that the proposed method is on par with the state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI