计算机科学
秩(图论)
任务(项目管理)
图像(数学)
差异(会计)
过程(计算)
特征(语言学)
学习排名
人工智能
匹配(统计)
数据挖掘
图像检索
模式识别(心理学)
机器学习
排名(信息检索)
语言学
哲学
业务
统计
数学
管理
会计
组合数学
经济
操作系统
作者
Chenyang Zhang,Jun Feng,Wang Jiaqing
标识
DOI:10.1109/icceai55464.2022.00091
摘要
Text-based person search (TBPS) is a task aiming to retrieve a target person image that most meet the given text description from a large image gallery. However, due to the huge modality gap, intra-class variance and limited data, it is very challenging to solve this fine-grained cross-modal retrieval task. Therefore, this paper proposes a method based on semantically self-aligned network (SSAN) and re-rank post-processing to solve above challenges. Firstly, in the training process, this paper adopts a more effective data augmentation strategy to solve the problem of limited data. Secondly, different from most TBPS tasks, this paper uses ResNet101 as the feature extractor of image branch to extract the global and fine-grained features of image more effectively. Thirdly, this paper adds a re-rank post-processing to improve the performance of person search. Extensive experiments demonstrate that the method proposed in this paper achieves a competitive performance in CUHK-PEDES dataset, and has been significantly improved over the previous models in Rank-1, Rank-5, Rank-10 and mAP.
科研通智能强力驱动
Strongly Powered by AbleSci AI