已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

Image-Specific Information Suppression and Implicit Local Alignment for Text-Based Person Search

计算机科学 特征(语言学) 模态(人机交互) 模式 图像(数学) 人工智能 集合(抽象数据类型) 情报检索 模式识别(心理学) 计算机视觉 社会科学 语言学 哲学 社会学 程序设计语言
作者
Shuanglin Yan,Hao Tang,Liyan Zhang,Jinhui Tang
出处
期刊:IEEE transactions on neural networks and learning systems [Institute of Electrical and Electronics Engineers]
卷期号:35 (12): 17973-17986 被引量:93
标识
DOI:10.1109/tnnls.2023.3310118
摘要

Text-based person search (TBPS) is a challenging task that aims to search pedestrian images with the same identity from an image gallery given a query text. In recent years, TBPS has made remarkable progress, and state-of-the-art (SOTA) methods achieve superior performance by learning local fine-grained correspondence between images and texts. However, most existing methods rely on explicitly generated local parts to model fine-grained correspondence between modalities, which is unreliable due to the lack of contextual information or the potential introduction of noise. Moreover, the existing methods seldom consider the information inequality problem between modalities caused by image-specific information. To address these limitations, we propose an efficient joint multilevel alignment network (MANet) for TBPS, which can learn aligned image/text feature representations between modalities at multiple levels, and realize fast and effective person search. Specifically, we first design an image-specific information suppression (ISS) module, which suppresses image background and environmental factors by relation-guided localization (RGL) and channel attention filtration (CAF), respectively. This module effectively alleviates the information inequality problem and realizes the alignment of information volume between images and texts. Second, we propose an implicit local alignment (ILA) module to adaptively aggregate all pixel/word features of image/text to a set of modality-shared semantic topic centers and implicitly learn the local fine-grained correspondence between modalities without additional supervision and cross-modal interactions. Also, a global alignment (GA) is introduced as a supplement to the local perspective. The cooperation of global and local alignment modules enables better semantic alignment between modalities. Extensive experiments on multiple databases demonstrate the effectiveness and superiority of our MANet.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
雨后完成签到 ,获得积分10
2秒前
王者归来完成签到,获得积分10
5秒前
7秒前
Ksoso完成签到,获得积分10
8秒前
所所应助caixk采纳,获得10
12秒前
sittingduck完成签到,获得积分10
14秒前
weihe完成签到,获得积分10
14秒前
16秒前
leungzzz完成签到,获得积分10
19秒前
21秒前
刘医生发布了新的文献求助10
22秒前
JamesPei应助泷生采纳,获得10
24秒前
赘婿应助lazycath03采纳,获得10
27秒前
caixk发布了新的文献求助10
28秒前
35秒前
指已成殇完成签到,获得积分10
38秒前
39秒前
哎小伙子完成签到,获得积分10
42秒前
科研小白_菜完成签到 ,获得积分10
42秒前
LJC完成签到,获得积分10
43秒前
笨笨听双完成签到 ,获得积分10
43秒前
啦啦啦啦啦完成签到 ,获得积分10
44秒前
哎小伙子发布了新的文献求助10
45秒前
刘鑫慧发布了新的文献求助10
45秒前
lazycath03完成签到,获得积分10
47秒前
泷生完成签到,获得积分10
47秒前
max完成签到,获得积分10
47秒前
Wsssss完成签到,获得积分10
49秒前
49秒前
49秒前
50秒前
Suo完成签到,获得积分10
51秒前
长度2到完成签到,获得积分10
52秒前
Ksoso关注了科研通微信公众号
52秒前
53秒前
JingyuanZeng发布了新的文献求助10
54秒前
耍酷鼠标完成签到 ,获得积分0
55秒前
eclo完成签到 ,获得积分10
55秒前
泷生发布了新的文献求助10
55秒前
泷生发布了新的文献求助10
55秒前
高分求助中
Clinical Epidemiology: The Essentials, 6e 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Graphene Handbook (2019 Edition) 800
Adhesion Science: Principles & Practice 800
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
The Immune System (Fifth Edition) 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6570122
求助须知:如何正确求助?哪些是违规求助? 8349029
关于积分的说明 17886831
捐赠科研通 5698810
什么是DOI,文献DOI怎么找? 2944692
邀请新用户注册赠送积分活动 1920589
关于科研通互助平台的介绍 1797715