The impact of introducing textual semantics on item instance retrieval with highly similar appearance: An empirical study

图像检索 相似性(几何) 计算机科学 语义学(计算机科学) 特征(语言学) 人工智能 维数(图论) 图像(数学) 情报检索 模式识别(心理学) 特征向量 空格(标点符号) 数学 操作系统 哲学 语言学 程序设计语言 纯数学
作者
Bo Li,Jiang Zhu,Lingyun Dai,Hui Jing,Zhizheng Huang
出处
期刊:Image and Vision Computing [Elsevier BV]
卷期号:143: 104925-104925
标识
DOI:10.1016/j.imavis.2024.104925
摘要

Feature representation plays an important role in image instance retrieval (IIR). In practical applications, we find that items of different categories but highly similar in appearance are easy to become the objects of incorrect retrieval. We analyze that extracting features from the appearance dimension alone may cause objects with similar appearance to have smaller similar distances in feature space. But the appearance is not the only factor that determines whether the item is the same, and the difference in the shooting angle will also amplify the appearance difference of the same item in the image. In this paper, through detailed empirical study, we verify a conjecture that by introducing text semantics and fusing it with appearance features, the similarity distance of falsely retrieved objects in feature space can be corrected, thus improving the retrieval effectiveness of image instance retrieval tasks in highly similar appearance data. We introduce textual semantics for image instances based on the image-text cross-modal model. Specifically, we enhance the proportion of appearance similar items based on three open-source datasets (Products-10 k, RP2k and Stanford products) of item instances, and add multi-angle image samples of the same item to enlarge the difference of the same item. Subsequently, we have embarked on baseline experiments for appearance features and textual features from the perspectives of shooting angle similarity and visual character similarity, to explore the advantages of multiple strategies for fusing textual semantics with appearance features. Then, we examine the effect of our method on fine-grained item instance retrieval methods with state-of-the-art. Resultantly, taking mean Average Precision (mAP) as the quantitative metric and averaging experimental results, our method has an obvious improvement over the appearance and textual baselines, where the improvement of appearance feature baselines is generally more obvious than that of textual feature baselines (e.g., in our expanded RP2k dataset, from the perspective of shooting angle similarity, the mAP of the appearance feature baseline is nearly 19.62, the textual feature baseline is 32.45, our method is 43.19. From perspective of visual character similarity, the values are 27.14, 43.59, 54.76, respectively). Moreover, our methods outperform the state-of-the-art fine-grained item instance retrieval methods with improvements of nearly 13.05% and 22.49% on expanded Products-10 k and RP2k, respectively.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
自然的泽浩完成签到 ,获得积分10
2秒前
3秒前
3秒前
4秒前
所所应助窗外的天气采纳,获得10
4秒前
5秒前
5秒前
体贴鹰发布了新的文献求助10
5秒前
5秒前
吵闹完成签到,获得积分10
7秒前
无限的芝士完成签到 ,获得积分10
7秒前
7秒前
晴雨发布了新的文献求助10
9秒前
火星上无春完成签到 ,获得积分10
9秒前
GAOjiale发布了新的文献求助10
9秒前
qianfengming完成签到,获得积分10
9秒前
怡然的怜烟应助kiyo_v采纳,获得10
9秒前
Lucas应助福福福福福采纳,获得10
10秒前
adi发布了新的文献求助10
11秒前
缓慢的秋莲完成签到,获得积分10
11秒前
JamesPei应助sum42采纳,获得10
12秒前
12秒前
13秒前
莎莎发布了新的文献求助10
13秒前
只想发SCI完成签到,获得积分10
15秒前
ccccc发布了新的文献求助10
16秒前
成就的安阳完成签到,获得积分10
18秒前
ffff发布了新的文献求助10
19秒前
19秒前
灏蝻完成签到,获得积分10
19秒前
19秒前
赘婿应助yyyy采纳,获得10
20秒前
20秒前
天天快乐应助是人采纳,获得10
20秒前
22秒前
受伤的小松鼠应助kiyo_v采纳,获得10
22秒前
nuclear1002应助haohao采纳,获得10
22秒前
22秒前
英姑应助wj采纳,获得10
23秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Pulse width control of a 3-phase inverter with non sinusoidal phase voltages 777
Signals, Systems, and Signal Processing 610
Research Methods for Applied Linguistics: A Practical Guide 600
Research Methods for Applied Linguistics 500
Chemistry and Physics of Carbon Volume 15 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6406859
求助须知:如何正确求助?哪些是违规求助? 8226035
关于积分的说明 17445340
捐赠科研通 5459574
什么是DOI,文献DOI怎么找? 2884893
邀请新用户注册赠送积分活动 1861329
关于科研通互助平台的介绍 1701779