The impact of introducing textual semantics on item instance retrieval with highly similar appearance: An empirical study

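Image retrieval; Similarity (geometry); Computer science; Semantics (computer science); Feature (linguistics); Artificial intelligence; Dimension (graph theory); Image (mathematics); Information retrieval; Pattern recognition (psychology); Feature vector; Space (punctuation); Mathematics; Programming language; Linguistics; Philosophy; Pure mathematics; Operating system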
Authors
Bo Li,Jiang Zhu,Lingyun Dai,Hui Jing,Zhizheng Huang
Source
Journal: Image and Vision Computing [Elsevier]
Volume/Issue: 143: 104925-104925
Identifier
DOI:10.1016/j.imavis.2024.104925
Abstract

Feature representation plays an important role in image instance retrieval (IIR). In practical applications, we find that items from different categories with highly similar appearance are often retrieved incorrectly. Our analysis suggests that extracting features from the appearance dimension alone can place objects with similar appearance close together in feature space. However, appearance is not the only factor that determines whether two items are the same, and differences in shooting angle can amplify the appearance differences between images of the same item. In this paper, through a detailed empirical study, we verify the conjecture that introducing textual semantics and fusing them with appearance features can correct the similarity distance of falsely retrieved objects in feature space, thereby improving retrieval effectiveness on data with highly similar appearance. We introduce textual semantics for image instances using an image-text cross-modal model. Specifically, we increase the proportion of items with similar appearance in three open-source item instance datasets (Products-10k, RP2k and Stanford products), and add multi-angle image samples of the same item to enlarge the differences within the same item. We then run baseline experiments for appearance features and textual features from the perspectives of shooting angle similarity and visual character similarity, and explore the advantages of multiple strategies for fusing textual semantics with appearance features. Finally, we examine the effect of our method on state-of-the-art fine-grained item instance retrieval methods.
As a result, taking mean Average Precision (mAP) as the quantitative metric and averaging the experimental results, our method clearly improves on both the appearance and textual baselines, and the improvement over the appearance feature baselines is generally larger than that over the textual feature baselines. For example, on our expanded RP2k dataset, from the perspective of shooting angle similarity, the mAP of the appearance feature baseline is nearly 19.62, that of the textual feature baseline is 32.45, and that of our method is 43.19; from the perspective of visual character similarity, the values are 27.14, 43.59 and 54.76, respectively. Moreover, our method outperforms the state-of-the-art fine-grained item instance retrieval methods, with improvements of nearly 13.05% and 22.49% on the expanded Products-10k and RP2k datasets, respectively.
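
The abstract does not specify which fusion strategies were tested, so the following is only a minimal sketch of one plausible option: weighted concatenation of L2-normalized appearance and text embeddings, evaluated with mAP on a toy gallery. The function names, the weight `alpha`, and all embedding values are hypothetical and chosen for illustration; in practice the vectors would come from a visual backbone and an image-text cross-modal model.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)

def fuse(appearance, text, alpha=0.5):
    """Weighted concatenation of unit-length appearance and text embeddings.

    Assumed strategy, not necessarily the paper's. With sqrt weights and
    nonzero inputs, the cosine similarity of two fused vectors equals
    alpha * cos(appearance) + (1 - alpha) * cos(text).
    """
    a = [math.sqrt(alpha) * x for x in l2_normalize(appearance)]
    t = [math.sqrt(1.0 - alpha) * x for x in l2_normalize(text)]
    return a + t

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(y * y for y in v))
    return dot / (nu * nv) if nu > 0 and nv > 0 else 0.0

def average_precision(ranked_labels, query_label):
    """Mean of precision@k over the ranks k where a correct item appears."""
    hits, precision_sum = 0, 0.0
    for rank, label in enumerate(ranked_labels, start=1):
        if label == query_label:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0

def mean_average_precision(queries, gallery):
    """queries and gallery are lists of (vector, label) pairs."""
    aps = []
    for q_vec, q_label in queries:
        ranked = sorted(gallery, key=lambda item: cosine(q_vec, item[0]), reverse=True)
        aps.append(average_precision([label for _, label in ranked], q_label))
    return sum(aps) / len(aps)

# Toy data (hypothetical): the query shows item "A" from an angle that makes it
# look more like item "B" than like the gallery image of "A".
query_app, query_txt = [0.8, 0.6], [1.0, 0.0]
gallery_raw = [
    ([1.0, 0.0], [1.0, 0.0], "A"),  # same item, different shooting angle
    ([0.6, 0.8], [0.0, 1.0], "B"),  # different item, similar appearance
]

appearance_map = mean_average_precision(
    [(query_app, "A")],
    [(app, label) for app, _, label in gallery_raw],
)
fused_map = mean_average_precision(
    [(fuse(query_app, query_txt), "A")],
    [(fuse(app, txt), label) for app, txt, label in gallery_raw],
)
print(appearance_map, fused_map)
```

In this toy case the appearance-only ranking places the look-alike item "B" first, while the fused ranking places the correct item "A" first, which is the kind of correction the abstract describes. The sqrt weighting is a design choice that keeps fused vectors at unit length, so `alpha` directly controls how much each modality contributes to the similarity score.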