相似性(几何)
计算机科学
情报检索
情态动词
散列函数
排名(信息检索)
语义相似性
人工智能
程序设计语言
图像(数学)
化学
高分子化学
作者
Xiaoqing Liu,Huanqiang Zeng,Yifan Shi,Jianqing Zhu,Chih‐Hsien Hsia,Kai‐Kuang Ma
标识
DOI:10.1109/tmm.2023.3254199
摘要
The amount of multi-modal data available on the Internet is enormous. Cross-modal hash retrieval maps heterogeneous cross-modal data into a single Hamming space to offer fast and flexible retrieval services. However, existing cross-modal methods mainly rely on the feature-level similarity between multi-modal data and ignore the relationship between relative rankings and label-level fine-grained similarity of neighboring instances. To overcome these issues, we propose a novel D eep C ross-modal H ashing based on S emantic C onsistent R anking (DCH-SCR) that comprehensively investigates the intra-modal semantic similarity relationship. Firstly, to the best of our knowledge, it is an early attempt to preserve semantic similarity for cross-modal hashing retrieval by combining label-level and feature-level information. Secondly, the inherent gap between modalities is narrowed by developing a ranking alignment loss function. Thirdly, the compact and efficient hash codes are optimized based on the common semantic space. Finally, we use the gradient to specify the optimization direction and introduce the Normalized Discounted Cumulative Gain (NDCG) to achieve varying optimization strengths for data pairs with different similarities. Extensive experiments on three real-world image-text retrieval datasets demonstrate the superiority of DCH-SCR over several state-of-the-art cross-modal retrieval methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI