UCPM: Uncertainty-Guided Cross-Modal Retrieval with Partially Mismatched Pairs

情态动词计算机科学人工智能模式识别（心理学）算法数学材料科学高分子化学

作者

Quanxing Zha,Xin Liu,Yiu‐ming Cheung,Shu‐Juan Peng,Xing Xu,Nannan Wang

出处

期刊：IEEE transactions on image processing [Institute of Electrical and Electronics Engineers]
日期：2025-01-01 卷期号：: 1-1

链接

nih.govdoi.org

标识

DOI：10.1109/tip.2025.3574918

摘要

The manual annotation of perfectly aligned labels for cross-modal retrieval (CMR) is incredibly labor-intensive. As an alternative, the collection of co-occurring data pairs from the Internet is a remarkably cost-effective way, but which, inevitably induces the Partially Mismatched Pairs (PMPs) and therefore significantly degrades the retrieval performance without particular treatment. Previous efforts often utilize the pair-wise similarity to filter out the mismatched pairs, and such operation is highly sensitive to mismatched or ambiguous data and thus leads to sub-optimal performance. To alleviate these concerns, we propose an efficient approach, termed UCPM, i.e., Uncertainty-guided Cross-modal retrieval with Partially Mismatched pairs, which can significantly reduce the adverse impact of mismatched data pairs. Specifically, a novel Uncertainty Guided Division (UGD) strategy is sophisticatedly designed to divide the corrupted training data into confident matched (clean), easily-identifiable mismatched (noisy) and hardly-determined hard subsets, and the derived uncertainty can simultaneously guide the informative pair learning while reducing the negative impact of potential mismatched pairs. Meanwhile, an effective Uncertainty Self-Correction (USC) mechanism is concurrently presented to accurately identify and rectify the fluctuated uncertainty during the training process, which further improves the stability and reliability of the estimated uncertainty. Besides, a Trusted Margin Loss (TML) is newly designed to enhance the discriminability between those hard pairs, by dynamically adjusting their soft margins to amplify the positive contributions of matched pairs while suppressing the negative impacts of mismatched pairs. Extensive experiments on three widely-used benchmark datasets, verify the effectiveness and reliability of UCPM compared with the existing SOTA approaches, and significantly improve the robustness in both synthetic and real-world PMPs. The code is available at: https://github.com/qxzha/UCPM.

求助该文献

最长约 10秒，即可获得该文献文件

UCPM: Uncertainty-Guided Cross-Modal Retrieval with Partially Mismatched Pairs

今日热心研友