变压器
计算机科学
人工智能
模式识别(心理学)
计算机视觉
情报检索
工程类
电气工程
电压
作者
Danilo Dordevic,S. Kumar
出处
期刊:Cornell University - arXiv
日期:2024-09-02
标识
DOI:10.48550/arxiv.2409.01082
摘要
We introduce the Evidential Transformer, an uncertainty-driven transformer model for improved and robust image retrieval. In this paper, we make several contributions to content-based image retrieval (CBIR). We incorporate probabilistic methods into image retrieval, achieving robust and reliable results, with evidential classification surpassing traditional training based on multiclass classification as a baseline for deep metric learning. Furthermore, we improve the state-of-the-art retrieval results on several datasets by leveraging the Global Context Vision Transformer (GC ViT) architecture. Our experimental results consistently demonstrate the reliability of our approach, setting a new benchmark in CBIR in all test settings on the Stanford Online Products (SOP) and CUB-200-2011 datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI