计算机科学
判别式
发电机(电路理论)
人工智能
公制(单位)
特征(语言学)
提取器
相似性(几何)
模式识别(心理学)
班级(哲学)
特征向量
机器学习
图像(数学)
功率(物理)
运营管理
物理
语言学
哲学
量子力学
工艺工程
工程类
经济
作者
Ruixuan Gao,Hsuan Su,Shitala Prasad,Ping Tang
标识
DOI:10.1016/j.imavis.2023.104869
摘要
Metric-based methods aim to predict class labels by computing the similarity between samples using distance functions, which is the mainstream approach to few-shot learning. However, the limited representational space of feature vectors and appearance variations among congenetic samples still present challenges. We propose a Multisemantic Information Fusion Network (MIFN) to address these problems. A Lower-level Feature generator (LF-generator), which is an unsupervised module, adaptively activates high-response regions of objects to introduce discriminative semantic details. Meanwhile, a Higher-level Feature extractor (HF-extractor) learns global semantic information with human cognition to minimise the impact of appearance variations. We integrate the coarse outputs of these two modules, which complement each other to jointly promote more precise predictions. Furthermore, considering the importance of prototypes, we redefine the sampling strategy of the triplet loss and utilise it as an auxiliary loss to sharpen the decision boundary at the prototype level, facilitating subsequent classification. Our experimental results demonstrate the competitiveness of our approach in both general few-shot classification (mini-ImageNet and tiered-ImageNet) and cross-domain problems (CUB, Caltech-101, Stanford-dogs, and Stanford-cars) with minimal bells and whistles.
科研通智能强力驱动
Strongly Powered by AbleSci AI