嵌入
判别式
人工智能
计算机科学
鉴别器
模式识别(心理学)
发电机(电路理论)
图像(数学)
特征向量
特征(语言学)
水准点(测量)
上下文图像分类
相似性(几何)
电信
功率(物理)
语言学
物理
哲学
大地测量学
量子力学
探测器
地理
作者
Gaojie Li,Yaochen Li,Jingle Liu,Wei Guo,Wenneng Tang,Yuehu Liu
标识
DOI:10.1109/tmm.2024.3353457
摘要
Existing zero-shot learning based image classification methods transform the zero-shot learning problem into supervised learning by applying generative adversarial network (GAN) to synthesize visual features of unseen classes. However, the visual features generated by the generator tend to be biased towards seen classes, and the discriminator is too weak to generate high-quality image features. To solve these problems, we propose a novel zero-shot food image classification method based on low dimensional embedding of visual features. Our method applies reinforced semantic guidance to increase the discriminative ability of the model by enhancing the strong distribution of input features. Moreover, the visual space is utilized as the embedding space to reduce the bias towards seen classes by reducing the distance between semantic information and visual features in the embedding space. Finally, the feature distribution of unseen classes is further specified by improving the prototype similarity function. Extensive experiments on three food datasets and four general benchmark datasets demonstrate the effectiveness of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI