计算机科学
人工智能
语义空间
残余物
知识图
图形
机器学习
分类器(UML)
平滑的
模式识别(心理学)
理论计算机科学
算法
计算机视觉
作者
Jiwei Wei,Yang Yang,Jingjing Li,Lei Zhu,Lin Zuo,Heng Tao Shen
标识
DOI:10.1145/3338533.3366552
摘要
Most existing Zero-Shot Learning (ZSL) approaches adopt the semantic space as a bridge to classify unseen categories. However, it is difficult to transfer knowledge from seen categories to unseen categories through semantic space, since the correlations among categories are uncertain and ambiguous in the semantic space. In this paper, we formulated zero-shot learning as a classifier weight regression problem. Specifically, we propose a novel Residual Graph Convolution Network (ResGCN) which takes word embeddings and knowledge graph as inputs and outputs a visual classifier for each category. ResGCN can effectively alleviate the problem of over-smoothing and over-fitting. During the test, an unseen image can be classified by ranking the inner product of its visual feature and predictive visual classifiers. Moreover, we provide a new method to build a better knowledge graph. Our approach not only further enhances the correlations among categories, but also makes it easy to add new categories to the knowledge graph. Experiments conducted on the large-scale ImageNet 2011 21K dataset demonstrate that our method significantly outperforms existing state-of-the-art approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI