计算机科学
利用
语义学(计算机科学)
嵌入
人工智能
图形
语义鸿沟
模式识别(心理学)
知识图
可视化
语义记忆
图像(数学)
机器学习
自然语言处理
图像检索
理论计算机科学
生物
计算机安全
神经科学
认知
程序设计语言
作者
Beibei Yu,Cheng Xie,Yujin Wang,Hongxin Xiang
标识
DOI:10.1109/iske54062.2021.9755335
摘要
Semantic fusion of combining attributes or text with knowledge graph has recently shown great potential for the task of Zero-shot Learning (ZSL). However, there is still a problem to be solved that insufficient expression of semantic features in embedding. Especially in universal datasets with an extreme challenge, such as ImageNet, these datasets are required to recognize unseen classes without corresponding annotation. To make ZSL more applicable in the real world, we found that efficiently understanding and using the image content is often the key to distinguishing objects in human recognition patterns. Thus we propose a method with transforming the visual information into semantics to alleviate the semantic gap between image and semantic description. This method allows us to exploit the Common Sense Knowledge Graph based on the hierarchical structure and the visual graph based on visual correlation concurrently. Compared with several state-of-the-art methods, the proposed method has achieved good performance in ImageNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI