Transductive Learning With Prior Knowledge for Generalized Zero-Shot Action Recognition

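Computer science; Artificial intelligence; Embedding; Classifier (UML); Semantic gap; Pattern recognition (psychology); Machine learning; Exploit; Set (abstract data type); Image (mathematics); Image retrieval; Computer security; Programming language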

Authors

Taiyi Su, Hanli Wang, Qiuping Qi, Lei Wang, Bin He

Source

Journal: IEEE Transactions on Circuits and Systems for Video Technology [Institute of Electrical and Electronics Engineers]
Date: 2023-06-09 Volume (issue): 34 (1): 260-273 Citations: 9

Identifier

DOI: 10.1109/tcsvt.2023.3284977

Abstract

Generalized zero-shot action recognition is challenging. Unlike conventional zero-shot tasks, which assume that instances of the source classes are absent from the test set, the generalized zero-shot task studies the case in which the test set contains both the source and the target classes. Due to the gap between visual features and semantic embeddings, as well as the inherent bias of the learned classifier towards the source classes, existing generalized zero-shot action recognition approaches are still far less effective than traditional zero-shot action recognition approaches. To address these challenges, a novel transductive learning with prior knowledge (TLPK) model is proposed for generalized zero-shot action recognition. First, TLPK learns prior knowledge that assists in bridging the gap between visual features and semantic embeddings and preliminarily reduces the bias caused by the visual-semantic gap. Then, a transductive learning method that employs unlabeled target data is designed to overcome the bias problem effectively. To achieve this, a target semantic-available approach and a target semantic-free approach are devised to utilize the target semantics in two different ways, where the target semantic-free approach exploits prior knowledge to produce well-performing semantic embeddings. By combining the aforementioned prior-knowledge learning and transductive learning strategies, TLPK significantly bridges the visual-semantic gap and alleviates the bias between the source and the target classes. Experiments on the benchmark datasets HMDB51 and UCF101 demonstrate the effectiveness of the proposed model compared to state-of-the-art methods. The source code of this work is available at https://mic.tongji.edu.cn
