计算机科学
人工智能
图形
学习排名
机器学习
深度学习
学习迁移
非线性降维
无监督学习
模式识别(心理学)
排名(信息检索)
降维
理论计算机科学
作者
Lucas Barbosa de Almeida,Lucas Pascotti Valem,Daniel Carlos Guimarães Pedronette
标识
DOI:10.1109/icip46576.2022.9897911
摘要
Despite the impressive advances obtained by supervised deep learning approaches on retrieval and classification tasks, how to acquire labeled data for training remains a challenging bottleneck. In this scenario, the need for developing more effective content-based retrieval approaches capable of taking advantage of multimodal information and advances in unsupervised learning becomes imperative. Based on such observations, we propose two novel approaches that combine Graph Convolutional Networks (GCNs) with rank-based manifold learning methods. The GCN models were trained in an unsupervised way, using the Deep Graph Infomax algorithm, and the proposed approaches employ recent rank-based manifold learning methods. Multimodal information is exploited through pre-trained CNNs via transfer learning for extracting audio, image, and video features. The proposed approaches were evaluated on three public action recognition datasets. High-effective results were obtained, reaching relative gains up to +29.44% of MAP compared to baseline approaches without GCNs. The experimental evaluation also considered classical and recent baselines in the literature.
科研通智能强力驱动
Strongly Powered by AbleSci AI