计算机科学
代表(政治)
杠杆(统计)
人工智能
动作(物理)
对偶(语法数字)
模糊逻辑
编码(集合论)
动作识别
机器学习
程序设计语言
法学
政治学
政治
量子力学
集合(抽象数据类型)
文学类
物理
班级(哲学)
艺术
作者
Wenxuan Liu,Xian Zhong,Zhuo Zhou,Kui Jiang,Zheng Wang,Chia‐Wen Lin
标识
DOI:10.1109/tip.2023.3273459
摘要
Multi-view action recognition aims to identify action categories from given clues. Existing studies ignore the negative influences of fuzzy views between view and action in disentangling, commonly arising the mistaken recognition results. To this end, we regard the observed image as the composition of the view and action components, and give full play to the advantages of multiple views via the adaptive cooperative representation among these two components, forming a Dual-Recommendation Disentanglement Network (DRDN) for multi-view action recognition. Specifically, 1) For the action, we leverage a multi-level Specific Information Recommendation (SIR) to enhance the interaction among intricate activities and views. SIR offers a more comprehensive representation of activities, measuring the trade-off between global and local information. 2) For the view, we utilize a Pyramid Dynamic Recommendation (PDR) to learn a complete and detailed global representation by transferring features from different views. It is explicitly restricted to resist the fuzzy noise influence, focusing on positive knowledge from other views. Our DRDN aims for complete action and view representation, where PDR directly guides action to disentangle with view features and SIR considers mutual exclusivity of view and action clues. Extensive experiments have indicated that the multi-view action recognition method DRDN we proposed achieves state-of-the-art performance over powerful competitors on several standard benchmarks. The code will be available at https://github.com/51cloud/DRDN.
科研通智能强力驱动
Strongly Powered by AbleSci AI