判别式
计算机科学
人工智能
模式识别(心理学)
异常检测
嵌入
特征向量
编码器
特征学习
变压器
特征(语言学)
Boosting(机器学习)
语言学
量子力学
操作系统
物理
哲学
电压
作者
Chao Huang,Chengliang Liu,Jie Wen,Lian Wu,Yong Xu,Qiuping Jiang,Yaowei Wang
标识
DOI:10.1109/tcyb.2022.3227044
摘要
Weakly supervised video anomaly detection is generally formulated as a multiple instance learning (MIL) problem, where an anomaly detector learns to generate frame-level anomaly scores under the supervision of MIL-based video-level classification. However, most previous works suffer from two drawbacks: 1) they lack ability to model temporal relationships between video segments and 2) they cannot extract sufficient discriminative features to separate normal and anomalous snippets. In this article, we develop a weakly supervised temporal discriminative (WSTD) paradigm, that aims to leverage both temporal relation and feature discrimination to mitigate the above drawbacks. To this end, we propose a transformer-styled temporal feature aggregator (TTFA) and a self-guided discriminative feature encoder (SDFE). Specifically, TTFA captures multiple types of temporal relationships between video snippets from different feature subspaces, while SDFE enhances the discriminative powers of features by clustering normal snippets and maximizing the separability between anomalous snippets and normal centers in embedding space. Experimental results on three public benchmarks indicate that WSTD outperforms state-of-the-art unsupervised and weakly supervised methods, which verifies the superiority of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI