计算机科学
卷积神经网络
人工智能
联营
深度学习
特征(语言学)
棱锥(几何)
模式识别(心理学)
代表(政治)
机器学习
政治
光学
物理
哲学
语言学
法学
政治学
作者
Lang He,Jonathan Cheung-Wai Chan,Zhongmin Wang
出处
期刊:Neurocomputing
[Elsevier BV]
日期:2020-10-14
卷期号:422: 165-175
被引量:110
标识
DOI:10.1016/j.neucom.2020.10.015
摘要
Artificial intelligence (AI) has incorporated various automatic systems and frameworks to diagnose the severity of depression using hand-crafted features. However, process of feature selection needs domain knowledge and is still time-consuming and subjective. Deep learning technology has been successfully adopted for depression recognition. Most previous works pre-train the deep models on large databases followed by fine-tuning with depression databases (i.e., AVEC2013, AVEC2014). In the present paper we propose an integrated framework – Deep Local Global Attention Convolutional Neural Network (DLGA-CNN) for depression recognition, which adopts CNN with attention mechanism as well as weighted spatial pyramid pooling (WSPP) to learn a deep and global representation. Two branches are introduced: Local Attention based CNN (LA-CNN) focuses on the local patches, while Global Attention based CNN (GA-CNN) learns the global patterns from the entire facial region. To capture the complementary information between the two branches, Local–Global Attention-based CNN (LGA-CNN) is proposed. After feature aggregation, WSPP is used to learn the depression patterns. Comprehensive experiments on AVEC2013 and AVEC2014 depression databases have demonstrated that the proposed method is capable of mining the underlying depression patterns of facial videos and outperforms the most of the state-of-the-art video-based depression recognition approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI