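Arousal
Emotion recognition
Sentiment analysis
Computer science
Cognitive psychology
Functional connectivity
Artificial intelligence
Psychology
Neuroscience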
Authors
Feng Zhang, Xicheng Li, Chee Peng Lim, Qiang Hua, Chun-Ru Dong, Junhai Zhai
Identifier
DOI:10.1016/j.inffus.2022.07.006
Abstract
• Provides a framework combining deep learning with brain-like innate structures
• Introduces time-dependent interactions into the Transformer to model emotional coherence
• Embeds a gating mechanism to identify the distinctions between different modalities

Multimodal sentiment analysis and emotion recognition have become an increasingly popular research area, in which the biggest challenge is to efficiently fuse the input information from different modalities. Recent success is largely credited to attention-based models, e.g., the transformer and its variants. However, the attention-based mechanism often neglects the coherence of human emotion because of its parallel structure. Inspired by the emotional arousal model in cognitive science, this paper proposes a Deep Emotional Arousal Network (DEAN) that is capable of simulating emotional coherence by incorporating time dependence into the parallel structure of the transformer. The proposed DEAN model consists of three components: a cross-modal transformer that simulates the functions of the human perception analysis system, a multimodal BiLSTM system that imitates the cognitive comparator, and a multimodal gating block that mimics the activation mechanism in the human emotional arousal model. We perform extensive comparison and ablation studies on three benchmarks for multimodal sentiment analysis and emotion recognition. The empirical results indicate that DEAN achieves state-of-the-art performance, and useful insights are derived from the results.
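The abstract gives no equations for the three components, so the following is only a minimal sketch of two of the ideas it names: attention from one modality over another, and a gate that weights each modality before fusion. It is not the DEAN implementation. The function names (`cross_modal_attention`, `gated_sum`), the scalar-gate form, and the absence of learned projections are all assumptions made for illustration, and the BiLSTM stage that carries the time dependence is omitted.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def cross_modal_attention(target_seq, source_seq):
    """Scaled dot-product attention with queries from one modality and
    keys/values from another (hypothetical helper, no learned weights).

    Each sequence is a list of time steps; each step is a list of floats
    of the same dimension d.
    """
    d = len(target_seq[0])
    attended = []
    for query in target_seq:
        weights = softmax([dot(query, key) / math.sqrt(d) for key in source_seq])
        # Weighted average of the source steps for this target step.
        step = [sum(w * v[i] for w, v in zip(weights, source_seq)) for i in range(d)]
        attended.append(step)
    return attended


def gated_sum(modality_vectors, gate_weights, gate_biases):
    """Scale each modality vector by a scalar gate in (0, 1), then sum.

    The gate form sigmoid(w . x + b) is an assumption, not taken from the paper.
    """
    d = len(next(iter(modality_vectors.values())))
    fused = [0.0] * d
    gates = {}
    for name, vec in modality_vectors.items():
        gate = sigmoid(dot(gate_weights[name], vec) + gate_biases[name])
        gates[name] = gate
        for i in range(d):
            fused[i] += gate * vec[i]
    return fused, gates
```

In this sketch, a gate near 0 suppresses a modality and a gate near 1 passes it through unchanged, which is one simple way to let a model weight modalities differently for each input.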