Image-Text Multimodal Emotion Classification via Multi-View Attentional Network

计算机科学 人工智能 情绪识别 特征提取 图像(数学) 模式识别(心理学) 自然语言处理
作者
Xiaocui Yang,Shi Feng,Daling Wang,Yifei Zhang
出处
期刊:IEEE Transactions on Multimedia [Institute of Electrical and Electronics Engineers]
卷期号:23: 4014-4026 被引量:234
标识
DOI:10.1109/tmm.2020.3035277
摘要

Compared with single-modal content, multimodal data can express users’ feelings and sentiments more vividly and interestingly. Therefore, multimodal sentiment analysis has become a popular research topic. However, most existing methods either learn modal sentiment feature independently, without considering their correlations, or they simply integrate multimodal features. In addition, most publicly available multimodal datasets are labeled by sentiment polarities, while the emotions expressed by users are specific. Based on this observation, in this paper, we build a large-scale image-text emotion dataset (i.e., labeled by different emotions), called TumEmo, with more than 190,000 instances from Tumblr. 1 We further propose a novel multimodal emotion analysis model based on the Multi-view Attentional Network (MVAN), which utilizes a memory network that is continually updated to obtain the deep semantic features of image-text. The model includes three stages: feature mapping, interactive learning, and feature fusion. In the feature mapping stage, we leverage image features from an object viewpoint and a scene viewpoint to capture effective information for multimodal emotion analysis. Then, an interactive learning mechanism is adopted that uses the memory network; this mechanism extracts single-modal emotion features and interactively models the cross-view dependencies between the image and text. In the feature fusion stage, multiple features are deeply fused using a multilayer perceptron and a stacking-pooling module. The experimental results on the MVSA-Single, MVSA-Multiple, and TumEmo datasets show that the proposed MVAN outperforms strong baseline models by large margins.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
爱格儿发布了新的文献求助10
刚刚
范莉发布了新的文献求助10
1秒前
小草没发布了新的文献求助10
4秒前
荔枝酱果冻完成签到,获得积分10
7秒前
8秒前
xixi完成签到,获得积分10
9秒前
9秒前
充电宝应助sunhang526采纳,获得10
11秒前
11秒前
xixi发布了新的文献求助10
13秒前
zdjzdj完成签到 ,获得积分10
17秒前
18秒前
18秒前
道松先生发布了新的文献求助10
19秒前
zhangshenrong发布了新的文献求助10
20秒前
97_完成签到,获得积分10
20秒前
XYY发布了新的文献求助30
21秒前
biscuit关注了科研通微信公众号
22秒前
科研CY发布了新的文献求助10
24秒前
ping完成签到 ,获得积分10
24秒前
24秒前
Q同学完成签到,获得积分10
24秒前
25秒前
25秒前
26秒前
26秒前
26秒前
26秒前
26秒前
26秒前
26秒前
Moonpie应助科研通管家采纳,获得10
26秒前
小蘑菇应助科研通管家采纳,获得10
26秒前
无花果应助科研通管家采纳,获得10
26秒前
27秒前
今后应助科研通管家采纳,获得10
27秒前
丘比特应助科研通管家采纳,获得10
27秒前
Moonpie应助科研通管家采纳,获得10
27秒前
加菲丰丰应助科研通管家采纳,获得10
27秒前
彭于晏应助科研通管家采纳,获得20
27秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Psychopathic Traits and Quality of Prison Life 1000
Development Across Adulthood 1000
Chemistry and Physics of Carbon Volume 18 800
The formation of Australian attitudes towards China, 1918-1941 660
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6450658
求助须知:如何正确求助?哪些是违规求助? 8262825
关于积分的说明 17604562
捐赠科研通 5515053
什么是DOI,文献DOI怎么找? 2903396
邀请新用户注册赠送积分活动 1880407
关于科研通互助平台的介绍 1722274