Transformer-based multimodal feature enhancement networks for multimodal depression detection integrating video, audio and remote photoplethysmograph signals

计算机科学 模式 人工智能 特征提取 模态(人机交互) 特征(语言学) 音频信号 变压器 模式识别(心理学) 语音识别 工程类 哲学 社会学 电气工程 电压 语言学 社会科学 语音编码
作者
Huiting Fan,Xingnan Zhang,Yingying Xu,Jiangxiong Fang,Shiqing Zhang,Xiaoming Zhao,Jun Yu
出处
期刊:Information Fusion [Elsevier]
卷期号:104: 102161-102161
标识
DOI:10.1016/j.inffus.2023.102161
摘要

Depression stands as one of the most widespread psychological disorders and has garnered increasing attention. Currently, how to effectively achieve automatic multimodal depression detection for assisting doctors in early diagnosis of depression, has become an important and challenging issue. To address this issue, this work proposes Transformer-based feature enhancement networks for multimodal depression detection. The proposed method effectively integrates three modalities including video, audio and remote photoplethysmographic (rPPG) signals for multimodal depression detection, in which the rPPG modality is introduced as an additional modality for enhancing the effectiveness of multimodal depression detection. The proposed method consists of three key steps: multimodal feature extraction for video, audio and rPPG modalities, Transformer-based multimodal feature enhancement (TMFE), and graph fusion networks (GFN) based multimodal fusion and depression prediction. More specially, in the stage of multimodal feature extraction, for video and audio modalities we employ deep convolutional neural networks (CNN) to extract the corresponding high-level video and audio features, respectively. For rPPG modality, we adopt a short-time end-to-end rPPG estimation framework to extract the rPPG signal values.The TMFE module stacks multiple Transformers such as the inter-modal, intra-modal, and tri-modal Transformers to jointly capture the dynamics and relationships within and between modalities for each time-step of input sequences. The GFN module is designed to effectively fuse the obtained feature representations from different modalities while maintaining the interactions between them simultaneously. Finally, the obtained shared feature representations of all modalities are fed into a multilayer perceptrons (MLP) network to implement final depression detection tasks. Extensive experiments are conducted on two public datasets such as AVEC2013 and AVEC2014, and experimental results demonstrate the validity of the proposed method on depression detection tasks.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
英俊的铭应助张丁采纳,获得10
4秒前
5秒前
freeabc完成签到,获得积分10
5秒前
天才小能喵应助kingmantj采纳,获得10
5秒前
蓝从发布了新的文献求助10
6秒前
CodeCraft应助积极焦采纳,获得10
7秒前
freeabc发布了新的文献求助10
9秒前
小二郎应助郭敬杰采纳,获得10
9秒前
SRn嘿嘿发布了新的文献求助10
10秒前
11秒前
YY完成签到 ,获得积分10
11秒前
11秒前
12秒前
鱼遇完成签到,获得积分10
12秒前
月颜发布了新的文献求助10
15秒前
肖保定完成签到,获得积分10
15秒前
英俊的铭应助SRn嘿嘿采纳,获得10
16秒前
张丁发布了新的文献求助10
16秒前
17秒前
18秒前
直率的惜寒完成签到,获得积分20
21秒前
hxd发布了新的文献求助10
21秒前
Fourteen完成签到,获得积分10
22秒前
张丁完成签到,获得积分10
22秒前
隐形远航发布了新的文献求助10
22秒前
七月完成签到,获得积分10
22秒前
Loooong完成签到,获得积分0
22秒前
炙热的渊思完成签到,获得积分10
24秒前
郭敬杰发布了新的文献求助10
24秒前
科研通AI2S应助郴欧尼采纳,获得10
26秒前
26秒前
小马甲应助hxd采纳,获得10
27秒前
麻呢呢完成签到,获得积分10
27秒前
柳浪完成签到,获得积分10
28秒前
HOPE完成签到,获得积分10
30秒前
30秒前
zhuyy完成签到,获得积分10
31秒前
32秒前
32秒前
33秒前
高分求助中
【本贴是提醒信息,请勿应助】请在求助之前详细阅读求助说明!!!! 20000
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
The Three Stars Each: The Astrolabes and Related Texts 900
Yuwu Song, Biographical Dictionary of the People's Republic of China 800
Multifunctional Agriculture, A New Paradigm for European Agriculture and Rural Development 600
Challenges, Strategies, and Resiliency in Disaster and Risk Management 500
Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2482324
求助须知:如何正确求助?哪些是违规求助? 2144747
关于积分的说明 5471145
捐赠科研通 1867118
什么是DOI,文献DOI怎么找? 928115
版权声明 563071
科研通“疑难数据库(出版商)”最低求助积分说明 496509