清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Transformer-based multimodal feature enhancement networks for multimodal depression detection integrating video, audio and remote photoplethysmograph signals

计算机科学 模式 人工智能 特征提取 模态(人机交互) 特征(语言学) 音频信号 变压器 模式识别(心理学) 语音识别 工程类 哲学 社会学 电气工程 电压 语言学 社会科学 语音编码
作者
Huiting Fan,Xingnan Zhang,Yingying Xu,Jiangxiong Fang,Shiqing Zhang,Xiaoming Zhao,Jun Yu
出处
期刊:Information Fusion [Elsevier BV]
卷期号:104: 102161-102161 被引量:44
标识
DOI:10.1016/j.inffus.2023.102161
摘要

Depression stands as one of the most widespread psychological disorders and has garnered increasing attention. Currently, how to effectively achieve automatic multimodal depression detection for assisting doctors in early diagnosis of depression, has become an important and challenging issue. To address this issue, this work proposes Transformer-based feature enhancement networks for multimodal depression detection. The proposed method effectively integrates three modalities including video, audio and remote photoplethysmographic (rPPG) signals for multimodal depression detection, in which the rPPG modality is introduced as an additional modality for enhancing the effectiveness of multimodal depression detection. The proposed method consists of three key steps: multimodal feature extraction for video, audio and rPPG modalities, Transformer-based multimodal feature enhancement (TMFE), and graph fusion networks (GFN) based multimodal fusion and depression prediction. More specially, in the stage of multimodal feature extraction, for video and audio modalities we employ deep convolutional neural networks (CNN) to extract the corresponding high-level video and audio features, respectively. For rPPG modality, we adopt a short-time end-to-end rPPG estimation framework to extract the rPPG signal values. The TMFE module stacks multiple Transformers such as the inter-modal, intra-modal, and tri-modal Transformers to jointly capture the dynamics and relationships within and between modalities for each time-step of input sequences. The GFN module is designed to effectively fuse the obtained feature representations from different modalities while maintaining the interactions between them simultaneously. Finally, the obtained shared feature representations of all modalities are fed into a multilayer perceptrons (MLP) network to implement final depression detection tasks. Extensive experiments are conducted on two public datasets such as AVEC2013 and AVEC2014, and experimental results demonstrate the validity of the proposed method on depression detection tasks.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
38秒前
Deannn778发布了新的文献求助10
42秒前
城北徐公完成签到,获得积分10
44秒前
思源应助Deannn778采纳,获得10
52秒前
1分钟前
路路完成签到 ,获得积分10
1分钟前
2568269431完成签到 ,获得积分10
1分钟前
王波完成签到 ,获得积分10
2分钟前
充电宝应助科研通管家采纳,获得10
2分钟前
2分钟前
忘忧Aquarius完成签到,获得积分10
2分钟前
斯文的炳完成签到,获得积分10
2分钟前
charih完成签到 ,获得积分10
2分钟前
lixuebin完成签到 ,获得积分10
3分钟前
常有李完成签到,获得积分10
3分钟前
廖梦琪完成签到 ,获得积分10
3分钟前
yxdjzwx完成签到,获得积分10
3分钟前
胡国伦完成签到 ,获得积分10
3分钟前
3分钟前
Marciu33发布了新的文献求助10
3分钟前
文静灵阳完成签到 ,获得积分10
3分钟前
微卫星不稳定完成签到 ,获得积分10
4分钟前
科研通AI2S应助Virtual采纳,获得10
4分钟前
共享精神应助科研通管家采纳,获得10
4分钟前
飞翔的企鹅完成签到,获得积分10
4分钟前
4分钟前
4分钟前
4分钟前
chcmy完成签到 ,获得积分0
5分钟前
耕牛热完成签到,获得积分10
5分钟前
糟糕的翅膀完成签到,获得积分10
5分钟前
追风少年完成签到 ,获得积分10
5分钟前
龙猫爱看书完成签到,获得积分10
6分钟前
6分钟前
Come发布了新的文献求助20
6分钟前
科研通AI5应助Xuancheng_SINH采纳,获得10
6分钟前
Alex-Song完成签到 ,获得积分0
6分钟前
沉沉完成签到 ,获得积分0
6分钟前
Marciu33发布了新的文献求助10
6分钟前
6分钟前
高分求助中
(应助此贴封号)【重要!!请各位详细阅读】【科研通的精品贴汇总】 10000
Pediatric Injectable Drugs 500
Instant Bonding Epoxy Technology 500
Methodology for the Human Sciences 500
ASHP Injectable Drug Information 2025 Edition 400
DEALKOXYLATION OF β-CYANOPROPIONALDEYHDE DIMETHYL ACETAL 400
March's Advanced Organic Chemistry: Reactions, Mechanisms, and Structure 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 冶金 细胞生物学 免疫学
热门帖子
关注 科研通微信公众号,转发送积分 4377751
求助须知:如何正确求助?哪些是违规求助? 3873203
关于积分的说明 12068445
捐赠科研通 3516366
什么是DOI,文献DOI怎么找? 1929560
邀请新用户注册赠送积分活动 971163
科研通“疑难数据库(出版商)”最低求助积分说明 869841