Authors
Chaoyue Ding, Daoming Zong, Baoxiang Li, Ken Zheng, Dinghao Zhou, Jiakui Li, Qunyan Zhou
Identifier
DOI:10.1145/3607865.3613184
Abstract
In this paper, we present our solutions to the MER-SEMI subchallenge of the Multimodal Emotion Recognition Challenge (MER 2023). This subchallenge focuses on predicting discrete emotions for a small subset of unlabeled videos in a semi-supervised setting: participants are provided with a combination of labeled videos and large amounts of unlabeled ones. Our preliminary experiments on the labeled videos demonstrate that this task is primarily driven by the video and audio modalities, while the text modality plays a relatively weaker role in emotion prediction. To address this challenge, we propose the Video-Audio Transformer (VAT), which takes raw signals as inputs and extracts multimodal representations. VAT comprises a video encoder, an audio encoder, and a cross-modal encoder. To leverage the vast amount of unlabeled data, we introduce a contrastive loss that aligns the image and audio representations before fusing them through cross-modal attention. Additionally, to enhance the model's ability to learn from noisy video data, we apply momentum distillation, a self-training method that learns from pseudo-targets generated by a momentum model. Finally, we fine-tune VAT on annotated video data specifically for emotion recognition. Experimental results on the MER-SEMI task demonstrate the effectiveness of the proposed VAT model. Notably, our model ranks first (0.891) on the leaderboard. Our project is publicly available at https://github.com/dingchaoyue/Multimodal-Emotion-Recognition-MER-and-MuSe-2023-Challenges.
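Two of the abstract's ingredients, the contrastive alignment of video and audio representations and the momentum model used for distillation, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation; the function names, batch shapes, and hyperparameter values (`temperature`, decay `m`) are illustrative assumptions, and the contrastive objective is sketched as a standard symmetric InfoNCE loss over paired embeddings.

```python
import numpy as np

def l2_normalize(x, axis=-1):
    # Project embeddings onto the unit sphere so dot products are cosine similarities.
    return x / np.linalg.norm(x, axis=axis, keepdims=True)

def contrastive_loss(video_emb, audio_emb, temperature=0.07):
    """Symmetric InfoNCE-style loss aligning paired video/audio embeddings.

    Matched (video_i, audio_i) pairs are positives; every other pairing in
    the batch serves as a negative. Shapes: (batch, dim) for both inputs.
    """
    v = l2_normalize(video_emb)
    a = l2_normalize(audio_emb)
    logits = v @ a.T / temperature          # (B, B) similarity matrix
    labels = np.arange(len(v))              # positives sit on the diagonal

    def cross_entropy(lg):
        # Numerically stable log-softmax over each row.
        lg = lg - lg.max(axis=1, keepdims=True)
        logp = lg - np.log(np.exp(lg).sum(axis=1, keepdims=True))
        return -logp[labels, labels].mean()

    # Average the video->audio and audio->video directions.
    return 0.5 * (cross_entropy(logits) + cross_entropy(logits.T))

def momentum_update(params, momentum_params, m=0.995):
    # EMA update: the momentum model slowly tracks the trained model and
    # supplies the pseudo-targets used for momentum distillation.
    return [m * mp + (1.0 - m) * p for p, mp in zip(params, momentum_params)]
```

As a sanity check, a batch whose audio embeddings equal its video embeddings should incur a much lower contrastive loss than a batch paired with unrelated random embeddings, since the diagonal similarities then dominate each row.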