对话
计算机科学
动力学(音乐)
语音识别
说话人识别
情绪识别
跟踪(教育)
人工智能
心理学
沟通
教育学
作者
Jiawei Chen,Peijie Huang,Guotai Huang,Qianer Li,Yuhong Xu
标识
DOI:10.1109/icassp49357.2023.10094810
摘要
Emotion Recognition in Conversation (ERC) has considerable prospects due to its wide range of applications. Most existing works integrate speaker information statically and capture a relatively consistent atmosphere in conversation. However, these works poorly track the emotional state dynamics of each party in a conversation and focus on emotion consistency. The speakers' emotional states are independent but influence each other during the conversation. To address the above issues, we propose a Speaker Dynamics Tracking Network (SDTN) for ERC. Specifically, SDTN can dynamically track the local and global speaker states during emotional flow in conversation and capture implicit stimulation of emotional shift. Extensive experiments on MELD and EmoryNLP datasets demonstrate the superiority and effectiveness of our proposed SDTN model, and confirm that every designed module consistently benefits the performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI