计算机科学
情绪分析
人工智能
语言模型
对话框
变压器
自然语言处理
机器学习
语音识别
量子力学
物理
万维网
电压
作者
Shun Katada,Shogo Okada,Kazunori Komatani
标识
DOI:10.1145/3536221.3556576
摘要
One of the main challenges in realizing dialog systems is adapting to a user's sentiment state in real time. Large-scale language models, such as BERT, have achieved excellent performance in sentiment estimation; however, the use of only linguistic information from user utterances in sentiment estimation still has limitations. In fact, self-reported sentiment is not necessarily expressed by user utterances. To mitigate the issue that the true sentiment state is not expressed as observable signals, psychophysiology and affective computing studies have focused on physiological signals that capture involuntary changes related to emotions. We address this problem by efficiently introducing time-series physiological signals into a state-of-the-art language model to develop an adaptive dialog system. Compared with linguistic models based on BERT representations, physiological long short-term memory (LSTM) models based on our proposed physiological signal processing method have competitive performance. Moreover, we extend our physiological signal processing method to the Transformer language model and propose the Time-series Physiological Transformer (TPTr), which captures sentiment changes based on both linguistic and physiological information. In ensemble models, our proposed methods significantly outperform the previous best result (p < 0.05).
科研通智能强力驱动
Strongly Powered by AbleSci AI