Computer science
Electroencephalography (EEG)
Proportion (ratio)
Natural language processing
Artificial intelligence
Psychology
Cartography
Neuroscience
Geography
Authors
Yongquan Hu, Shuning Zhang, Ting Dang, Hong Jia, Flora D. Salim, Wen Hu, Aaron Quigley
Identifier
DOI: 10.1145/3675094.3678494
Abstract
Integrating physiological signals such as electroencephalogram (EEG) with other data such as interview audio may offer valuable multimodal insights into psychological states or neurological disorders. Recent advancements position Large Language Models (LLMs) as prospective "health agents" for mental health assessment. However, current research predominantly focuses on single data modalities, presenting an opportunity to advance understanding through multimodal data. Our study aims to advance this approach by investigating multimodal data with LLMs for mental health assessment, specifically through zero-shot and few-shot prompting. Three datasets incorporating EEG, facial expressions, and audio (text) are adopted for depression and emotion classification. The results indicate that multimodal information confers substantial advantages over single-modality approaches in mental health assessment. Notably, integrating EEG alongside modalities commonly used with LLMs, such as audio and images, demonstrates promising potential. Moreover, our findings reveal that 1-shot learning offers greater benefits than zero-shot learning.
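To make the zero-shot versus 1-shot prompting setup concrete, the following is a minimal sketch of how such an evaluation might be wired up. It assumes an OpenAI-style chat API; the model name, prompt wording, label set, and the serialized EEG/facial/transcript feature summaries are all illustrative assumptions, not the paper's actual prompts, models, or data.

```python
# Sketch: zero-shot vs. 1-shot multimodal prompting for depression
# classification. All prompts, labels, and feature summaries below are
# hypothetical; the paper's actual setup may differ.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

SYSTEM = (
    "You are a mental health assessment assistant. Given summaries of a "
    "subject's EEG features, facial expressions, and interview transcript, "
    "classify the subject as 'depressed' or 'not depressed'. "
    "Answer with one label only."
)

# Hypothetical multimodal features serialized as text; real inputs would
# come from EEG feature extraction, facial expression analysis, and ASR.
subject = (
    "EEG: elevated frontal alpha asymmetry; reduced beta power.\n"
    "Facial: low smile frequency, reduced eye contact.\n"
    "Transcript: 'I haven't been sleeping well and nothing feels worth doing.'"
)

# A single labeled demonstration for the 1-shot condition (fabricated).
example_input = (
    "EEG: balanced frontal alpha; normal beta power.\n"
    "Facial: frequent smiles, sustained eye contact.\n"
    "Transcript: 'Work is busy but I'm enjoying time with friends.'"
)
example_label = "not depressed"

def classify(one_shot: bool) -> str:
    messages = [{"role": "system", "content": SYSTEM}]
    if one_shot:
        # Prepend one labeled example before the query; zero-shot omits it.
        messages.append({"role": "user", "content": example_input})
        messages.append({"role": "assistant", "content": example_label})
    messages.append({"role": "user", "content": subject})
    resp = client.chat.completions.create(model="gpt-4o", messages=messages)
    return resp.choices[0].message.content.strip()

print("zero-shot:", classify(one_shot=False))
print("1-shot:   ", classify(one_shot=True))
```

The only difference between the two conditions is the single prepended demonstration, which matches the abstract's comparison of zero-shot against 1-shot prompting over identical multimodal inputs.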