文字2vec
计算机科学
嵌入
瓶颈
领域(数学分析)
人工智能
机器学习
编码器
文字嵌入
语音识别
模式识别(心理学)
自然语言处理
数学
操作系统
数学分析
嵌入式系统
作者
Heinrich Dinkel,Pingyue Zhang,Mengyue Wu,Kai Yu
出处
期刊:Cornell University - arXiv
日期:2019-10-29
被引量:4
摘要
Depression detection research has increased over the last few decades as this disease is becoming a socially-centered problem. One major bottleneck for developing automatic depression detection methods lies in the limited data availability. Recently, pretrained text-embeddings have seen success in sparse data scenarios, while pretrained audio embeddings are rarely investigated. This paper proposes DEPA, a self-supervised, Word2Vec like pretrained depression audio embedding method for depression detection. An encoder-decoder network is used to extract DEPA on sparse-data in-domain (DAIC) and large-data out-domain (switchboard, Alzheimer's) datasets. With DEPA as the audio embedding, performance significantly outperforms traditional audio features regarding both classification and regression metrics. Moreover, we show that large-data out-domain pretraining is beneficial to depression detection performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI