精神疾病
社会化媒体
计算机科学
深度学习
社会学习
机器学习
数据科学
人工智能
心理健康
心理学
万维网
精神科
知识管理
作者
Zuhang Ding,Zhongyan Wang,Yeyubei Zhang,Yuchen Cao,Yunchong Liu,Xiaorui Shen,Yexin Tian,Jianglai Dai
标识
DOI:10.1038/s41598-025-99167-6
摘要
Social media platforms provide valuable insights into mental health trends by capturing user-generated discussions on conditions such as depression, anxiety, and suicidal ideation. Machine learning (ML) and deep learning (DL) models have been increasingly applied to classify mental health conditions from textual data, but selecting the most effective model involves trade-offs in accuracy, interpretability, and computational efficiency. This study evaluates multiple ML models, including logistic regression, random forest, and LightGBM, alongside DL architectures such as ALBERT and Gated Recurrent Units (GRUs), for both binary and multi-class classification of mental health conditions. Our findings indicate that ML and DL models achieve comparable classification performance on medium-sized datasets, with ML models offering greater interpretability through variable importance scores, while DL models are more robust to complex linguistic patterns. Additionally, ML models require explicit feature engineering, whereas DL models learn hierarchical representations directly from text. Logistic regression provides the advantage of capturing both positive and negative associations between features and mental health conditions, whereas tree-based models prioritize decision-making power through split-based feature selection. This study offers empirical insights into the advantages and limitations of different modeling approaches and provides recommendations for selecting appropriate methods based on dataset size, interpretability needs, and computational constraints.
科研通智能强力驱动
Strongly Powered by AbleSci AI