可解释性
过度拟合
机器学习
人工智能
计算机科学
蛋白质组学
新颖性
生物标志物发现
数据科学
透视图(图形)
人工神经网络
心理学
生物
生物化学
社会心理学
基因
作者
Charlotte Adams,Wout Bittremieux
标识
DOI:10.1080/14789450.2025.2545828
摘要
Machine learning holds significant promise for accelerating biomarker discovery in clinical proteomics, yet its real-world impact remains limited by widespread methodological pitfalls and unrealistic expectations. In this perspective, we critically examine the application of machine learning for biomarker discovery in clinical proteomics, emphasizing that algorithmic novelty alone cannot compensate for issues such as small sample sizes, batch effects, overfitting, data leakage, and poor model generalization. We caution against the uncritical application of complex models, such as deep learning architectures, that often exacerbate these problems, offering limited interpretability and negligible performance gains in typical clinical proteomics datasets. Instead, we advocate for the realistic and responsible use of machine learning, grounded in rigorous study design, appropriate validation strategies, and transparent, reproducible modeling practices. Emphasizing simplicity, interpretability, and domain awareness over hype-driven complexity is essential if machine learning is to fulfill its translational potential in the clinic.
科研通智能强力驱动
Strongly Powered by AbleSci AI