过度拟合
机器学习
医学
人工智能
优势和劣势
样品(材料)
生物医学
数据类型
计算机科学
数据科学
数据挖掘
生物信息学
人工神经网络
认识论
哲学
生物
化学
程序设计语言
色谱法
作者
Kathryn M. Kingsmore,Christopher Puglisi,Amrie C. Grammer,Peter E. Lipsky
标识
DOI:10.1038/s41584-021-00708-w
摘要
Machine learning (ML) is a computerized analytical technique that is being increasingly employed in biomedicine. ML often provides an advantage over explicitly programmed strategies in the analysis of multidimensional information by recognizing relationships in the data that were not previously appreciated. As such, the use of ML in rheumatology is increasing, and numerous studies have employed ML to classify patients with rheumatic autoimmune inflammatory diseases (RAIDs) from medical records and imaging, biometric or gene expression data. However, these studies are limited by sample size, the accuracy of sample labelling, and absence of datasets for external validation. In addition, there is potential for ML models to overfit or underfit the data and, thereby, these models might produce results that cannot be replicated in an unrelated dataset. In this Review, we introduce the basic principles of ML and discuss its current strengths and weaknesses in the classification of patients with RAIDs. Moreover, we highlight the successful analysis of the same type of input data (for example, medical records) with different algorithms, illustrating the potential plasticity of this analytical approach. Altogether, a better understanding of ML and the future application of advanced analytical techniques based on this approach, coupled with the increasing availability of biomedical data, may facilitate the development of meaningful precision medicine for patients with RAIDs.
科研通智能强力驱动
Strongly Powered by AbleSci AI