接收机工作特性
医学
肿瘤科
逻辑回归
特征选择
比例危险模型
朴素贝叶斯分类器
肺癌
放射基因组学
人工智能
内科学
机器学习
计算机科学
支持向量机
无线电技术
作者
Victor Lee,Nicholas Moore,Joshua Doyle,Daniel Hicks,Patrick Oh,Shari Bodofsky,Sajid Hossain,Abhijit A. Patel,Sanjay Aneja,Robert Homer,Henry S. Park
摘要
PURPOSE Lymph node metastasis (LNM) significantly affects prognosis and treatment strategies in non–small cell lung cancer (NSCLC). Current diagnostic methods, including imaging and histopathology, have limited sensitivity and specificity. This study aims to develop and evaluate machine learning (ML) models that predict LNM in NSCLC using single-nucleotide polymorphism (SNP) data from The Cancer Genome Atlas. METHODS A cohort of 542 patients with NSCLC with comprehensive SNP data were analyzed. After preprocessing, feature selection was performed using chi-square tests to identify SNPs significantly associated with LNM. Twelve ML models, including Logistic Regression, Naive Bayes, and Support Vector Machines, were trained and evaluated using bootstrapped data sets. Model performance was assessed using metrics such as accuracy, area under the receiver operating characteristic curve (AUC), and F1 score. Shapley additive explanations values were used for feature interpretability, and survival analysis was conducted to assess clinical outcomes. RESULTS Naive Bayes and Logistic Regression models achieved the highest predictive performance, with median AUCs of 0.93 and 0.91, respectively. Key SNPs, including mutations in TANC2 , KCNT2 , and CENPF , were consistently identified as predictive features. Survival analysis demonstrated significant differences in outcomes on the basis of model-predicted LNM status (log-rank P = .0268). Feature selection improved model accuracy and robustness, highlighting the biological relevance of selected SNPs. CONCLUSION ML models leveraging primary tumor SNP data can enhance LNM prediction in NSCLC, outperforming traditional diagnostic methods. These findings underscore the potential of integrating genomics and ML to develop noninvasive biomarkers, enabling precise risk stratification and personalized treatment strategies in oncology.
科研通智能强力驱动
Strongly Powered by AbleSci AI