拉曼光谱
肺癌
光谱学
癌症
材料科学
计算机科学
医学物理学
纳米技术
光学
医学
物理
病理
内科学
天文
作者
Rahat Ullah,Kiran Perveen,Imran Rehan,Saranjam Khan
标识
DOI:10.1088/1402-4896/adc214
摘要
Abstract This study investigates the use of machine learning to distinguish between lung cancer patients and healthy individuals by analyzing the chemical composition of serum samples via Raman spectroscopy. Sera samples from confirmed lung cancer patients alongside with control samples from healthy individuals were collected. Notable spectral differences were observed at different Raman shifts between the cancerous and healthy samples. Dimensionality reduction was performed using Principal Component Analysis (PCA), and the biochemical variations were analyzed using an advanced ensemble learning method—specifically, the Extreme Gradient Boosting (XGBoost) algorithm. The model's predictions were validated through cross-checking with the K-Nearest Neighbors (KNN) algorithm. The XGBoost model, evaluated through 10-fold cross-validation, outperformed KNN, achieving 97% accuracy, 98% sensitivity, and a precision and specificity of 96%. These results highlight the potential of Raman spectroscopy combined with machine learning as an effective, non-invasive tool for early detection and screening of lung cancer.
科研通智能强力驱动
Strongly Powered by AbleSci AI