Early diagnosis of brain metastases using cerebrospinal fluid cell‐free DNA‐based breakpoint motif and mutational features in lung cancer

医学 断点 肺癌 胎儿游离DNA 脑脊液 DNA 癌症研究 病理 遗传学 生物 基因 怀孕 染色体易位 胎儿 产前诊断
作者
Xueting Qin,Yujun Bai,Shizhen Zhou,Hongjin Shi,Xiaoli Liu,Song Wang,Xiaoying Wu,Jiaohui Pang,Xi Song,Xiaojun Fan,Qiuxiang Ou,Yang Xu,Hua Bao,Li Li,Jun Li,Yang Shao,Shuanghu Yuan
出处
期刊:Clinical and translational medicine [Springer Science+Business Media]
卷期号:13 (3) 被引量:4
标识
DOI:10.1002/ctm2.1221
摘要

In this study, we developed a sensitive machine learning model with a remarkable capacity to predict brain metastases (BM) in lung cancer patients using the breakpoint motif (BPM) features in cerebrospinal fluid (CSF) circulating tumour DNA (ctDNA). We have also assessed the mutational profile in CSF ctDNA, revealing promising BM-related prognostic biomarkers in lung cancer patients. BM is frequently associated with a short life expectancy and a high mortality rate in lung cancer patients.1 Early detection and timely treatment help to ameliorate the disease severity for lung cancer BM (LCBM). Brain magnetic resonance imaging (MRI) is the preferred method to evaluate the number, size and location of BM, but it lacks clear guidance to indicate the appropriate timing for screening. Cancer treatment may also obscure contrast enhancement, making the BM diagnosis more challenging.2 Meanwhile, CSF cytology provides valuable information about the pathologic conditions of cells involved in the central nervous system (CNS) and its coverings but is not sensitive enough for definitive diagnosis and highly relies on the pathologist's experience. Therefore, exploring sensitive and accurate methods is essential for promoting the early detection of LCBM. Plasma cell-free DNA (cfDNA) analysis has been widely adopted for assessing genomic features of cancer patients, monitoring response to treatment, quantifying minimal residual disease, and examining therapy resistance.3-7 Particularly, Guo et al. have leveraged the elastic-net logistic regression algorithm to integrate the 6 bp BPM feature in plasma cfDNA and successfully built a sensitive model for stage I lung adenocarcinoma detection.8 As CSF ctDNA has been gaining credibility for its high capability of detecting somatic genetic alterations in patients with CNS malignancies,9 this study aims to develop a robust model for the sensitive detection of LCBM using genetic features derived from CSF ctDNA. In this study, 76.6% of lung cancer patients (62/81) were diagnosed with parenchymal BM with or without other types of CNS diseases by enhanced brain MRI and/or computerized tomography (CT) scan (Table S1). CSF cytology was performed for 71 patients initially admitted to our hospital as a complementary approach for diagnosing leptomeningeal metastasis. All 81 patients underwent lumbar puncture to collect CSF for targeted next-generation sequencing (NGS), followed by extraction of BPM and mutational features for modelling (Supplementary Material). According to the BM status and the relationship with follow-up time, the 81 patients were classified into three subgroups, including 62 POS patients (patients whose BM status was already positive at CSF sampling), 10 NEG patients (patients whose BM status was negative at CSF sampling and remained unchanged during the follow-up) and nine NTP patients (patients whose BM status turned from negative at CSF sampling to positive during the follow-up). As NTP patients were generally located between POS and NEG patients in the principal component analysis (Figure S1), we, therefore, assigned 70 patients with definitive BM status at CSF sampling (62 POS and eight randomly selected NEG) to the training cohort to develop the BM detection model and 11 patients (nine NTP and two randomly selected NEG) to the testing cohort for independent evaluation of the model performance (Figure 1A). Since the predictive model built solely on CSF ctDNA status showed a relatively high false-positive rate in detecting LCBM (Figure S2), we wondered if incorporating the ctDNA status feature into the model based on BPM features of CSF ctDNA using elastic-net logistic regression, hereafter referred to as "integrated model", could help improve the model performance (Figure 1B). In the training cohort, the integrated model achieved an area under the curve (AUC) of 0.940 (95% confidence interval [CI]: 0.885–0.995), which was slightly better than the BPM model with an AUC of 0.929 (95% CI: 0.862–0.997, Figure 2A). Both models performed similarly in distinguishing lung cancer patients with different BM or ctDNA status (Figure 2B,C and Figure S3A,B). Furthermore, both models were tested against different patient subgroups and persisted in high performance regardless of patients' clinical characteristics, such as age, ctDNA status, smoking and treatment history (Figure 2D and Figure S3C). At 90% sensitivity, comparable high specificities were achieved by both models when tested in the matched cohorts (Figure 2E and Figure S3D). Next, we assessed our models' performance in the testing cohort comprising 9 NTP and 2 NEG patients. Interestingly, the integrated model achieved an AUC of 0.833 (95% CI: 0.4681–1), whereas the BPM model performed slightly better, with an AUC of 0.944 (95% CI: 0.7905–1, Figure 3A). The lower AUC of the integrated model might be explained by the inclusion of three BM-negative patients who tested positive for CSF ctDNA (two for training and one for testing). The positive CSF ctDNA result might be because CSF ctDNA results indicate BM status earlier than conventional neurological imaging, either because genomic changes have not yet caused organic pathologic changes or because the organic disease cannot be detected at an early stage. While not statistically significant, higher risk scores were associated with shorter BM detection times in both models (Figure 3B and Figure S4A). It is worth noting that the BPM model not only distinguished all seven high-risk patients from low-risk but also outperformed the integrated model for its lower false-negative rates in predicting BM of low-risk patients (BPM model: 50% versus integrated model: 66.7%; Figure 3C and Figure S4B). Additionally, the risk score computed by the BPM model better reflected BM-free survival (BMS) than the integrated model (Figure 3D and Figure S4C). Overall, these findings suggested that the BPM model performs better in predicting LCBM. Incorporating the CSF ctDNA status feature into the BPM model could not further improve the model's performance. To determine which motif contributed mostly to the model's predictive power, we performed a hierarchical clustering analysis in the training cohort using motifs with non-zero coefficients in the BPM model (Figure 3E). The CGTTCG motif was found to have the most positive coefficient, which showed an upward trend in three patient subgroups categorized by BM status (Figure 3F). In contrast, the GGAAAT motif, which had the greatest negative coefficient, presented an opposite trend in these patients (Figure 3G). In the testing cohort, a similar distribution pattern of the CGTTCG motif was observed (Figure S5). However, GGAAAT did not show the expected trend due to the limited sample size. Lastly, we performed comprehensive genomic profiling using CSF ctDNA mutational features to identify BM-associated genetic alterations in 80 lung cancer patients with known clinical outcomes. The most frequent genomic alterations were in the EGFR, TP53, RB1, CDKN2A, and CDKN2B genes (Figure 4A). Noteworthy, we emphasized alterations in the DNA-damage response (DDR)-related pathways for their role in leptomeningeal metastasis development.10 At univariate analysis, RB1 variants, EGFR amplification, and the Fanconi Anemia (FA) pathway alterations were individually associated with BMS (Figure 4B–G and Table S2). RB1 variants and EGFR amplification in CSF ctDNA of lung cancer patients remained independently associated with an inferior prognosis in the multivariate model (P = 0.028 and 0.023, respectively; Figure 4H). As a proof-of-concept pilot study exploring the clinical application of BPM profiling in the sensitive detection of BM with a machine-learning model in lung cancer patients, our study has a few limitations. The limited sample size may potentially compromise the credibility of our BM predictive model. Expanding the cohort size is warranted to improve the statistical power of a more accurate estimation of the risk score in lung cancer patients. In addition, although most patients in our study developed parenchymal BM during progression, the study cohort comprises various BM types due to sample availability. The cfDNA and BPM profiles may differ and need to be further investigated. Therefore, we plan to conduct a more extensive study and develop a BPM model capable of identifying patients with different BM types, which may add significant value to the current model for its clinical utility. In summary, we established a robust BM predictive model using the BPM features of CSF ctDNA and profiled genomic alterations associated with BM in lung cancer patients. Our study provides insights into the potential use of CSF ctDNA sequencing for the early detection of LCBM and disease management. We thank the patients, their families, and the investigators and research staff involved. This work was supported by the Natural Science Foundation of China (NSFC81872475, NSFC82073345) and the Jinan Clinical Medicine Science and Technology Innovation Plan (202019060). Song Wang, Xiaoying Wu, Jiaohui Pang, Xi Song, Xiaojun Fan, Qiuxiang Ou, Yang Xu, Hua Bao and Yang Shao are employees of Nanjing Geneseeq Technology Inc. The remaining authors declare no conflict of interest. Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
hkh发布了新的文献求助10
1秒前
Tumbleweed668发布了新的文献求助10
1秒前
2秒前
Birdy发布了新的文献求助30
2秒前
孙不缺完成签到,获得积分10
3秒前
曲夜白完成签到 ,获得积分10
4秒前
4秒前
谢富杰发布了新的文献求助10
6秒前
拼搏一曲完成签到,获得积分10
6秒前
南北完成签到,获得积分10
6秒前
hwq123完成签到,获得积分10
7秒前
科研通AI5应助tian采纳,获得10
7秒前
李健应助bosszjw采纳,获得10
8秒前
淡然冬灵发布了新的文献求助100
9秒前
11秒前
歪歪完成签到 ,获得积分10
14秒前
douzi完成签到,获得积分10
15秒前
学术z完成签到,获得积分10
15秒前
yeyeye发布了新的文献求助10
16秒前
phuocnlh完成签到,获得积分10
17秒前
langbuyu完成签到,获得积分10
17秒前
17秒前
23333完成签到,获得积分10
17秒前
18秒前
18秒前
coolkid完成签到 ,获得积分10
19秒前
AKYDXS完成签到,获得积分10
20秒前
小蘑菇应助苯二氮卓采纳,获得10
21秒前
bosszjw发布了新的文献求助10
21秒前
善良的背包完成签到,获得积分10
21秒前
LIBALA完成签到,获得积分10
23秒前
tian发布了新的文献求助10
23秒前
快船总冠军完成签到 ,获得积分10
24秒前
hss完成签到 ,获得积分10
26秒前
小肚溜圆完成签到,获得积分10
28秒前
欣喜的高跟鞋完成签到,获得积分20
30秒前
叁叁完成签到 ,获得积分10
30秒前
双马尾小男生2完成签到,获得积分10
30秒前
drhwang完成签到,获得积分10
34秒前
谢富杰发布了新的文献求助10
34秒前
高分求助中
【此为提示信息,请勿应助】请按要求发布求助,避免被关 20000
ISCN 2024 – An International System for Human Cytogenomic Nomenclature (2024) 3000
Continuum Thermodynamics and Material Modelling 2000
Encyclopedia of Geology (2nd Edition) 2000
105th Edition CRC Handbook of Chemistry and Physics 1600
Maneuvering of a Damaged Navy Combatant 650
Fashion Brand Visual Design Strategy Based on Value Co-creation 350
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3777834
求助须知:如何正确求助?哪些是违规求助? 3323321
关于积分的说明 10213925
捐赠科研通 3038575
什么是DOI,文献DOI怎么找? 1667549
邀请新用户注册赠送积分活动 798161
科研通“疑难数据库(出版商)”最低求助积分说明 758290