生物
微生物群
计算生物学
基因组
人类微生物组计划
人类基因组
稳健性(进化)
参考文献
假阳性悖论
1000基因组计划
生物信息学
遗传学
人体微生物群
计算机科学
基因
人工智能
单核苷酸多态性
基因型
作者
Gregory D. Sepich‐Poore,Daniel McDonald,Evguenia Kopylova,Caitlin Guccione,Qiyun Zhu,George I. Austin,Carolina S. Carpenter,Serena Fraraccio,Stephen Wandro,Tomasz Kościółek,Stefan Janssen,Jessica L. Metcalf,Se Jin Song,Jad Kanbar,Sandrine Miller‐Montgomery,Robert K. Heaton,Rana R. McKay,Sandip Pravin Patel,Austin D. Swafford,Tal Korem
出处
期刊:Oncogene
[Springer Nature]
日期:2024-02-23
卷期号:43 (15): 1127-1148
被引量:25
标识
DOI:10.1038/s41388-024-02974-w
摘要
Abstract In 2020, we identified cancer-specific microbial signals in The Cancer Genome Atlas (TCGA) [1]. Multiple peer-reviewed papers independently verified or extended our findings [2–12]. Given this impact, we carefully considered concerns by Gihawi et al. [13] that batch correction and database contamination with host sequences artificially created the appearance of cancer type-specific microbiomes. (1) We tested batch correction by comparing raw and Voom-SNM-corrected data per-batch, finding predictive equivalence and significantly similar features. We found consistent results with a modern microbiome-specific method (ConQuR [14]), and when restricting to taxa found in an independent, highly-decontaminated cohort. (2) Using Conterminator [15], we found low levels of human contamination in our original databases (~1% of genomes). We demonstrated that the increased detection of human reads in Gihawi et al. [13] was due to using a newer human genome reference. (3) We developed Exhaustive, a method twice as sensitive as Conterminator, to clean RefSeq. We comprehensively host-deplete TCGA with many human (pan)genome references. We repeated all analyses with this and the Gihawi et al. [13] pipeline, and found cancer type-specific microbiomes. These extensive re-analyses and updated methods validate our original conclusion that cancer type-specific microbial signatures exist in TCGA, and show they are robust to methodology.
科研通智能强力驱动
Strongly Powered by AbleSci AI