拷贝数变化
基因组
稳健性(进化)
生物
人口
计算机科学
人类基因组
计算生物学
遗传学
基因
社会学
人口学
作者
Xihong Wang,Zhuqing Zheng,Yi Cai,Ting Chen,Chao Li,Weiwei Fu,Yu Jiang
出处
期刊:GigaScience
[University of Oxford]
日期:2017-12-01
卷期号:6 (12)
被引量:71
标识
DOI:10.1093/gigascience/gix115
摘要
The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and robustness of current human-oriented CNV detection methods. Here, we present CNVcaller, a read-depth method for discovering CNVs in population sequencing data. The computational speed of CNVcaller was 1–2 orders of magnitude faster than CNVnator and Genome STRiP for complex genomes with thousands of unmapped scaffolds. CNV detection of 232 goats required only 1.4 days on a single compute node. Additionally, the Mendelian consistency of sheep trios indicated that CNVcaller mitigated the influence of high proportions of gaps and misassembled duplications in the nonhuman reference genome assembly. Furthermore, multiple evaluations using real sheep and human data indicated that CNVcaller achieved the best accuracy and sensitivity for detecting duplications. The fast generalized detection algorithms included in CNVcaller overcome prior computational barriers for detecting CNVs in large-scale sequencing data with complex genomic structures. Therefore, CNVcaller promotes population genetic analyses of functional CNVs in more species.
科研通智能强力驱动
Strongly Powered by AbleSci AI