基因组
箱子
基因组
生物
计算机科学
计算生物学
遗传学
算法
基因
作者
Samuel T. N. Aroney,R.B. Newell,Gene W. Tyson,Ben J. Woodcroft
标识
DOI:10.1101/2024.11.24.625082
摘要
Abstract Recovery of microbial genomes from metagenomic datasets has provided genomic representation for hundreds of thousands of species from diverse biomes. However, low abundance microorganisms are often missed due to insufficient genomic coverage. Here we present Bin Chicken, an algorithm which substantially improves genome recovery through automated, targeted selection of metagenomes for coassembly based on shared marker gene sequences derived from raw reads. Marker gene sequences that are divergent from known reference genomes can be further prioritised, providing an efficient means of recovering highly novel genomes. Applying Bin Chicken to public metagenomes and coassembling 800 sample-groups recovered 77,562 microbial genomes, including the first genomic representatives of 6 phyla, 41 classes, and 24,028 species. These genomes expand the genomic tree of life and uncover a wealth of novel microbial lineages for further research.
科研通智能强力驱动
Strongly Powered by AbleSci AI