康蒂格
计算机科学
生物
质粒
计算生物学
基因组
基因组
可扩展性
基因
遗传学
数据库
作者
Mikhail Kolmogorov,Mikhail Rayko,Jeffrey Yuan,Evgeny Polevikov,Pavel A. Pevzner
摘要
Abstract Long-read sequencing technologies substantially improved assemblies of many isolate bacterial genomes as compared to fragmented assemblies produced with short-read technologies. However, assembling complex metagenomic datasets remains a challenge even for the state-of-the-art long-read assemblers. To address this gap, we present the metaFlye assembler and demonstrate that it generates highly contiguous and accurate metagenome assemblies. In contrast to short-read metagenomics assemblers that typically fail to reconstruct full-length 16S RNA genes, metaFlye captures many 16S RNA genes within long contigs, thus providing new opportunities for analyzing the microbial “dark matter of life”. We also demonstrate that long-read metagenome assemblers significantly improve full-length plasmid and virus reconstruction as compared to short-read assemblers and reveal many novel plasmids and viruses.
科研通智能强力驱动
Strongly Powered by AbleSci AI