基因组
基因组
生物
纳米孔测序
DNA测序
计算生物学
核糖体RNA
进化生物学
环境DNA
顺序装配
基因组学
生物多样性
系统发育树
基因
遗传学
生态学
转录组
基因表达
作者
Mantas Sereika,Aaron J. Mussig,Chenjing Jiang,Kalinka Sand Knudsen,Thomas Bygh Nymann Jensen,Francesca Petriglieri,Yuhong Yang,Vibeke Rudkjøbing Jørgensen,Francesco Delogu,Emil A. Sørensen,Per Halkjær Nielsen,Caitlin M. Singleton,Philip Hugenholtz,Mads Albertsen
标识
DOI:10.1038/s41564-025-02062-z
摘要
The emergence of high-throughput, long-read DNA sequencing has enabled recovery of microbial genomes from environmental samples at scale. However, expanding the terrestrial microbial genome catalogue has been challenging due to the enormous complexity of these environments. Here we performed deep, long-read Nanopore sequencing of 154 soil and sediment samples collected during the Microflora Danica project, yielding genomes of 15,314 previously undescribed microbial species, recovered using our custom mmlong2 workflow. The recovered microbial genomes span 1,086 previously uncharacterized genera and expand the phylogenetic diversity of the prokaryotic tree of life by 8%. The long-read assemblies also enabled the recovery of thousands of complete ribosomal RNA operons, biosynthetic gene clusters and CRISPR-Cas systems. Furthermore, the incorporation of the recovered genomes into public genomic databases substantially improved species-level classification rates for soil and sediment metagenomic datasets. These findings demonstrate that long-read sequencing allows cost-effective recovery of high-quality microbial genomes from highly complex ecosystems, which remain an untapped source of biodiversity.
科研通智能强力驱动
Strongly Powered by AbleSci AI