基因组
生物
鉴定(生物学)
计算生物学
分类等级
同源(生物学)
生物分类
基因
模式识别(心理学)
人工智能
遗传学
进化生物学
计算机科学
分类单元
古生物学
生态学
作者
David Ainsworth,Michael J.E. Sternberg,Come Raczy,Sarah Butcher
摘要
k-SLAM is a highly efficient algorithm for the characterization of metagenomic data. Unlike other ultra-fast metagenomic classifiers, full sequence alignment is performed allowing for gene identification and variant calling in addition to accurate taxonomic classification. A k-mer based method provides greater taxonomic accuracy than other classifiers and a three orders of magnitude speed increase over alignment based approaches. The use of alignments to find variants and genes along with their taxonomic origins enables novel strains to be characterized. k-SLAM's speed allows a full taxonomic classification and gene identification to be tractable on modern large data sets. A pseudo-assembly method is used to increase classification accuracy by up to 40% for species which have high sequence homology within their genus.
科研通智能强力驱动
Strongly Powered by AbleSci AI