基因组
生物
人病毒体
康蒂格
计算生物学
生物分类
进化生物学
遗传学
基因组
基因
作者
Sarah François,Philippe Roumagnac,Marie Frayssinet,Darren J. Martin,Mylène Ogliastro,Rémy Froissart
出处
期刊:Virus Research
[Elsevier BV]
日期:2018-01-15
卷期号:244: 230-234
被引量:8
标识
DOI:10.1016/j.virusres.2017.11.011
摘要
Metagenomics studies have revolutionized the field of biology by revealing the presence of many previously unisolated and uncultured micro-organisms. However, one of the main problems encountered in metagenomic studies is the high percentage of sequences that cannot be assigned taxonomically using commonly used similarity-based approaches (e.g. BLAST or HMM). These unassigned sequences are allegorically called « dark matter » in the metagenomic literature and are often referred to as being derived from new or unknown organisms. Here, based on published and original metagenomic datasets coming from virus-like particle enriched samples, we present and quantify the improvement of viral taxonomic assignment that is achievable with a new similarity-based approach. Indeed, prior to any use of similarity based taxonomic assignment methods, we propose assembling contigs from short reads as is currently routinely done in metagenomic studies, but then to further map unassembled reads to the assembled contigs. This additional mapping step increases significantly the proportions of taxonomically assignable sequence reads from a variety -plant, insect and environmental (estuary, lakes, soil, feces) - of virome studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI