Annotation
Pseudogene
Genome
Gene
Computational biology
Biology
Gene annotation
Genome project
Genetics
Authors
Mei-Yee Law,Kevin L. Childs,Michael S. Campbell,Joshua C. Stein,Andrew Olson,Carson Holt,Nicholas Panchy,Jikai Lei,Dian Jiao,Carson M. Andorf,Carolyn J. Lawrence,Doreen Ware,Shin‐Han Shiu,Yanni Sun,Ning Jiang,Mark Yandell
Source
Journal: Plant Physiology
[Oxford University Press]
Date: 2014-11-10
Volume (Issue): 167 (1): 25-39
Citations: 51
Identifier
DOI:10.1104/pp.114.245027
Abstract
The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes.