转座因子
注释
基因组
基因组计划
生物
串联重复
遗传学
后转座子
参考基因组
比较基因组学
计算生物学
DNA测序
基因组学
DNA
基因
作者
Christopher W. Benson,Pedro Heringer,Shujun Ou
出处
期刊:CSH Protocols
[Cold Spring Harbor Laboratory Press]
日期:2024-09-05
被引量:1
标识
DOI:10.1101/pdb.prot108578
摘要
Transposable elements (TEs) and tandem repeat arrays are ubiquitous components of genomes across all domains of life. Many types of repetitive DNA do not appear to encode for functional proteins, and those that do, typically only code for enzymes involved in their own replication. Nevertheless, repetitive DNA sequences can significantly alter genome structure, and can have a profound impact on an organism's biology at both the molecular and organismal levels. Advances in long-read sequencing technology have enabled the resolution of previously collapsed contigs and scaffolds that are rich in repeats, which has made the accurate annotation of TEs and other repetitive sequences a crucial early step in genome analysis. Here, we provide a detailed tutorial for streamlined annotation of TEs and repeats in the genome of the model plant Zea mays (maize). Maize is ideally suited to illustrate these procedures due to its repeat-rich genome and the volume of publicly available and high-quality genomic resources. We outline four possible approaches for TE and repeat annotation, each aimed at accommodating a different set of scientific interests. Additionally, we demonstrate how to evaluate annotation quality, and provide scripts to help graphically depict TE and repeat landscapes. Although the protocol is tailored for maize, we also offer pointers for researchers working on other systems throughout and expect that these procedures will be broadly applicable to any eukaryotic genome.
科研通智能强力驱动
Strongly Powered by AbleSci AI