TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads

生物 卫星DNA 串联重复 基因组 康蒂格 计算生物学 遗传学 DNA测序 卫星 DNA 基因 工程类 航空航天工程
作者
Petr Novák,Laura Ávila Robledillo,Andrea Koblížková,Iva Vrbová,Pavel Neumann,Jir̆ı́ Macas
出处
期刊:Nucleic Acids Research [Oxford University Press]
卷期号:45 (12): e111-e111 被引量:320
标识
DOI:10.1093/nar/gkx257
摘要

Satellite DNA is one of the major classes of repetitive DNA, characterized by tandemly arranged repeat copies that form contiguous arrays up to megabases in length. This type of genomic organization makes satellite DNA difficult to assemble, which hampers characterization of satellite sequences by computational analysis of genomic contigs. Here, we present tandem repeat analyzer (TAREAN), a novel computational pipeline that circumvents this problem by detecting satellite repeats directly from unassembled short reads. The pipeline first employs graph-based sequence clustering to identify groups of reads that represent repetitive elements. Putative satellite repeats are subsequently detected by the presence of circular structures in their cluster graphs. Consensus sequences of repeat monomers are then reconstructed from the most frequent k-mers obtained by decomposing read sequences from corresponding clusters. The pipeline performance was successfully validated by analyzing low-pass genome sequencing data from five plant species where satellite DNA was previously experimentally characterized. Moreover, novel satellite repeats were predicted for the genome of Vicia faba and three of these repeats were verified by detecting their sequences on metaphase chromosomes using fluorescence in situ hybridization.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
1秒前
2秒前
2秒前
3秒前
Lucas应助香蕉猴子啦啦啦采纳,获得10
3秒前
吃货发布了新的文献求助10
3秒前
无极微光应助酶来研去采纳,获得40
4秒前
reai发布了新的文献求助30
4秒前
4秒前
易安发布了新的文献求助10
4秒前
5秒前
5秒前
自觉的豌豆完成签到,获得积分10
6秒前
123123完成签到,获得积分10
6秒前
6秒前
脑洞疼应助科研通管家采纳,获得10
6秒前
6秒前
6秒前
脑洞疼应助科研通管家采纳,获得10
6秒前
6秒前
6秒前
6秒前
6秒前
田様应助科研通管家采纳,获得10
6秒前
6秒前
思源应助科研通管家采纳,获得10
6秒前
田様应助科研通管家采纳,获得10
7秒前
深情安青应助科研通管家采纳,获得10
7秒前
思源应助科研通管家采纳,获得10
7秒前
7秒前
深情安青应助科研通管家采纳,获得10
7秒前
完美世界应助科研通管家采纳,获得10
7秒前
7秒前
完美世界应助科研通管家采纳,获得10
7秒前
7秒前
7秒前
酷波er应助科研通管家采纳,获得10
7秒前
7秒前
酷波er应助科研通管家采纳,获得10
7秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Encyclopedia of Quaternary Science Reference Third edition 6000
Encyclopedia of Forensic and Legal Medicine Third Edition 5000
Introduction to strong mixing conditions volume 1-3 5000
Aerospace Engineering Education During the First Century of Flight 3000
Agyptische Geschichte der 21.30. Dynastie 3000
Les Mantodea de guyane 2000
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5785091
求助须知:如何正确求助?哪些是违规求助? 5685673
关于积分的说明 15466575
捐赠科研通 4914208
什么是DOI,文献DOI怎么找? 2645113
邀请新用户注册赠送积分活动 1592892
关于科研通互助平台的介绍 1547293