Introduction of the Python script STRinNGS for analysis of STR regions in FASTQ or BAM files and expansion of the Danish STR sequence database to 11 STRs

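Keywords: Biology, Genetics, Microsatellite, Locus (genetics), Sequence database, Massively parallel sequencing, STR multiplex system, DNA sequencing, Allele, Gene
Authors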
Susanne Lunøe Friis,Anders Buchard,Eszter Rockenbauer,Claus Børsting,Niels Morling
Source
Journal: Forensic Science International-genetics [Elsevier BV]
Volume: 21: 68-75 Cited by: 35
Identifier
DOI:10.1016/j.fsigen.2015.12.006
Abstract

This work introduces the in-house developed Python application STRinNGS for analysis of STR sequence elements in BAM or FASTQ files. STRinNGS identifies sequence reads containing STR loci by their flanking sequences, analyses the STR sequence and the flanking regions, and generates a report with the assigned SNP-STR alleles. The main output file from STRinNGS contains all sequences with read counts above 1% of the total number of reads per locus. STR sequences are automatically named according to the nomenclature used previously and according to the repeat unit definitions in STRBase (http://www.cstl.nist.gov/strbase/). The sequences are named with (1) the locus name, (2) the length of the repeat region divided by the length of the repeat unit, (3) the sequence(s) of the repeat unit(s) followed by the number of repeats, and (4) variations in the flanking regions. Lower case letters in the main output file are used to flag sequences with previously unknown variations in the STRs. SNPs in the flanking regions are named by their "rs" numbers and the nucleotides in the SNP position. Data from 207 Danes sequenced with the Ion Torrent™ HID STR 10-plex, which amplifies nine STRs (CSF1PO, D3S1358, D5S818, D7S820, D8S1179, D16S539, TH01, TPOX, vWA) and Amelogenin, were analysed with STRinNGS. Sequencing uncovered five common SNPs near four STRs and revealed 20 new alleles in the 207 Danes. Three short homopolymers in the D8S1179 flanking regions caused frequent sequencing errors. In 29 of 3726 allele calls (0.8%), sequences with homopolymer errors were falsely assigned as true alleles. An in-house developed script in R compensated for these errors by compiling sequence reads that had identical STR sequences and identical nucleotides in the five common SNPs. In the output file from the R script, all SNP-STR haplotype calls were correct.
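The read identification, naming, and 1% filtering steps described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the STRinNGS source code: the locus name `TOYSTR`, both flank sequences, and all function names are hypothetical; flanks are matched exactly; and the bracketed name format is only one possible rendering of naming rules (1) to (3). Flanking-region SNPs (rule 4) and the lower-case flagging of unknown variants are left out.

```python
from collections import Counter
from itertools import groupby


def extract_repeat_region(read, left_flank, right_flank):
    """Return the sequence between the two flanks, or None if a flank is missing."""
    start = read.find(left_flank)
    if start == -1:
        return None
    start += len(left_flank)
    end = read.find(right_flank, start)
    if end == -1:
        return None
    return read[start:end]


def name_allele(locus, region, unit_len):
    """Name a repeat region: locus, length in repeat units, then motif[count] blocks."""
    full, rest = divmod(len(region), unit_len)
    length_label = str(full) if rest == 0 else f"{full}.{rest}"
    # Cut the region into unit-sized chunks and collapse runs of identical chunks.
    chunks = [region[i:i + unit_len] for i in range(0, len(region), unit_len)]
    blocks = "".join(f"{motif}[{len(list(run))}]" for motif, run in groupby(chunks))
    return f"{locus}[{length_label}]{blocks}"


def call_locus(reads, locus, left_flank, right_flank, unit_len, min_fraction=0.01):
    """Count repeat regions and keep those above min_fraction of the locus reads."""
    regions = (extract_repeat_region(r, left_flank, right_flank) for r in reads)
    counts = Counter(r for r in regions if r is not None)
    total = sum(counts.values())
    return {
        name_allele(locus, seq, unit_len): n
        for seq, n in counts.items()
        if n > min_fraction * total
    }


# Hypothetical toy data: these flanks and reads are invented for illustration.
LEFT, RIGHT = "ACGTAC", "TTGGCC"
reads = (
    [LEFT + "TCTA" * 7 + RIGHT] * 120
    + [LEFT + "TCTA" * 2 + "TCTG" + "TCTA" * 5 + RIGHT] * 79
    + [LEFT + "TCTA" * 6 + RIGHT]   # a single low-level read, below the 1% cut-off
    + ["GGGGGGGG"]                  # a read without flanks, ignored
)
calls = call_locus(reads, "TOYSTR", LEFT, RIGHT, unit_len=4)
```

In this toy run, the two well-supported sequences are reported and the single low-level read is dropped. Reads from BAM or FASTQ files would first have to be parsed into plain sequence strings, which is not shown here.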
The 207 samples and six additional samples were sequenced for D3S1358, D12S391, and D21S11 using the 454 GS Junior platform in this and a previous study. Overall, next-generation sequencing (NGS) of the 11 STRs lowered the mean match probability 386-fold and increased the typical paternity indexes (i.e. the geometric mean) for trios and duos 47-fold and 23-fold, respectively, compared to the traditional PCR-CE typing of the same population.
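To make the two summary statistics above concrete, here is a small sketch of how a per-locus match probability and a geometric mean might be computed. It assumes Hardy-Weinberg proportions, which the abstract does not state, and the allele frequencies are invented toy values, not data from the study. The function names are hypothetical.

```python
import math
from itertools import combinations


def match_probability(allele_freqs):
    """Sum of squared genotype frequencies at one locus, assuming Hardy-Weinberg."""
    freqs = list(allele_freqs.values())
    homozygotes = sum((p * p) ** 2 for p in freqs)
    heterozygotes = sum((2 * p * q) ** 2 for p, q in combinations(freqs, 2))
    return homozygotes + heterozygotes


def geometric_mean(values):
    """Geometric mean, computed via logs to avoid overflow on long products."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Toy frequencies (assumed): length-based typing sees two alleles,
# while sequencing splits allele "10" into two sequence variants.
ce_freqs = {"9": 0.5, "10": 0.5}
ngs_freqs = {"9": 0.5, "10a": 0.25, "10b": 0.25}

pm_ce = match_probability(ce_freqs)
pm_ngs = match_probability(ngs_freqs)
fold_reduction = pm_ce / pm_ngs
```

Splitting one length-based allele into sequence variants lowers the match probability at that locus; across independent loci, the per-locus values would be multiplied.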
