Familial long-read sequencing increases yield of de novo mutations

遗传学 生物 索引 杂交基因组组装 DNA测序 深度测序 INDEL突变 基因组 基因组学 参考基因组 纳米孔测序 顺序装配 计算生物学 Illumina染料测序 生殖系 全基因组测序 结构变异
作者
Michelle D Noyes,William T Harvey,David Porubsky,Arvis Sulovari,Ruiyang Li,Nicholas R Rose,Peter A Audano,Katherine M Munson,Alexandra P Lewis,Kendra Hoekzema,Tuomo Mantere,Tina A Graves-Lindsay,Ashley D Sanders,Sara Goodwin,Melissa Kramer,Younes Mokrab,Michael C Zody,Alexander Hoischen,Jan O Korbel,W Richard McCombie,Evan E Eichler
出处
期刊:American Journal of Human Genetics [Elsevier BV]
卷期号:109 (4): 631-646
标识
DOI:10.1016/j.ajhg.2022.02.014
摘要

Summary

Studies of de novo mutation (DNM) have typically excluded some of the most repetitive and complex regions of the genome because these regions cannot be unambiguously mapped with short-read sequencing data. To better understand the genome-wide pattern of DNM, we generated long-read sequence data from an autism parent-child quad with an affected female where no pathogenic variant had been discovered in short-read Illumina sequence data. We deeply sequenced all four individuals by using three sequencing platforms (Illumina, Oxford Nanopore, and Pacific Biosciences) and three complementary technologies (Strand-seq, optical mapping, and 10X Genomics). Using long-read sequencing, we initially discovered and validated 171 DNMs across two children—a 20% increase in the number of de novo single-nucleotide variants (SNVs) and indels when compared to short-read callsets. The number of DNMs further increased by 5% when considering a more complete human reference (T2T-CHM13) because of the recovery of events in regions absent from GRCh38 (e.g., three DNMs in heterochromatic satellites). In total, we validated 195 de novo germline mutations and 23 potential post-zygotic mosaic mutations across both children; the overall true substitution rate based on this integrated callset is at least 1.41 × 10−8 substitutions per nucleotide per generation. We also identified six de novo insertions and deletions in tandem repeats, two of which represent structural variants. We demonstrate that long-read sequencing and assembly, especially when combined with a more complete reference genome, increases the number of DNMs by >25% compared to previous studies, providing a more complete catalog of DNM compared to short-read data alone.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
荷包蛋完成签到,获得积分20
刚刚
李健应助小俊花采纳,获得10
1秒前
2秒前
loading发布了新的文献求助10
2秒前
xiaoliuxiaoli发布了新的文献求助10
2秒前
arthur发布了新的文献求助10
3秒前
研友_VZG7GZ应助汪鼎采纳,获得10
3秒前
3秒前
李健的小迷弟应助小艾采纳,获得10
4秒前
6秒前
lhtyzcg发布了新的文献求助10
6秒前
CipherSage应助呱呱采纳,获得10
6秒前
wanci应助关中人采纳,获得10
7秒前
8秒前
顺利映菡完成签到,获得积分10
9秒前
PMY发布了新的文献求助10
9秒前
思源应助金金段采纳,获得10
9秒前
烟花应助耍酷的剑身采纳,获得10
9秒前
9秒前
wk0635发布了新的文献求助10
10秒前
yy发布了新的文献求助10
10秒前
一刀一个球磨机完成签到 ,获得积分10
11秒前
顺利映菡发布了新的文献求助10
12秒前
yeeeeees完成签到,获得积分10
12秒前
13秒前
aa完成签到,获得积分10
13秒前
小卢发布了新的文献求助10
13秒前
Halo完成签到,获得积分10
13秒前
尊敬的半梅完成签到 ,获得积分10
15秒前
15秒前
soda发布了新的文献求助10
15秒前
人间完成签到,获得积分20
16秒前
甜青提完成签到,获得积分10
16秒前
17秒前
xkhxh发布了新的文献求助10
18秒前
科目三应助PMY采纳,获得10
18秒前
20秒前
深鬼关注了科研通微信公众号
20秒前
21秒前
要减肥发布了新的文献求助10
21秒前
高分求助中
Psychopathic Traits and Quality of Prison Life 1000
Malcolm Fraser : a biography 680
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6453772
求助须知:如何正确求助?哪些是违规求助? 8264898
关于积分的说明 17614195
捐赠科研通 5519052
什么是DOI,文献DOI怎么找? 2904499
邀请新用户注册赠送积分活动 1881201
关于科研通互助平台的介绍 1723727