Index switching causes “spreading-of-signal” among multiplexed samples in Illumina HiSeq 4000 DNA sequencing

Illumina染料测序 DNA测序 计算生物学 多路复用 纳米孔测序 深度测序 DNA 计算机科学 基因组 生物 遗传学 基因 电信
作者
Rahul Sinha,Geoff Stanley,Gulati Gs,Camille Ezran,Kyle J. Travaglini,E.T. Wei,C. K. Chan,Nabhan An,Tong Su,Morganti Rm,Conley Sd,Hassan Chaı̈b,Kristy Red‐Horse,Longaker Mt,Snyder Mp,Krasnow Ma,Weissman Il
标识
DOI:10.1101/125724
摘要

Abstract Illumina-based next generation sequencing (NGS) has accelerated biomedical discovery through its ability to generate thousands of gigabases of sequencing output per run at a fraction of the time and cost of conventional technologies. The process typically involves four basic steps: library preparation, cluster generation, sequencing, and data analysis. In 2015, a new chemistry of cluster generation was introduced in the newer Illumina machines (HiSeq 3000/4000/X Ten) called exclusion amplification (ExAmp), which was a fundamental shift from the earlier method of random cluster generation by bridge amplification on a non-patterned flow cell. The ExAmp chemistry, in conjunction with patterned flow cells containing nanowells at fixed locations, increases cluster density on the flow cell, thereby reducing the cost per run. It also increases sequence read quality, especially for longer read lengths (up to 150 base pairs). This advance has been widely adopted for genome sequencing because greater sequencing depth can be achieved for lower cost without compromising the quality of longer reads. We show that this promising chemistry is problematic, however, when multiplexing samples. We discovered that up to 5-10% of sequencing reads (or signals) are incorrectly assigned from a given sample to other samples in a multiplexed pool. We provide evidence that this “spreading-of-signals” arises from low levels of free index primers present in the pool. These index primers can prime pooled library fragments at random via complementary 3’ ends, and get extended by DNA polymerase, creating a new library molecule with a new index before binding to the patterned flow cell to generate a cluster for sequencing. This causes the resulting read from that cluster to be assigned to a different sample, causing the spread of signals within multiplexed samples. We show that low levels of free index primers persist after the most common library purification procedure recommended by Illumina, and that the amount of signal spreading among samples is proportional to the level of free index primer present in the library pool. This artifact causes homogenization and misclassification of cells in single cell RNA-seq experiments. Therefore, all data generated in this way must now be carefully re-examined to ensure that “spreading-of-signals” has not compromised data analysis and conclusions. Re-sequencing samples using an older technology that uses conventional bridge amplification for cluster generation, or improved library cleanup strategies to remove free index primers, can minimize or eliminate this signal spreading artifact.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研小渣渣完成签到,获得积分10
1秒前
田様应助背后的飞飞采纳,获得10
1秒前
wzh完成签到,获得积分10
1秒前
小新发布了新的文献求助10
2秒前
zzz01218完成签到,获得积分20
2秒前
2秒前
HHW发布了新的文献求助10
2秒前
3秒前
洽洽鹰击发布了新的文献求助10
4秒前
FashionBoy应助科研狗采纳,获得10
4秒前
5秒前
充电宝应助liuyin采纳,获得10
6秒前
Jasper应助kuankuan采纳,获得10
6秒前
7秒前
7秒前
素衣完成签到,获得积分10
7秒前
8秒前
NexusExplorer应助甘愿采纳,获得10
9秒前
李宫俊发布了新的文献求助10
10秒前
11秒前
ZsJJkk发布了新的文献求助10
11秒前
12秒前
deardorff完成签到,获得积分10
13秒前
13秒前
13秒前
芋圆发布了新的文献求助10
13秒前
可可完成签到,获得积分10
14秒前
郭小胖14发布了新的文献求助10
14秒前
田様应助教生物的杨教授采纳,获得10
15秒前
Owen应助HHW采纳,获得10
15秒前
昭昭发布了新的文献求助10
15秒前
高镜涵发布了新的文献求助10
15秒前
16秒前
季春九完成签到,获得积分10
16秒前
晴朗泥泞发布了新的文献求助10
16秒前
斯文败类应助科研通管家采纳,获得10
16秒前
小蘑菇应助科研通管家采纳,获得10
16秒前
17秒前
17秒前
17秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Picture this! Including first nations fiction picture books in school library collections 2000
The Cambridge History of China: Volume 4, Sui and T'ang China, 589–906 AD, Part Two 1500
Cowries - A Guide to the Gastropod Family Cypraeidae 1200
Quality by Design - An Indispensable Approach to Accelerate Biopharmaceutical Product Development 800
Signals, Systems, and Signal Processing 610
The Oxford Handbook of Archaeology and Language 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6393780
求助须知:如何正确求助?哪些是违规求助? 8208835
关于积分的说明 17379904
捐赠科研通 5446900
什么是DOI,文献DOI怎么找? 2879741
邀请新用户注册赠送积分活动 1856202
关于科研通互助平台的介绍 1698963