High resolution shotgun metagenomics: the more data, the better?

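Genome, Shotgun sequencing, Deep sequencing, Workflow, DNA sequencing, Sample (material), Computer science, Shotgun, Data mining, Computational biology, Biology, Database, Genetics, Gene, Chemistry, Chromatography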
Authors
Julien Tremblay,Charles W. Greer
Identifier
DOI:10.1101/2022.04.19.488797
Abstract

In shotgun metagenomics (SM), state-of-the-art bioinformatic workflows are referred to as high resolution shotgun metagenomics (HRSM) and require intensive computing and disk storage resources. While the increased data output of the latest high-throughput DNA sequencing systems allows unprecedented sequencing depth at minimal cost, HRSM workflows will need adjustments to properly process these ever-growing sequence datasets. One potential adaptation is to generate so-called shallow SM datasets that contain less sequencing data per sample than classic high-coverage sequencing. While shallow sequencing is a promising avenue for SM data analysis, detailed benchmarks using real data are lacking. In this case study, we took four public SM datasets, one massive and the others moderate in size, and subsampled each dataset at various levels to mimic shallow sequencing datasets of various sequencing depths. Our results suggest that shallow SM sequencing is a viable avenue to obtain sound results regarding microbial community structures and that high-depth sequencing does not bring additional elements for ecological interpretation. More specifically, results obtained by subsampling as few as 0.5M sequencing clusters per sample were similar to the results obtained with the largest subsampled dataset for the human gut and agricultural soil datasets. For the Antarctic dataset, which contained only a few samples, 4M sequencing clusters per sample were found to generate results comparable to the full dataset. One area where ultra-deep sequencing and maximizing the usage of all data was undeniably beneficial was the generation of metagenome-assembled genomes (MAGs).
Key points
- Three public multi-sample shotgun metagenomic NovaSeq datasets totalling 12,389, 583 and 202 Gb, respectively, were analyzed at various sequencing depths to evaluate the accuracy of shallow shotgun metagenomic sequencing using a high resolution shotgun metagenomic bioinformatic workflow. A synthetic mock community of 20 bacterial genomes was also analyzed for validation purposes.
- Datasets subsampled to low sequencing depths gave nearly identical ecological patterns (taxonomic and functional composition, and beta- and alpha-diversity) compared to datasets subsampled to high depths.
- Rare taxa and functions could be uncovered with high sequencing depth but not with low sequencing depth datasets; they did not affect global ecological patterns.
- High sequencing depth was positively correlated with both the quantity and the quality of recovered metagenome-assembled genomes.
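The subsampling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: this record does not say which tool they used, so the function name `subsample`, the reservoir-sampling approach, and the toy read-pair records are all assumptions made for illustration. Each record stands in for one sequencing cluster (one read pair in paired-end sequencing).

```python
import random
from typing import Iterable, List, TypeVar

T = TypeVar("T")


def subsample(records: Iterable[T], depth: int, seed: int = 0) -> List[T]:
    """Return up to `depth` records drawn uniformly at random in one pass.

    Hypothetical helper (reservoir sampling); not taken from the paper.
    """
    rng = random.Random(seed)  # fixed seed so a given depth is reproducible
    reservoir: List[T] = []
    for i, record in enumerate(records):
        if i < depth:
            # Fill the reservoir with the first `depth` records.
            reservoir.append(record)
        else:
            # Replace a random slot with probability depth / (i + 1).
            j = rng.randint(0, i)  # inclusive on both ends
            if j < depth:
                reservoir[j] = record
    return reservoir


# Toy stand-in for one sample's read pairs (one tuple per sequencing cluster).
pairs = [(f"read{i}/1", f"read{i}/2") for i in range(1000)]

# Mimic several shallow sequencing depths from the same sample.
for depth in (10, 50, 200):
    shallow = subsample(pairs, depth, seed=42)
    print(depth, len(shallow))
```

Sampling whole pairs rather than individual reads keeps mates together, and a single pass avoids loading a large dataset into memory. Real FASTQ files would need a parser in front of this function.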