Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning

生物群落 微生物群 计算机科学 注释 数据科学 人工智能 生物 生态学 生态系统 生物信息学
作者
Nan Wang,Teng Wang,Kang Ning
出处
期刊:Environmental science & ecotechnology [Elsevier]
卷期号:17: 100304-100304
标识
DOI:10.1016/j.ese.2023.100304
摘要

Microbiome research has generated an extensive amount of data, resulting in a wealth of publicly accessible samples. Accurate annotation of these samples is crucial for effectively utilizing microbiome data across scientific disciplines. However, a notable challenge arises from the lack of essential annotations, particularly regarding collection location and sample biome information, which significantly hinders environmental microbiome research. In this study, we introduce Meta-Sorter, a novel approach utilizing neural networks and transfer learning, to enhance biome labeling for thousands of microbiome samples in the MGnify database that have incomplete information. Our findings demonstrate that Meta-Sorter achieved a remarkable accuracy rate of 96.7% in classifying samples among the 16,507 lacking detailed biome annotations. Notably, Meta-Sorter provides precise classifications for representative environmental samples that were previously ambiguously labeled as "Marine" in MGnify, thereby elucidating their specific origins in benthic and water column environments. Moreover, Meta-Sorter effectively distinguishes samples derived from human-environment interactions, enabling clear differentiation between environmental and human-related studies. By improving the completeness of biome label information for numerous microbial community samples, our research facilitates more accurate knowledge discovery across diverse disciplines, with particular implications for environmental research.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
在水一方应助juice采纳,获得10
1秒前
SciGPT应助着急的续采纳,获得10
1秒前
2秒前
Zero完成签到,获得积分10
2秒前
小阁子完成签到,获得积分10
4秒前
bingbingbing完成签到,获得积分10
4秒前
123444完成签到,获得积分10
4秒前
shinysparrow应助含糊的猪采纳,获得30
5秒前
Zero发布了新的文献求助10
5秒前
123444发布了新的文献求助10
6秒前
丘比特应助luo采纳,获得10
7秒前
7秒前
Deerlet完成签到,获得积分10
7秒前
kaka完成签到 ,获得积分10
8秒前
殷勤的书包完成签到,获得积分10
8秒前
Cinderella完成签到,获得积分10
8秒前
8秒前
9秒前
我是老大应助柯南采纳,获得10
11秒前
隐形曼青应助子车半烟采纳,获得10
11秒前
11秒前
12秒前
12秒前
www完成签到 ,获得积分10
12秒前
juice发布了新的文献求助10
12秒前
cha236发布了新的文献求助200
12秒前
Touching完成签到 ,获得积分10
14秒前
OUCER发布了新的文献求助10
15秒前
15秒前
用九发布了新的文献求助10
16秒前
16秒前
16秒前
GODB1ACK应助顾宇采纳,获得10
18秒前
19秒前
21秒前
22秒前
22秒前
22秒前
22秒前
科研小菜发布了新的文献求助10
22秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Sport in der Antike 800
De arte gymnastica. The art of gymnastics 600
Berns Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
Stephen R. Mackinnon - Chen Hansheng: China’s Last Romantic Revolutionary (2023) 500
Sport in der Antike Hardcover – March 1, 2015 500
Boris Pesce - Gli impiegati della Fiat dal 1955 al 1999 un percorso nella memoria 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2422058
求助须知:如何正确求助?哪些是违规求助? 2111559
关于积分的说明 5345491
捐赠科研通 1839069
什么是DOI,文献DOI怎么找? 915501
版权声明 561201
科研通“疑难数据库(出版商)”最低求助积分说明 489590