Compare the performance of multiple binary classification models in microbial high-throughput sequencing datasets

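Support vector machine; Artificial intelligence; Random forest; Computer science; Binary number; Machine learning; Binary classification; Artificial neural network; Estimator; Predictive modelling; Data mining; Pattern recognition (psychology); Mathematics; Statistics; Arithmetic
Authors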
Nuohan Xu,Zhenyan Zhang,Yechao Shen,Qi Zhang,Zhen Liu,Yitian Yu,Yan Wang,Chaotang Lei,Mingjing Ke,Danyan Qiu,Tao Lu,Yi‐Ling Chen,Juntao Xiong,Haifeng Qian
Source
Journal: Science of The Total Environment [Elsevier]
Volume/issue: 837: 155807-155807 Cited by: 4
Identifier
DOI:10.1016/j.scitotenv.2022.155807
Abstract

Advances in machine learning and deep learning have provided ways to predict microbiota responses to environmental change from microbial high-throughput sequencing data. However, few studies have specifically examined the performance and practicality of these two types of binary classification models in order to identify a better algorithm for microbiota data analysis. Here, for the first time, we evaluated the performance, accuracy and running time of binary classification models built with three machine learning methods, random forest (RF), support vector machine (SVM) and logistic regression (LR), and one deep learning method, back propagation neural network (BPNN). The models were built on microbiota datasets from which low-quality variables had been removed and in which the class imbalance problem had been addressed. Additionally, we optimized the models by tuning. Our study demonstrated that dataset pre-processing was a necessary step in model construction. Among the four binary classification models, BPNN and RF were the most suitable methods for constructing microbiota binary classification models. When the four models were used to predict multiple microbial datasets, BPNN showed the highest accuracy and the most robust performance, while RF ranked second. We also constructed the optimal models by adjusting the epochs of BPNN and the n_estimators of RF six times. This evaluation of model performance provides a road map for applying artificial intelligence to the assessment of microbial ecology.
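The abstract does not say how low-quality variables were removed or how class imbalance was handled. As a rough illustration only, the sketch below assumes a prevalence filter (drop taxa that are non-zero in too few samples) and random oversampling of the minority class. The function names, the 0.3 threshold and the toy abundance table are all hypothetical and are not taken from the paper.

```python
import random
from collections import Counter


def filter_low_prevalence(samples, min_prevalence=0.2):
    """Keep feature columns that are non-zero in at least
    min_prevalence of the samples (assumed quality criterion)."""
    n_samples = len(samples)
    n_features = len(samples[0])
    keep = [
        j for j in range(n_features)
        if sum(1 for row in samples if row[j] > 0) / n_samples >= min_prevalence
    ]
    return [[row[j] for j in keep] for row in samples], keep


def random_oversample(samples, labels, seed=0):
    """Duplicate randomly chosen rows of each smaller class until every
    class has as many rows as the largest one (assumed balancing method)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_x, out_y = list(samples), list(labels)
    for cls, n in counts.items():
        pool = [row for row, lab in zip(samples, labels) if lab == cls]
        for _ in range(target - n):
            out_x.append(list(rng.choice(pool)))
            out_y.append(cls)
    return out_x, out_y


# Hypothetical abundance table: 6 samples (rows) x 4 taxa (columns).
X = [
    [12, 0, 3, 0],
    [8, 0, 0, 0],
    [15, 1, 4, 0],
    [9, 0, 2, 0],
    [0, 0, 7, 0],
    [1, 0, 9, 5],
]
y = [0, 0, 0, 0, 1, 1]  # binary labels, class 1 is the minority

X_filtered, kept = filter_low_prevalence(X, min_prevalence=0.3)
X_balanced, y_balanced = random_oversample(X_filtered, y)

print("kept taxa columns:", kept)
print("class counts after balancing:", dict(Counter(y_balanced)))
```

In a real comparison, oversampling would normally be applied to the training split only, so that duplicated rows do not leak into the data used to measure accuracy.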