清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Alignment-Free Antimicrobial Peptide Predictors: Improving Performance by a Thorough Analysis of the Largest Available Data Set

标杆管理 集合(抽象数据类型) 数据集 机器学习 计算机科学 随机森林 数据挖掘 人工智能 营销 业务 程序设计语言
作者
Sergio A. Pinacho-Castellanos,César R. García‐Jacas,Michael K. Gilson,Carlos A. Brizuela
出处
期刊:Journal of Chemical Information and Modeling [American Chemical Society]
卷期号:61 (6): 3141-3157 被引量:78
标识
DOI:10.1021/acs.jcim.1c00251
摘要

In the last two decades, a large number of machine-learning-based predictors for the activities of antimicrobial peptides (AMPs) have been proposed. These predictors differ from one another in the learning method and in the training and testing data sets used. Unfortunately, the training data sets present several drawbacks, such as a low representativeness regarding the experimentally validated AMP space, and duplicated peptide sequences between negative and positive data sets. These limitations give a low confidence to most of the approaches to be used in prospective studies. To address these weaknesses, we propose novel modeling and assessing data sets from the largest experimentally validated nonredundant peptide data set reported to date. From these novel data sets, alignment-free quantitative sequence-activity models (AF-QSAMs) based on Random Forest are created to identify general AMPs and their antibacterial, antifungal, antiparasitic, and antiviral functional types. An applicability domain analysis is carried out to determine the reliability of the predictions obtained, which, to the best of our knowledge, is performed for the first time for AMP recognition. A benchmarking is undertaken between the models proposed and several models from the literature that are freely available in 13 programs (ClassAMP, iAMP-2L, ADAM, MLAMP, AMPScanner v2.0, AntiFP, AMPfun, PEPred-suite, AxPEP, CAMPR3, iAMPpred, APIN, and Meta-iAVP). The models proposed are those with the best performance in all of the endpoints modeled, while most of the methods from the literature have weak-to-random predictive agreements. The models proposed are also assessed through Y-scrambling and repeated k-fold cross-validation tests, demonstrating that the outcomes obtained by them are not given by chance. Three chemometric analyses also confirmed the relevance of the peptides descriptors used in the modeling. Therefore, it can be concluded that the models built by fixing the drawbacks existing in the literature contribute to identifying antibacterial, antifungal, antiparasitic, and antiviral peptides with high effectivity and reliability. Models are freely available via the AMPDiscover tool at https://biocom-ampdiscover.cicese.mx/.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Mine发布了新的文献求助10
2秒前
Mine完成签到,获得积分10
15秒前
yf完成签到,获得积分10
1分钟前
蓝意完成签到,获得积分0
1分钟前
汉堡包应助qaz111222采纳,获得10
2分钟前
2分钟前
qaz111222发布了新的文献求助10
2分钟前
125mmD91T完成签到,获得积分10
2分钟前
w0304hf完成签到,获得积分10
2分钟前
老戎完成签到 ,获得积分10
2分钟前
qaz111222完成签到,获得积分10
2分钟前
文静灵阳完成签到 ,获得积分10
2分钟前
曾经不言完成签到 ,获得积分10
2分钟前
hj完成签到 ,获得积分10
2分钟前
烟花应助科研通管家采纳,获得10
3分钟前
喻初原完成签到 ,获得积分10
4分钟前
安安爱阎魔完成签到,获得积分10
5分钟前
5分钟前
知行者完成签到 ,获得积分10
6分钟前
大医仁心完成签到 ,获得积分10
6分钟前
飞龙在天完成签到 ,获得积分10
7分钟前
成就小蜜蜂完成签到 ,获得积分10
8分钟前
liuye0202完成签到,获得积分10
8分钟前
机智的苗条完成签到,获得积分10
10分钟前
成就的香菇完成签到,获得积分10
10分钟前
鸡鸡大魔王完成签到,获得积分10
10分钟前
喜悦的唇彩完成签到,获得积分10
10分钟前
羞涩的问兰完成签到,获得积分10
10分钟前
丰富的亦寒完成签到,获得积分10
10分钟前
标致初曼完成签到,获得积分10
10分钟前
哈哈哈完成签到,获得积分10
10分钟前
luo完成签到,获得积分10
10分钟前
螺丝炒钉子完成签到,获得积分10
10分钟前
晴空万里完成签到 ,获得积分10
11分钟前
Xu完成签到 ,获得积分10
11分钟前
果冻完成签到 ,获得积分10
11分钟前
传奇3应助黄如果被采纳,获得30
12分钟前
Eva完成签到,获得积分20
12分钟前
12分钟前
Eva发布了新的文献求助30
13分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Emmy Noether's Wonderful Theorem 1200
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
基于非线性光纤环形镜的全保偏锁模激光器研究-上海科技大学 800
Signals, Systems, and Signal Processing 610
Research Methods for Business: A Skill Building Approach, 9th Edition 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6410665
求助须知:如何正确求助?哪些是违规求助? 8229918
关于积分的说明 17463336
捐赠科研通 5463597
什么是DOI,文献DOI怎么找? 2886946
邀请新用户注册赠送积分活动 1863321
关于科研通互助平台的介绍 1702496