生物信息学
计算生物学
精确性和召回率
基因组
抗菌肽
计算机科学
基因组
Python(编程语言)
Web服务器
软件
生物
康蒂格
数据挖掘
人工智能
肽
基因
遗传学
程序设计语言
生物化学
互联网
万维网
作者
Célio Dias Santos Júnior,Shaojun Pan,Xing‐Ming Zhao,Luís Pedro Coelho
出处
期刊:PeerJ
[PeerJ, Inc.]
日期:2020-12-18
卷期号:8: e10555-e10555
被引量:39
摘要
Antimicrobial peptides (AMPs) have the potential to tackle multidrug-resistant pathogens in both clinical and non-clinical contexts. The recent growth in the availability of genomes and metagenomes provides an opportunity for in silico prediction of novel AMP molecules. However, due to the small size of these peptides, standard gene prospection methods cannot be applied in this domain and alternative approaches are necessary. In particular, standard gene prediction methods have low precision for short peptides, and functional classification by homology results in low recall.Here, we present Macrel (for metagenomic AMP classification and retrieval), which is an end-to-end pipeline for the prospection of high-quality AMP candidates from (meta)genomes. For this, we introduce a novel set of 22 peptide features. These were used to build classifiers which perform similarly to the state-of-the-art in the prediction of both antimicrobial and hemolytic activity of peptides, but with enhanced precision (using standard benchmarks as well as a stricter testing regime). We demonstrate that Macrel recovers high-quality AMP candidates using realistic simulations and real data.Macrel is implemented in Python 3. It is available as open source at https://github.com/BigDataBiology/macrel and through bioconda. Classification of peptides or prediction of AMPs in contigs can also be performed on the webserver: https://big-data-biology.org/software/macrel.
科研通智能强力驱动
Strongly Powered by AbleSci AI