非核糖体肽
化学
腺苷酸化
领域(数学分析)
计算生物学
基因组
组合化学
生物化学
基因
生物合成
数学
生物
数学分析
作者
Zhihan Zhang,Yuyang Zhou,Shengling Xie,Run‐Zhou Liu,Zilei Huang,Pachaiyappan Saravana Kumar,Guozhong Feng,Fajie Yuan,Lihan Zhang
摘要
Nonribosomal peptides serve as pivotal sources for drug discovery. Accurate prediction of the substrate specificity of adenylation domains in nonribosomal peptide synthetases is crucial for genome mining of nonribosomal peptides, yet current prediction methods fall short in accuracy. In this work, we analyzed 4,100 adenylation domains from documented nonribosomal peptide synthetases and found that the flavodoxin-like subdomain universally governs substrate specificity in all bacterial adenylation domains and that its phylogenetic analysis can correlate the sequences of adenylation domains and their substrate specificity. Leveraging the sequences within the flavodoxin-like subdomain, we developed a substrate specificity prediction algorithm using a protein language model, achieving 92% overall prediction accuracy for 43 frequently observed amino acids, significantly improving the prediction reliability. The efficacy of our prediction tool was validated through targeted genome mining, which led to the discovery of novel antimicrobial peptides. Our work lays a foundation to understand the sequence-to-function relationship of the bacterial adenylation domain and will facilitate the exploitation of nonribosomal peptides. NRPStransformer is available at http://www.nrpstransformer.cn.
科研通智能强力驱动
Strongly Powered by AbleSci AI