可药性
计算生物学
鉴定(生物学)
计算机科学
药物发现
结合位点
药品
蛋白质组
药物靶点
药物数据库
血浆蛋白结合
序列(生物学)
资源(消歧)
药物开发
数据挖掘
生物信息学
蛋白质结构预测
机器学习
人工智能
作者
Minjie Mou,Mingxi Lu,Zhimeng Zhou,Yanlin Ren,Xinyuan Yu,Ziqi Pan,Yuan Zhou,Hao Yang,Lingyan Zheng,GU Shukai,Yang Zhang,Wei Hu,F B Li,Haibin Dai,Feng Zhu
标识
DOI:10.1002/advs.202516530
摘要
ABSTRACT Proteins interact with diverse molecular modalities, yet the incomplete identification of their binding sites has left the proteome‐wide druggability largely underexplored. Although various computational methods have been developed for the prediction of protein binding sites, existing approaches are limited by their specificity to a single drug modality, dependence on high‐quality structural data, or insufficient predictive accuracy. Here, a unified sequence‐based framework, ALLSites, is constructed to identify proteome‐wide binding sites across all drug modalities. Leveraging ESM‐2 embeddings, ALLSites integrates a gated convolutional network with a transformer architecture to capture both global and local sequence features, effectively modeling residue interactions directly from sequence. This design bridges the gap between sequence‐based and structure‐based approaches, enabling ALLSites to achieve superior predictive performance across diverse drug modalities, including proteins, peptides, small molecules, carbohydrates, DNA, and RNA. It achieves state‐of‐the‐art performance among sequence‐based methods and matches the accuracy of the best structure‐based tools. By enabling accurate and structure‐free binding site prediction across all drug modalities, ALLSites is expected to expand the druggable proteome and provide a powerful resource for drug discovery.
科研通智能强力驱动
Strongly Powered by AbleSci AI