A bidirectional interpretable compound-protein interaction prediction framework based on cross attention

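Keywords: Interpretability; Computer science; Generalization; Artificial intelligence; Machine learning; Identification (biology); Artificial neural network; Cluster analysis; Data mining; Mathematics; Botany; Biology; Mathematical analysis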
Authors
Meng Wang,Jianmin Wang,Zhiwei Rong,Liuying Wang,Zhenyi Xu,Liuchao Zhang,Jia He,Shuang Li,Lei Cao,Yan Hou,Kang Li
Source
Journal: Computers in Biology and Medicine [Elsevier BV]
Volume 172, article 108239; cited by: 7
Identifier
DOI:10.1016/j.compbiomed.2024.108239
Abstract

The identification of compound-protein interactions (CPIs) plays a vital role in drug discovery. However, the high cost and labor-intensive nature of in vitro and in vivo experiments make it urgent to develop novel CPI prediction methods. Although emerging deep learning methods have achieved promising performance in CPI prediction, they still face three challenges: (i) providing bidirectional interpretability of the prediction results from both the chemical and the biological perspective; (ii) comprehensively evaluating model generalization performance; and (iii) demonstrating the practical applicability of these models. To address these challenges, we propose a cross multi-head attention oriented bidirectional interpretable CPI prediction model (CmhAttCPI). First, CmhAttCPI takes molecular graphs and protein sequences as inputs, using the GCW module to learn atom features and the CNN module to learn residue features. Second, the model applies a cross multi-head attention module to compute attention weights for atoms and residues. Finally, CmhAttCPI employs a fully connected neural network to predict scores for CPIs. We evaluated the performance of CmhAttCPI on both balanced and imbalanced datasets, and the results consistently show that CmhAttCPI outperforms multiple state-of-the-art methods. We also constructed three scenarios based on compound and protein clustering and comprehensively evaluated model generalization ability within them; the results demonstrate that the generalization ability of CmhAttCPI surpasses that of the other models. In addition, visualizations of the attention weights reveal that CmhAttCPI provides both chemical and biological interpretation for CPI prediction. Finally, case studies confirm the practical applicability of CmhAttCPI in discovering anticancer candidates.
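The cross multi-head attention step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the projections, head count, or feature sizes, so the code below assumes no learned projections (each head simply sees its own slice of the feature vector), and the names `cross_attention`, `atoms`, and `residues`, together with all dimensions, are hypothetical. It only shows how attention in both directions (atoms attending to residues, and residues attending to atoms) could yield two weight matrices that might be inspected for interpretation.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys_values, n_heads):
    """Scaled dot-product cross attention without learned projections.

    Assumption: keys and values are the same vectors, and each head works on
    its own contiguous slice of the feature dimension.
    Returns (context, weights): context has one vector per query, and
    weights[i][j] is the head-averaged attention of query i on key j.
    """
    dim = len(queries[0])
    assert dim % n_heads == 0, "feature size must be divisible by n_heads"
    head_dim = dim // n_heads
    context = [[0.0] * dim for _ in queries]
    weights = [[0.0] * len(keys_values) for _ in queries]
    for h in range(n_heads):
        lo, hi = h * head_dim, (h + 1) * head_dim
        for i, q in enumerate(queries):
            scores = [
                sum(a * b for a, b in zip(q[lo:hi], k[lo:hi])) / math.sqrt(head_dim)
                for k in keys_values
            ]
            probs = softmax(scores)
            for j, p in enumerate(probs):
                weights[i][j] += p / n_heads          # average over heads
                for d in range(lo, hi):               # per-head weighted sum
                    context[i][d] += p * keys_values[j][d]
    return context, weights


# Toy inputs standing in for learned atom and residue features (hypothetical sizes).
random.seed(0)
n_atoms, n_residues, dim = 5, 8, 4
atoms = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_atoms)]
residues = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_residues)]

# Direction 1: atoms query residues. Direction 2: residues query atoms.
atom_ctx, atom_to_res = cross_attention(atoms, residues, n_heads=2)
res_ctx, res_to_atom = cross_attention(residues, atoms, n_heads=2)

# One possible (assumed) per-residue importance: mean attention received from all atoms.
residue_importance = [
    sum(atom_to_res[i][j] for i in range(n_atoms)) / n_atoms
    for j in range(n_residues)
]
```

In this sketch each row of `atom_to_res` is a probability distribution over residues and each row of `res_to_atom` is a distribution over atoms; in a trained model the analogous matrices would be the quantities visualized for interpretation, and the context vectors would feed the downstream prediction layers.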