计算机科学
一般化
背景(考古学)
图形
序列(生物学)
人工智能
机器学习
钥匙(锁)
数据挖掘
源代码
蛋白质-蛋白质相互作用
丙氨酸扫描
理论计算机科学
蛋白质测序
计算模型
编码(集合论)
生物学数据
模拟生物系统
计算生物学
航程(航空)
计算复杂性理论
语义学(计算机科学)
序列比对
数据类型
作者
Ke Yan,Tao Liu,Xinxin Zhan,Shutao Chen,Moran Li,Tianqi Hu,Bin Liu
标识
DOI:10.1021/acs.jcim.5c03139
摘要
Peptide-Protein Interactions (PepPIs) are essential to a wide range of biological processes, including gene regulation, cellular homeostasis, and metabolic modulation. Researchers have developed several computational deep learning predictors based on the sequence information to predict the PepPIs. However, the generalization performance of most computational methods is constrained by the limited protein-peptide complexes in the RCSB Protein Data Bank database. Moreover, it is challenging to utilize the complex context of proteins and peptides to predict PepPIs. In this study, we propose HGT-PepPI, a heterogeneous graph-based framework designed for PepPIs prediction. The peptide and protein sequences are initialized as heterogeneous nodes with semantic representations using the ProtT5 model. The three multirelational edges are constructed by integrating sequence semantic information, evolutionary conservation profiles, and experimentally validated interactions between proteins and peptides, respectively. By constructing a graph that inherently integrates multiple types of biological information, our method achieves superior generalization by learning transferable patterns of interaction semantics. Moreover, the proposed method employs the message-passing operations to capture the local sequence characteristics and global complex contextual dependencies, thereby enabling a comprehensive modeling of interaction semantics. Experimental results demonstrate that HGT-PepPI outperforms the existing state-of-the-art approaches in both predictive performance and robustness. In addition, we designed an alanine scanning mutagenesis experiment and a binding affinity experiment, which successfully verified the model's ability to identify key residues and guide peptide drug design. The data and source code of HGT-PepPI can be publicly accessible via http://bliulab.net/HGT-PepPI.
科研通智能强力驱动
Strongly Powered by AbleSci AI