亚细胞定位
蛋白质亚细胞定位预测
计算机科学
人工智能
序列(生物学)
图形
蛋白质测序
计算生物学
细胞质
蛋白质结构
嵌入
蛋白质靶向
肽序列
蛋白质-蛋白质相互作用
卷积神经网络
核定位序列
语言模型
模式识别(心理学)
网络模型
生物
钥匙(锁)
深度学习
UniProt公司
序列比对
作者
Chen Song,Shidong Wu,Xiankun Zhang,Mengyu Li,Tao Li
标识
DOI:10.1109/tcbbio.2025.3634612
摘要
Proteins serve as the essential executors of cellular activities, and their mislocalization often resulting in diverse diseases. Traditional methods for determining protein subcellular localization are noted for their time-consuming, labor-intensive, and complex nature. To address these challenges, this study introduces SubLoc, a deep learning-based algorithm for predicting protein subcellular localization. The methodology comprises three key steps: Firstly, leveraging the deep pretrained protein language model ProtT5 to derive protein embedding vectors, thereby capturing intricate patterns and biological functionalities of protein sequences. Secondly, constructing a 3D protein structure graph model using amino acid residue contact relationships within the sequence, which is subsequently processed by a graph convolutional network to effectively manage spatial structural information. Lastly, employing a bidirectional gated recurrent unit and multi-head attention mechanism to analyze sequence features, integrating both structural and sequence data for enhanced subcellular localization prediction. Experimental results demonstrate that SubLoc exhibits exceptional performance in localizing proteins across 10 subcellular compartments, outperforming all comparative methods in terms of precision, recall, and MCC average values. Notably, SubLoc achieves particularly notable results in identifying Cytoplasm and Mitochondrion locations.
科研通智能强力驱动
Strongly Powered by AbleSci AI