计算生物学
亚细胞定位
序列(生物学)
人工智能
蛋白质测序
特征(语言学)
氨基酸
自然语言处理
模式识别(心理学)
肽序列
计算机科学
生物
生物化学
语言学
基因
哲学
作者
Junhao Liu,Zeyu Luo,Yawen Sun,Rui Wang,Xin Li,Xinyun Ye,Dong‐Qing Wei,Yu‐Juan Zhang
出处
期刊:Journal of computational biophysics and chemistry
[World Scientific]
日期:2025-03-22
卷期号:25 (01): 1-23
标识
DOI:10.1142/s2737416525500322
摘要
Machine learning algorithms have revolutionized the study of protein subcellular localization; however, their black-box nature limits their interpretability. To address this, we built upon ProtLoc-Mex1, an automated pipeline incorporating interpretation techniques, by integrating chemical and GO annotation features to create a transparent random forest predictor. When applied to membrane protein type distinction and subcellular localization prediction, ProtLoc-Mex1 identified important features, explored the feature interaction effect and explored the functional feature semantic representation in language models, deepening our understanding of protein targeting mechanisms. We also created modules to facilitate their use for various prediction understanding in machine learning systems and provide a valuable resource for the scientific community. The code is available at ( https://github.com/yujuan-zhang/ProtLoc-mexl ).
科研通智能强力驱动
Strongly Powered by AbleSci AI