计算机科学
水准点(测量)
编码器
人工智能
构造(python库)
特征(语言学)
编码(内存)
机器学习
语言学
哲学
大地测量学
程序设计语言
地理
操作系统
作者
Wanshan Ning,Fengling Qin,Ziwei Zhou,Hang Yang,Chentan Li,Yaping Guo
摘要
Abstract Lysine lactylation (Kla), a novel lactate-derived post-translational modification, is involved in a myriad of biological processes and complex diseases. While several computational methods have been developed to identify Kla sites, these approaches still suffer from small datasets. In this work, we collected 23 984 Kla sites in 7297 proteins from the literature to construct the benchmark dataset. Leveraging recent advances in feature encoding, we tailored a multi-feature hybrid system, which integrated eight complementary feature–encoding strategies derived from two automated encoders and a composition-based module. Combining the hybrid system with deep learning, we presented our newly designed predictor named HybridKla, achieving an area under the curve (AUC) value of 0.8460. Compared to existing tools, HybridKla achieved >28.90% improvement of the AUC value (0.8460 versus 0.6563). we also conducted a proteome-wide search and provided a systematic prediction of Kla sites. The friendly online service of HybridKla is freely accessible for academic research at http://transkla.zzu.edu.cn/.
科研通智能强力驱动
Strongly Powered by AbleSci AI