计算机科学
水准点(测量)
人工智能
深度学习
卷积神经网络
特征(语言学)
机器学习
集合(抽象数据类型)
利用
特征工程
序列(生物学)
化学
哲学
生物化学
语言学
程序设计语言
地理
计算机安全
大地测量学
作者
Xuetong Yang,Junru Jin,Ruheng Wang,Zhongshen Li,Yu Wang,Leyi Wei
标识
DOI:10.1021/acs.jcim.3c00297
摘要
Anticancer peptides (ACPs) recently have been receiving increasing attention in cancer therapy due to their low consumption, few adverse side effects, and easy accessibility. However, it remains a great challenge to identify anticancer peptides via experimental approaches, requiring expensive and time-consuming experimental studies. In addition, traditional machine-learning-based methods are proposed for ACP prediction mainly depending on hand-crafted feature engineering, which normally achieves low prediction performance. In this study, we propose CACPP (Contrastive ACP Predictor), a deep learning framework based on the convolutional neural network (CNN) and contrastive learning for accurately predicting anticancer peptides. In particular, we introduce the TextCNN model to extract the high-latent features based on the peptide sequences only and exploit the contrastive learning module to learn more distinguishable feature representations to make better predictions. Comparative results on the benchmark data sets indicate that CACPP outperforms all the state-of-the-art methods in the prediction of anticancer peptides. Moreover, to intuitively show that our model has good classification ability, we visualize the dimension reduction of the features from our model and explore the relationship between ACP sequences and anticancer functions. Furthermore, we also discuss the influence of data set construction on model prediction and explore our model performance on the data sets with verified negative samples.
科研通智能强力驱动
Strongly Powered by AbleSci AI