索引
INDEL突变
突变
计算机科学
序列(生物学)
计算生物学
人工智能
特征(语言学)
遗传学
生物
基因
单核苷酸多态性
语言学
基因型
哲学
摘要
Inframe insertion/deletion (indel) mutations play a crucial role in human genetic variation and are closely associated with various diseases. Currently, most methods for predicting pathogenic inframe indel mutations are based on machine learning, utilizing manually engineered features for prediction, which may overlook certain mutation data. Moreover, existing deep learning methods mainly focus on protein sequence features, neglecting other aspects. This paper proposes a method for predicting pathogenic inframe indel mutations, termed DPPred-indel, based on a combination of a biological language model and feature fusion. DPPred-indel leverages transfer learning to address the limited data availability and integrates features from both protein and DNA sequence levels for prediction. Subsequent experiments validate the effectiveness of transfer learning and feature fusion. DPPred-indel demonstrates comparable performance to state-of-the-art methods on two independent test sets constructed in this study.
科研通智能强力驱动
Strongly Powered by AbleSci AI