自编码
算法
人工智能
联想(心理学)
环状RNA
遗传算法
数学
计算机科学
核糖核酸
计算生物学
遗传学
生物
机器学习
人工神经网络
基因
认识论
哲学
作者
C. M. Salooja,Arjun Sanker,K. Deepthi,A. S. Jereesh
标识
DOI:10.1142/s0219720024500185
摘要
Circular RNAs (circRNAs) are endogenous non-coding RNAs with a covalently closed loop structure. They have many biological functions, mainly regulatory ones. They have been proven to modulate protein-coding genes in the human genome. CircRNAs are linked to various diseases like Alzheimer’s disease, diabetes, atherosclerosis, Parkinson’s disease and cancer. Identifying the associations between circular RNAs and diseases is essential for disease diagnosis, prevention, and treatment. The proposed model, based on the variational autoencoder and genetic algorithm circular RNA disease association (VAGA-CDA), predicts novel circRNA-disease associations. First, the experimentally verified circRNA-disease associations are augmented with the synthetic minority oversampling technique (SMOTE) and regenerated using a variational autoencoder, and feature selection is applied to these vectors by a genetic algorithm (GA). The variational autoencoder effectively extracts features from the augmented samples. The optimized feature selection of the genetic algorithm effectively carried out dimensionality reduction. The sophisticated feature vectors extracted are then given to a Random Forest classifier to predict new circRNA-disease associations. The proposed model yields an AUC value of 0.9644 and 0.9628 under 5-fold and 10-fold cross-validations, respectively. The results of the case studies indicate the robustness of the proposed model.
科研通智能强力驱动
Strongly Powered by AbleSci AI