随机森林
基因
朴素贝叶斯分类器
基因调控网络
决策树
支持向量机
计算机科学
计算生物学
机器学习
生物信息学
生物
基因表达
遗传学
作者
Sona Charles,Jeyakumar Natarajan
标识
DOI:10.47852/bonviewmedin32021554
摘要
Tetralogy of Fallot (TOF) is a combinatorial congenital abnormality comprising of ventricular septal defect (VSD), pulmonary valve stenosis, a misplaced aorta and a thickened right ventricular wall. Biologically relevant module identification from transcriptome data may be considered as a binary classification problem. We utilized publicly accessible mRNA expression data to extract the differentially expressed genes (DEGs) and further weighted gene co-expression network analysis to identify ten modules in TOF. Network topological properties of modular and non-modular genes were considered as features for binary classification. We applied SVM, Random Forest, Decision Trees, KNN and Naïve Bayes algorithm to network features. Random Forest and decision tree algorithms displayed an accuracy of 99.1% and 98% respectively. All the methods, in combination predicted 71 common genes which were used to construct a gene regulatory network. The network was expanded to include 30 miRNAs targeting the genes. Interestingly, 39 out of 71 genes were transcription factors out of which ELN, SOX6 and FOXO3 genes are novel candidates in TOF. The work also provides a sub-module of genes and miRNAs supported by statistical models as prospective candidates to be biomarkers. Received: 18 August 2023 | Revised: 27 September 2023 | Accepted: 8 October 2023 Conflicts of Interest The authors declare that they have no conflicts of interest to this work. Data Availability Statement The data that support this work are available upon reasonable request to the corresponding author.
科研通智能强力驱动
Strongly Powered by AbleSci AI