计算机科学
数据挖掘
预处理器
聚类分析
背景(考古学)
数据预处理
微阵列分析技术
特征选择
基因芯片分析
DNA微阵列
机器学习
人工智能
基因
生物
基因表达
生物化学
古生物学
作者
Alessandro Fiori,Alberto Grand,Giulia Bruno,Francesco Brundu,Domenico Schioppa,Andrea Bertotti
标识
DOI:10.4018/jdm.2014010102
摘要
Nowadays, a huge amount of high throughput molecular data are available for analysis and provide novel and useful insights into complex biological systems, through the acquisition of a high-resolution picture of their molecular status in defined experimental conditions. In this context, microarrays are a powerful tool to analyze thousands of gene expression values with a single experiment. A number of approaches have been developed to detecting genes highly correlated to diseases, selecting genes that exhibit a similar behavior under specific conditions, building models to predict disease outcome based on genetic profiles, and inferring regulatory networks. This paper discusses popular and recent data mining techniques (i.e., Feature Selection, Clustering, Classification, and Association Rule Mining) applied to microarray data. The main characteristics of microarray data and preprocessing procedures are presented to understand the critical issues introduced by gene expression values analysis. Each technique is analyzed, and relevant examples of pertinent literature are reported. Moreover, real use cases exploiting analytic pipelines that use these methods are also introduced. Finally, future directions of data mining research on microarray data are envisioned.
科研通智能强力驱动
Strongly Powered by AbleSci AI