代谢组学
数据挖掘
规范化(社会学)
偏最小二乘回归
主成分分析
数据库规范化
多元统计
预处理器
数据预处理
计算机科学
缺少数据
数据处理
线性判别分析
模式识别(心理学)
人工智能
机器学习
生物信息学
生物
操作系统
社会学
人类学
作者
Hans Rolando Zamora-Obando,Gustavo Henrique Bueno Duarte,Ana Valéria Colnaghi Simionato
标识
DOI:10.1007/978-3-030-77252-9_12
摘要
The present chapter describes basic aspects of the main steps for data processing on mass spectrometry-based metabolomics platforms, focusing on the main objectives and important considerations of each step. Initially, an overview of metabolomics and the pivotal techniques applied in the field are presented. Important features of data acquisition and preprocessing such as data compression, noise filtering, and baseline correction are revised focusing on practical aspects. Peak detection, deconvolution, and alignment as well as missing values are also discussed. Special attention is given to chemical and mathematical normalization approaches and the role of the quality control (QC) samples. Methods for uni- and multivariate statistical analysis and data pretreatment that could impact them are reviewed, emphasizing the most widely used multivariate methods, i.e., principal components analysis (PCA), partial least squares-discriminant analysis (PLS-DA), orthogonal partial least square-discriminant analysis (OPLS-DA), and hierarchical cluster analysis (HCA). Criteria for model validation and softwares used in data processing were also approached. The chapter ends with some concerns about the minimal requirements to report metadata in metabolomics.
科研通智能强力驱动
Strongly Powered by AbleSci AI