代谢组学
主成分分析
过度拟合
化学计量学
化学
线性判别分析
偏最小二乘回归
代谢物
质子核磁共振
人工智能
模式识别(心理学)
计算生物学
机器学习
计算机科学
色谱法
人工神经网络
生物化学
生物
立体化学
作者
Lin Jiang,Hunter Sullivan,Bo Wang
标识
DOI:10.1080/00032719.2021.2019758
摘要
Metabolomics is an interdisciplinary area that integrates knowledge of instrumentation, data science, and biochemistry. Metabolomics studies the changes in a large number of metabolites after various treatments using analytical platforms. However, the interpretation approaches have not been completely investigated. Principal component analysis (PCA) is an unsupervised method that describes high throughput metabolite data, which is different from supervised approaches such as partial least-squares discriminant analysis (PLS-DA) which frequently has overfitting problems. The interpretation of PCA loadings, especially for studies with multiple study groups, is not well developed for metabolomics. In this study, a new method was reported that integrates PCA loading values with the commonly used statistical t-test analysis to significantly improve the convenience and efficiency of interpretation. The method was demonstrated using practical studies of NMR metabolomics on the extracts from sea anemone that were treated with six atrazine concentrations. The results indicated that the approach is suitable for multiple groups of metabolomics for early-stage discoveries, such as low concentrations and potentially longitudinal studies. In summary, this methodology may be critical in studies such as environmental metabolomics with various stimuli factors where the data interpretation was previously incompletely developed.
科研通智能强力驱动
Strongly Powered by AbleSci AI