随机森林
代谢组学
人工智能
机器学习
化学
线性判别分析
偏最小二乘回归
特征选择
掺假者
计算机科学
色谱法
作者
Fawzan Sigma Aurum,Muhammad Zukhrufuz Zaman,Edi Purwanto,Danar Praseptiangga,K. Nakano
出处
期刊:Food bioscience
[Elsevier BV]
日期:2023-09-13
卷期号:56: 103122-103122
被引量:5
标识
DOI:10.1016/j.fbio.2023.103122
摘要
Coffee is an export commodity that is prone to fraudulent practices. Therefore, this study presents a novel approach to authenticate coffee origins using targeted metabolomics with gas chromatography-tandem mass spectrometry (GC-MS/MS) and machine learning models. A total of 200 coffee samples from different harvest years and areas from Indonesia were extracted using the derivatisation method and then analysed for their metabolite profiles. Several supervised machine-learning models were tested to classify coffee origins and discover their potential markers. The study found various metabolite features spanning diverse chemical classes, encompassing sugar alcohols, carbohydrates, amino acids, organic acids, fatty acids, and phenols. Random forest (RF) and partial least squares discriminant analysis (PLS-DA) were among the most accurate models in predicting the origin of coffee from several classes in the validation dataset. The accuracy of both models is in the range of 91%–100%. Furthermore, this study proposes a new strategy for determining "intersection features" as the set of features that are important and common to both RF and PLS-DA models, thereby providing a robust selection of coffee origin markers. Overall, the approach and findings of this study have far-reaching implications for coffee authentication.
科研通智能强力驱动
Strongly Powered by AbleSci AI