均方误差
线性回归
偏最小二乘回归
决定系数
可滴定酸
数学
相关系数
统计
回归分析
回归
皮尔逊积矩相关系数
预测建模
分析化学(期刊)
化学
色谱法
食品科学
作者
Chrysanthi Chariskou,Christos Bazinas,Andries J. Daniels,Umezuruike Linus Opara,H.H. Nieuwoudt,Vassilis G. Kaburlasos
标识
DOI:10.23919/softcom55329.2022.9911292
摘要
Wavenumbers of high absolute value of correlation coefficient to Total Soluble Solids (TSS), pH, or Titratable Acidity (TA) were selected from reflection Fourier transform near infrared (FT -NIR) spectra of intact grape berries of the white variety Thompson Seedless. Multiple linear regression (MLR) and partial least squares (PLS) regression were applied to the spectra to construct trained regression models able to predict TSS, pH, and TA. Square Pearson's correlation coefficient (R2) and the Mean Square Error (MSE) were used to evaluate the precision of prediction. TSS content was predicted with R2 score of 0.972 and MSE 0.094 using MLR and with R2 0.926 and MSE 0.223 using PLS regression. The pH prediction scores were R2 0.812 and MSE 0.002 with MLR. With PLS regression the values were R2 0.485 and MSE 0.004. TA can be predicted only from the second derivatives of the spectra. MLR produced R2 for prediction 0.745 and MSE 0.076, while the scores using PLS regression were R2 0.648 and MSE 0.114. It was concluded that variable selection could greatly improve the prediction accuracy. The appropriateness of the two regression methods depends on the structure of the spectra dataset and on the characteristics whose prediction is sought.
科研通智能强力驱动
Strongly Powered by AbleSci AI