校准
偏最小二乘回归
主成分分析
多元统计
聚类分析
蒙特卡罗方法
计算机科学
近红外光谱
鉴定(生物学)
统计
化学计量学
变量(数学)
集合(抽象数据类型)
数学
模式识别(心理学)
人工智能
机器学习
物理
光学
数学分析
生物
植物
程序设计语言
作者
Xueguang Shao,Min Zhang,Wensheng Cai
出处
期刊:Analytical Methods
[Royal Society of Chemistry]
日期:2012-01-01
卷期号:4 (2): 467-467
被引量:22
摘要
Near-infrared (NIR) spectral analysis usually needs to take advantage of multivariate calibration. However, not all the variables in the spectra have equal contributions to a calibration model. Identification of informative variables is a key step to build a high performance model. According to the influence of a variable on the calibration model, influential variable (IV) is defined and a method for identification of IVs is proposed in this work. In the method, a set of partial least squares (PLS) models are built using a subset of variables selected randomly by Monte Carlo re-sampling, and then the clustering of these models are investigated by means of principal component analysis. The variables that make the models grouping can be identified as the IVs. Finally, the PLS model built with the selected IVs is adopted as the calibration model. Five NIR spectral datasets are used to test the performance and applicability of the method. The results show that the identified IVs are reasonable and the calibration model is efficient enough to produce accurate and reliable predictions.
科研通智能强力驱动
Strongly Powered by AbleSci AI