Boosting(机器学习)
计算机科学
成对比较
机器学习
人工智能
特征选择
预测建模
产量(工程)
数据挖掘
材料科学
冶金
作者
Mehmet Furkan Çelik,Mustafa Serkan Işık,Gülşen Taşkın,Esra Erten,Gustau Camps‐Valls
标识
DOI:10.1109/lgrs.2023.3303643
摘要
Cotton is under the threat of climate and ecosystem change, and has an essential role in the global textile industry. This makes its yield prediction essential for both economics and sustainability. The potential cotton yield can be predicted by integrating climatic factors, soil parameters, and biophysical parameters observed by high temporal & spatial resolution remote sensing satellites. This study used a multisource dataset to create an explainable and accurate predictive model for cotton yield prediction over the continental US (CONUS). A recently proposed glass-box method called Explainable Boosting Machine (EBM), which provides transparency, reliability, and ease of interpretation, was implemented. Accuracy performance was compared with common machine learning (ML) methods for predicting cotton yields. The EBM showed higher accuracy against other glass-box methods and competitive results with black-box models. With the help of the EBM, the importance of individual features and their pairwise interactions was revealed without applying any post-hoc methods. The study findings showed that the precipitation (P), enhanced vegetation index (EVI), and leaf area index (LAI) are the three most important dynamic features. The dynamic features are the driver of the created model with 78% of the overall feature importance, followed by pairwise interactions of the features with 16% contribution. Lastly, static features contribute 6% to the overall feature importance. The study highlights the importance of using multi-source data and interactions of the input features and providing an interpretable model to understand the inner dynamics of cotton yield predictions.
科研通智能强力驱动
Strongly Powered by AbleSci AI