Evaluating the extrapolation potential of random forest digital soil mapping

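Extrapolation; Random forest; Topsoil; Similarity (geometry); Digital soil mapping; Feature (linguistics); Range (aeronautics); Statistics; Soil classification; Index (typography); Identification (biology); Environmental science; Computer science; Mathematics; Data mining; Soil science; Machine learning; Soil water; Artificial intelligence; Engineering; Ecology; Linguistics; Philosophy; Biology; World Wide Web; Image (mathematics); Aerospace engineering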
Authors
Fatemeh Hateffard, Luc Steinbuch, G.B.M. Heuvelink
Source
Journal: Geoderma [Elsevier BV]
Volume/issue: 441: 116740-116740 Cited by: 4
Identifier
DOI:10.1016/j.geoderma.2023.116740
Abstract

Spatial soil information is essential for informed decision-making in a wide range of fields. Digital soil mapping (DSM) using machine learning algorithms has become a popular approach for generating soil maps. DSM capitalises on the relation between environmental variables (i.e., features) and a soil property of interest. It typically needs a training dataset that covers the feature space well. Mapping in areas where there are no training data is challenging, because extrapolation in geographic space often induces extrapolation in feature space and can seriously deteriorate prediction accuracy. The objective of this study was to analyse the extrapolation effects of random forest DSM models by predicting topsoil properties (OC, clay, and pH) in four African countries using soil data from the ISRIC Africa Soil Profiles database. The study was conducted in eight experiments whereby soil data from one or three countries were used to predict in the other countries. We calculated similarities between donor and recipient areas using four measures, including soil type similarity, homosoil, dissimilarity index by area of applicability (AOA), and quantile regression forest (QRF) prediction interval width. The aim was to determine the level of agreement between these four measures and identify the method that had the strongest agreement with common validation metrics. The results indicated a positive correlation between soil type similarity, homosoil and dissimilarity index by AOA. Surprisingly, we observed a negative correlation between dissimilarity index by AOA and QRF prediction interval width. Although the cross-validation results for the trained models were acceptable, the extrapolation results were unsatisfactory, highlighting the risk of extrapolation. Using soil data from three countries instead of one increased the similarities for all measures, but it had a limited effect on improving extrapolation. 
Also, none of the measures had a strong correlation with the validation metrics. This was particularly disappointing for AOA and QRF, which we had expected to be strong indicators of extrapolation prediction performance. Results showed that homosoil and soil type methods had the strongest correlation with validation metrics. The results for this case study revealed limitations of using AOA and QRF as measures of extrapolation effects, highlighting the importance of not relying on these methods blindly. Further research and more case studies are needed to address the effects of extrapolation of DSM models.
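
The abstract describes eight donor/recipient experiments across four countries and a dissimilarity index in feature space. The sketch below illustrates both ideas under stated assumptions and is not the authors' code. The country labels `A` to `D`, the synthetic features, and the helper names `donor_recipient_splits` and `dissimilarity_index` are hypothetical. The index is a simplified, unweighted form of an AOA-style measure: the distance from each new point to its nearest training point in standardised feature space, divided by the mean pairwise distance among training points. Feature weighting by variable importance and the derivation of an AOA threshold are omitted.

```python
from itertools import combinations

import numpy as np


def donor_recipient_splits(countries):
    """Enumerate splits where one country, or all but one, act as donors."""
    splits = []
    for k in (1, len(countries) - 1):
        for donors in combinations(countries, k):
            recipients = tuple(c for c in countries if c not in donors)
            splits.append((donors, recipients))
    return splits


def dissimilarity_index(train_X, new_X):
    """Unweighted AOA-style dissimilarity index (simplified sketch).

    Returns, for each row of new_X, the distance to the nearest training
    point divided by the mean pairwise distance among training points.
    Features are standardised with the training mean and std first.
    """
    train_X = np.asarray(train_X, dtype=float)
    new_X = np.asarray(new_X, dtype=float)
    mu = train_X.mean(axis=0)
    sd = train_X.std(axis=0)
    sd[sd == 0] = 1.0  # avoid division by zero for constant features
    t = (train_X - mu) / sd
    n = (new_X - mu) / sd

    # Mean of all pairwise distances between distinct training points.
    d_tt = np.linalg.norm(t[:, None, :] - t[None, :, :], axis=2)
    upper = np.triu_indices(len(t), k=1)
    mean_train_dist = d_tt[upper].mean()

    # Distance from each new point to its nearest training point.
    d_nt = np.linalg.norm(n[:, None, :] - t[None, :, :], axis=2)
    return d_nt.min(axis=1) / mean_train_dist


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic features: each placeholder country is shifted in feature space.
    features = {
        name: rng.normal(loc=shift, scale=1.0, size=(50, 3))
        for name, shift in zip("ABCD", (0.0, 0.5, 1.0, 3.0))
    }
    for donors, recipients in donor_recipient_splits(list(features)):
        train = np.vstack([features[c] for c in donors])
        for r in recipients:
            di = dissimilarity_index(train, features[r])
            print(donors, "->", r, "mean DI:", round(float(di.mean()), 3))
```

With four countries the enumeration yields four single-donor and four three-donor experiments, matching the eight experiments mentioned in the abstract. A larger mean index for a recipient suggests that predictions there rely more heavily on extrapolation in feature space, although the abstract cautions that such measures correlated only weakly with validation metrics in this case study.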
