A new framework for missing data estimation and reconstruction based on the geographical input information, data mining, and multi-criteria decision-making; theory and application in missing groundwater data of Damghan plain

数据挖掘 决策树 均方误差 缺少数据 支持向量机 人工神经网络 随机森林 计算机科学 计算 极限学习机 数学 统计
作者
Alireza Mohaghegh,Saeed Farzin,Mahdi Valikhan Anaraki
出处
期刊:Groundwater for Sustainable Development [Elsevier]
卷期号:: 100767-100767
标识
DOI:10.1016/j.gsd.2022.100767
摘要

In the present study, a new framework is developed based on the geographical data (GD), data mining techniques (DI), and Hesitant fuzzy-multicriteria decision-making methods (HF-MCDA) for modeling groundwater table (GWT) missing data in Damghan plain. The GD is used as inputs in the presented approach, and available GWT is used as output. The different DI, including artificial neural network (ANN), tree model M5 (M5), multivariate adaptive regression spline (MARS), least-square support vector machine (LSSVM), random forest (RF), and extreme learning machine (ELM), are employed for establishing a relation between GD and GWT and estimating missing GWT. However, there is this challenge that one of the DI is better because there are different criteria for selecting the best DI, including error criteria, uncertainty, and computation time. Moreover, there is hesitation about the choice of weight criteria. In this condition, HF-MCDA is a practical choice. According to the results, M5 (by values of 5.485 m, 10.811 m, and 0.998 for MAE, RMSE, and R2, respectively) and LSSVM (by values of 3.043 m, 17.005 m, and 0.997 for MAE, RMSE, and R2, respectively) have accurate results than other investigated DI. In contrast, ELM has the worst results in terms of accuracy. M5 and RF have the best and worst performance based on the time computation term. The results of bootstrap uncertainty show that LSSVM has minimum uncertainty (by the value of 1.349 m for d_factor), and ELM has maximum uncertainty (by the value of 1.570 m for d_factor). Finally, according to the results of HF-MCDA, M5, LSSVM has first and the second rank with a closed score. Besides, the MARS algorithm is placed B in the third rank with a slight difference from M5 and LSSVM. Based on the high and closed scores of M5, LSSVM, and MARS, these methods can be used to find missing GWT data. • A new framework has been introduced for modeling missing data of GWT using geographical data and date information. • Different DI have been employed as an estimator in the presented framework. • The uncertainty of algorithms has been assessed by bootstrapping method. • New decision-making method HF-MCDA is used for selecting the best algorithm. • The presented framework has the potential for modeling missing data in different fields.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
情怀应助Fortune采纳,获得10
1秒前
qsh完成签到,获得积分10
1秒前
啸傲西湖完成签到,获得积分10
2秒前
ming应助雨水采纳,获得10
4秒前
5秒前
windcreator完成签到,获得积分10
5秒前
蝉鸣完成签到,获得积分10
7秒前
冰子完成签到 ,获得积分10
9秒前
云铱梦令完成签到,获得积分10
9秒前
yuHS完成签到,获得积分10
9秒前
10秒前
某某发布了新的文献求助10
11秒前
MrLiu完成签到,获得积分10
11秒前
漂亮的芝发布了新的文献求助10
13秒前
小灰灰完成签到 ,获得积分10
13秒前
852应助韦明凯采纳,获得10
13秒前
搜集达人应助如晴采纳,获得10
14秒前
15秒前
我是老大应助roclie采纳,获得10
16秒前
18秒前
雨水完成签到,获得积分10
20秒前
汉堡包应助hs采纳,获得10
20秒前
h3m完成签到 ,获得积分10
22秒前
22秒前
25秒前
大模型应助沉泽采纳,获得10
25秒前
25秒前
爆米花应助fanfan采纳,获得10
27秒前
NexusExplorer应助不散的和弦采纳,获得10
27秒前
大胆夜山完成签到,获得积分20
28秒前
29秒前
钱俊完成签到,获得积分10
29秒前
大酋长发布了新的文献求助10
31秒前
共享精神应助火焰向上采纳,获得10
31秒前
32秒前
33秒前
34秒前
34秒前
风趣飞柏完成签到,获得积分10
34秒前
李花开又白完成签到,获得积分10
35秒前
高分求助中
The three stars each : the Astrolabes and related texts 1070
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Sport in der Antike 800
Aspect and Predication: The Semantics of Argument Structure 666
De arte gymnastica. The art of gymnastics 600
少脉山油柑叶的化学成分研究 530
Sport in der Antike Hardcover – March 1, 2015 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2409068
求助须知:如何正确求助?哪些是违规求助? 2105043
关于积分的说明 5315846
捐赠科研通 1832535
什么是DOI,文献DOI怎么找? 913080
版权声明 560733
科研通“疑难数据库(出版商)”最低求助积分说明 488238