Prediction of flood risk levels of urban flooded points though using machine learning with unbalanced data

重采样 大洪水 估计员 熵(时间箭头) 洪水(心理学) 数据挖掘 k-最近邻算法 随机森林 采样(信号处理) 人工智能 计算机科学 机器学习 统计 数学 地理 考古 心理学 物理 滤波器(信号处理) 量子力学 计算机视觉 心理治疗师
作者
Hongfa Wang,Yu Meng,Hongshi Xu,Huiliang Wang,Xinjian Guan,Yuan Liu,Meng Liu,Zening Wu
出处
期刊:Journal of Hydrology [Elsevier BV]
卷期号:630: 130742-130742 被引量:9
标识
DOI:10.1016/j.jhydrol.2024.130742
摘要

With the emphasis on preventing urban flooding and the enhancement of rational urban development, data related to urban flooding are also collected with unbalanced sample size that is a widespread phenomenon in other world fields. The performance of the classification model is compromised by unbalanced datasets, therefore, minority-class samples, floods with higher risk, are often missing alerted or incorrectly warned. To solve this problem, a novel hybrid resampling proposal is proposed in this research proved to be effective for balancing data. First, it optimizes an imbalanced dataset by the Borderline-SMOTE algorithm. Next, alternative datasets are synthesized through under-sampling techniques, whose qualities are evaluated by using information entropy and calculated rely on the k-nearest neighbor entropy estimator. The suggested method not only makes full use of the original data information, but also avoids under-fitting due to the single under-sampling utilization. A practical application in the central area of Zhengzhou, China, combining the resampling proposal and the Random Forest classification model optimized by Genetic Algorithm, the results show that significantly better results are yielded compared without any treatment in terms of all assessment indicators (Accuracy, Recall, G-mean and F1-score) have been improved.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
俊逸雪瑶完成签到,获得积分10
1秒前
1秒前
xue发布了新的文献求助10
1秒前
羔羊完成签到,获得积分10
1秒前
完美世界应助aibimixiusi采纳,获得10
1秒前
11111发布了新的文献求助10
1秒前
1秒前
duwang完成签到,获得积分10
2秒前
rainer发布了新的文献求助10
2秒前
夕瑶摇啊发布了新的文献求助10
2秒前
3秒前
立军发布了新的文献求助30
3秒前
3秒前
亍孞关注了科研通微信公众号
3秒前
亍孞关注了科研通微信公众号
3秒前
suiyi完成签到,获得积分10
4秒前
wujiaman345完成签到,获得积分10
4秒前
5秒前
文静的怜烟完成签到,获得积分10
5秒前
底层玩家完成签到,获得积分10
5秒前
6秒前
OvO完成签到,获得积分20
6秒前
俊逸雪瑶发布了新的文献求助10
6秒前
6秒前
和花花完成签到,获得积分10
6秒前
7秒前
是鹤发布了新的文献求助10
7秒前
7秒前
7秒前
Oil完成签到,获得积分10
8秒前
平淡向雁完成签到,获得积分10
8秒前
9秒前
9秒前
9dingyushu发布了新的文献求助30
9秒前
晨纯发布了新的文献求助10
10秒前
10秒前
AllRightReserved应助平淡砖头采纳,获得10
10秒前
JamesPei应助Yeee采纳,获得10
10秒前
aibimixiusi完成签到,获得积分20
10秒前
10秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
The Resilient Mindset 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 300
Upland Kenya wild flowers and ferns: a flora of the flowers, ferns, grasses, and sedges of highland Kenya 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6646882
求助须知:如何正确求助?哪些是违规求助? 8402691
关于积分的说明 17966956
捐赠科研通 5839381
什么是DOI,文献DOI怎么找? 2969936
邀请新用户注册赠送积分活动 1945113
关于科研通互助平台的介绍 1863939