山崩
计算机科学
机器学习
统一建模语言
采样(信号处理)
人工智能
样品(材料)
混合模型
随机森林
数据挖掘
地质学
岩土工程
软件
计算机视觉
化学
滤波器(信号处理)
色谱法
程序设计语言
作者
Chenxu Su,Bijiao Wang,Yunhong Lv,Mingpeng Zhang,Dalei Peng,Bate Bate,Shuai Zhang
标识
DOI:10.1080/17499518.2022.2088802
摘要
Datasets containing recorded landslide and non-landslide samples can greatly influence the performance of machine learning (ML) models, which are commonly used in landslide susceptibility mapping (LSM). However, the non-landslide samples cannot be directly obtained. In this study, a pattern-based approach is proposed to improve the LSM process, constructing unsupervised machine learning (UML) – supervised machine learning (SML) collaborative models in which the non-landslide samples can be reasonably selected. Two UML models, the Gaussian mixture model (GMM) and K-means, are introduced to sample the non-landslide datasets with four sampling selections (abbreviated as A, B, C and D, respectively). Then non-landslide patterns recognised by the UML models are learned by the random forest (RF). A new sensitivity index, accuracy improvement ratio (AIR), is defined to evaluate the superiority of these sampling selections. Compared with the GMM-RF model, the K-means-RF model is more capable of recognising non-landslide patterns and providing sufficient and reliable non-landslide samples. The sampling selection A of the K-means-RF with an AIR value of 2.3 is regarded as the best selection. The results indicate that the UML-SML model based on the pattern-based approach offers an effective strategy to find the non-landslide samples and has a better solution to the LSM.
科研通智能强力驱动
Strongly Powered by AbleSci AI