计算机科学
数据挖掘
领域(数学)
过程(计算)
质量(理念)
土地利用
工程类
数学
土木工程
哲学
认识论
纯数学
操作系统
作者
Hao Wu,Zhangwei Jiang,Anning Dong,Ronghui Gao,Xiaoqin Yan,Zhihui Hu,Fengling Mao,Hong Liu,P.P. Li,Peng Luo,Zijin Guo,Qingfeng Guan,Yao Yao
标识
DOI:10.1080/13658816.2024.2387200
摘要
A high-quality land-use dataset is crucial for constructing a high-performance land-use classification model. Due to the complexity and spatial heterogeneity of land-use, the dataset construction process is inefficient and costly. This challenge affects the quality of datasets, consequently impacting the model's performance. The emerging field of Data-Centric Artificial Intelligence (DCAI) is expected to deliver techniques for dataset optimization, offering a promising solution to the problem. Therefore, this study proposes a data-centric framework named DCAI-CLUD for the construction of land-use datasets. Based on this framework, the accuracy and rate of data labeling are improved by 5.93 and 28.97%. The Gini index of the dataset and the proportion of samples with non-mixed land-use categories are enhanced by 3.27 and 8.52%. The overall accuracy (OA) and Kappa of the land-use classification model improved significantly by 27.87 and 58.08%. This study is the first to introduce DCAI into the field of geographic information and remote sensing and verify its effectiveness. The proposed framework can effectively improve the construction efficiency and quality of the dataset and synchronously optimize the model performance. Based on the proposed framework, we constructed a multi-source land-use dataset of major cities in China named CN-MSLU-100K.
科研通智能强力驱动
Strongly Powered by AbleSci AI