Back to Search Start Over

DCAI-CLUD: a data-centric framework for the construction of land-use datasets.

Authors :
Wu, Hao
Jiang, Zhangwei
Dong, Anning
Gao, Ronghui
Yan, Xiaoqin
Hu, Zhihui
Mao, Fengling
Liu, Hong
Li, Pengxuan
Luo, Peng
Guo, Zijin
Guan, Qingfeng
Yao, Yao
Source :
International Journal of Geographical Information Science. Nov2024, Vol. 38 Issue 11, p2379-2402. 24p.
Publication Year :
2024

Abstract

A high-quality land-use dataset is crucial for constructing a high-performance land-use classification model. Due to the complexity and spatial heterogeneity of land-use, the dataset construction process is inefficient and costly. This challenge affects the quality of datasets, consequently impacting the model's performance. The emerging field of Data-Centric Artificial Intelligence (DCAI) is expected to deliver techniques for dataset optimization, offering a promising solution to the problem. Therefore, this study proposes a data-centric framework named DCAI-CLUD for the construction of land-use datasets. Based on this framework, the accuracy and rate of data labeling are improved by 5.93 and 28.97%. The Gini index of the dataset and the proportion of samples with non-mixed land-use categories are enhanced by 3.27 and 8.52%. The overall accuracy (OA) and Kappa of the land-use classification model improved significantly by 27.87 and 58.08%. This study is the first to introduce DCAI into the field of geographic information and remote sensing and verify its effectiveness. The proposed framework can effectively improve the construction efficiency and quality of the dataset and synchronously optimize the model performance. Based on the proposed framework, we constructed a multi-source land-use dataset of major cities in China named CN-MSLU-100K. HIGHLIGHTS: A framework for optimizing the land-use dataset construction process is proposed. Filtering and pre-labeling improved the quality and efficiency of data labeling. The performance of land-use classification model is enhanced by dataset optimization. Preconceived results have a subjective impact on the data labelers. The first study to introduce DCAI for land-use classification is launched. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13658816
Volume :
38
Issue :
11
Database :
Academic Search Index
Journal :
International Journal of Geographical Information Science
Publication Type :
Academic Journal
Accession number :
180430099
Full Text :
https://doi.org/10.1080/13658816.2024.2387200