Back to Search Start Over

A machine learning approach to extracting spatial information from geological texts in Chinese.

Authors :
Chu, Deping
Wan, Bo
Li, Hong
Dong, Shuai
Fu, Jinming
Liu, Yiyang
Huang, Kuan
Liu, Hui
Source :
International Journal of Geographical Information Science; Nov2022, Vol. 36 Issue 11, p2169-2193, 25p
Publication Year :
2022

Abstract

Texts have become an important spatial data resource. Interpretation of unstructured geoscience texts using natural language processing methods can effectively facilitate the discovery and retrieval of geographic information. Yet studies on the extraction of spatial information from textual geoscience data are limited compared to digital geoscience data. In this work, a machine learning approach is proposed for mining spatial relations in Chinese geological texts. The approach views spatial relation extraction as a sequence labeling problem, avoids the division of relation categories, and enables mining fine-grained spatial relations. The extracted geological texts commonly describe three-dimensional spatial relations among regions, strata, and lithologies. The extracted spatial relations are classified into three major categories (topological relations, absolute directional relations and relative directional relations) and 14 subcategories. We validated the proposed model with a test dataset, constructed visual displays of the extracted spatial relations on different topics, and quantified the uncertainty in the process from spatial entity recognition to spatial relation extraction. With the detailed portrayal of these spatial relations, this study provides support for solving theoretical and practical problems of cognition, prediction, decision-making, and evaluation in geoscience. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13658816
Volume :
36
Issue :
11
Database :
Complementary Index
Journal :
International Journal of Geographical Information Science
Publication Type :
Academic Journal
Accession number :
159948630
Full Text :
https://doi.org/10.1080/13658816.2022.2087224