Extracting Spatio-Temporal Information from Chinese Archaeological Site Text.

Authors :
Yuan, Wenjing
Yang, Lin
Yang, Qing
Sheng, Yehua
Wang, Ziyang
Source :
ISPRS International Journal of Geo-Information. Mar2022, Vol. 11 Issue 3, p175-N.PAG. 18p.
Publication Year :
2022

Abstract

Archaeological site text is currently the main carrier of archaeological data and contains rich information. Efficiently extracting useful knowledge from massive volumes of unstructured archaeological site text is of great significance for the mining and reuse of archaeological information. Based on the site information (such as name, location, cultural type, and dynasty) recorded in Chinese archaeological site text, this paper combines deep learning and natural language processing techniques to study an information extraction method that automatically obtains the spatio-temporal information of sites. An initial corpus of Chinese archaeological site text is constructed for the first time and used to train a Bidirectional Long Short-Term Memory with Conditional Random Fields (BiLSTM-CRF) entity recognition model and a Bidirectional Gated Recurrent Units with Dual Attention (BiGRU-Dual Attention) relation extraction model. On the test set, the BiLSTM-CRF and BiGRU-Dual Attention models reach F1 values of 87.87% and 88.05%, respectively. The study demonstrates that the proposed information extraction method is feasible for Chinese archaeological site text; it promotes the establishment of knowledge graphs in archaeology and provides new methods and ideas for the development of information mining technology in archaeology. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
2220-9964
Volume :
11
Issue :
3
Database :
Academic Search Index
Journal :
ISPRS International Journal of Geo-Information
Publication Type :
Academic Journal
Accession number :
156018754
Full Text :
https://doi.org/10.3390/ijgi11030175