Enhanced semantic representation model for multisource point of interest attribute alignment.

Authors :
Li, Pengpeng
Wang, Yong
Liu, Jiping
Luo, An
Xu, Shenghua
Zhang, Zhiran
Source :
Information Fusion. Oct 2023, Vol. 98, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

• ESRM unsupervised learning.
• Two pre-training tasks: relational consistency prediction and replacement language model.
• Fine-tuning on the downstream attribute alignment task.

Multisource point of interest (POI) attribute alignment is the consistent processing of heterogeneous attribute values from different data sources that refer to the same POI; it is one of the key technologies for geospatial data fusion. However, semantic heterogeneity among POI data sources, in the form of synonyms and homographs, makes multisource POI data fusion challenging. This paper proposes a multisource POI attribute alignment method based on the Enhanced Semantic Representation Model (ESRM). First, the unlabeled corpus is preprocessed by Chinese word segmentation and attribute expression sequence construction. Then, the ESRM is pre-trained using the relational consistency prediction and replacement language model tasks. Finally, the model is fine-tuned through supervised learning to perform the attribute alignment task for multisource POI data, according to the specific downstream task. We used the POI attributes of Baidu Map, Tencent Map, and Gaode Map in Chengdu, China as the experimental data. The results show that the proposed model outperforms existing methods for attribute alignment. Specifically, category attribute consistency achieves a Macro-F1 value of over 90%, and address attribute standardization achieves a BLEU-4 score of over 95%. [ABSTRACT FROM AUTHOR]
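
The Macro-F1 value cited for category attribute consistency is, by its standard definition, the unweighted mean of per-class F1 scores, so rare categories count as much as frequent ones. The following is a minimal sketch of that definition, assuming single-label category predictions. The function name `macro_f1` and the example labels are hypothetical and do not come from the paper, whose exact evaluation protocol is not given in this record.

```python
from collections import defaultdict


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over every class in y_true or y_pred."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    tp = defaultdict(int)  # true positives per class
    fp = defaultdict(int)  # false positives per class
    fn = defaultdict(int)  # false negatives per class
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1
            fn[truth] += 1

    scores = []
    for label in set(y_true) | set(y_pred):
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical category labels for six POIs (illustration only).
y_true = ["food", "food", "hotel", "hotel", "shop", "shop"]
y_pred = ["food", "food", "hotel", "shop", "shop", "shop"]
print(round(macro_f1(y_true, y_pred), 4))
```

In this toy example the per-class F1 scores are 1.0 (food), 2/3 (hotel), and 0.8 (shop), and Macro-F1 is their plain average.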

Details

Language :
English
ISSN :
1566-2535
Volume :
98
Database :
Academic Search Index
Journal :
Information Fusion
Publication Type :
Academic Journal
Accession number :
164155558
Full Text :
https://doi.org/10.1016/j.inffus.2023.101852