Back to Search Start Over

Toward Geographic Information Harvesting: Extraction of Spatial Relational Facts from Web Documents

Authors :
Maguelonne Teisseire
Dino Ienco
Mathieu Roche
Donato Malerba
Corrado Loglisci
University of Bari Aldo Moro (UNIBA)
Territoires, Environnement, Télédétection et Information Spatiale (UMR TETIS)
Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-AgroParisTech-Institut national de recherche en sciences et technologies pour l'environnement et l'agriculture (IRSTEA)
Fouille de données environnementales (TATOO)
Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM)
Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)
Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)
Exploration et exploitation de données textuelles (TEXTE)
UNIVERSITA ALDO MORO BARI ITA
Partenaires IRSTEA
Institut national de recherche en sciences et technologies pour l'environnement et l'agriculture (IRSTEA)-Institut national de recherche en sciences et technologies pour l'environnement et l'agriculture (IRSTEA)
Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)
Source :
12th International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM) @ ICDM, 12th International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM) @ ICDM, Dec 2012, Brussels, Belgium. pp.789-796, ⟨10.1109/ICDMW.2012.20⟩, International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM-12), In Cooperation with IEEE ICDM 2012, 10 December 2012, Brussels, Belgium, SSTDM: Spatial and Spatio-Temporal Data Mining, SSTDM: Spatial and Spatio-Temporal Data Mining, Dec 2012, Brussels, Belgium. pp.789-796, ICDM 2012, ICDM 2012, Dec 2012, Bruxelles, Belgium. pp.789-796, ⟨10.1109/ICDMW.2012.20⟩, ICDM Workshops
Publication Year :
2012
Publisher :
HAL CCSD, 2012.

Abstract

International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM-12), In Cooperation with IEEE ICDM 2012, 10 December 2012, Brussels, Belgium; International audience; This paper faces the problem of harvesting geographic information from Web documents, specifically, extracting facts on spatial relations among geographic places. The motivation is twofold. First, researchers on Spatial Data Mining often assume that spatial data are already available, thanks to current GIS and positioning technologies. Nevertheless, this is not applicable to the case of spatial information embedded in data without an explicit spatial modeling, such as documents. Second, despite the huge amount of Web documents conveying useful geographic information, there is not much work on how to harvest spatial data from these documents. The problem is particularly challenging because of the lack of annotated documents, which prevents the application of supervised learning techniques. In this paper, we propose to harvest facts on geographic places through an unsupervised approach which recognizes spatial relations among geographic places without supposing the availability of annotated documents. The proposed approach is based on the combined use of a spatial ontology and a prototype-based classifier. A case study on topological and directional relations is reported and commented.

Details

Language :
English
Database :
OpenAIRE
Journal :
12th International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM) @ ICDM, 12th International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM) @ ICDM, Dec 2012, Brussels, Belgium. pp.789-796, ⟨10.1109/ICDMW.2012.20⟩, International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM-12), In Cooperation with IEEE ICDM 2012, 10 December 2012, Brussels, Belgium, SSTDM: Spatial and Spatio-Temporal Data Mining, SSTDM: Spatial and Spatio-Temporal Data Mining, Dec 2012, Brussels, Belgium. pp.789-796, ICDM 2012, ICDM 2012, Dec 2012, Bruxelles, Belgium. pp.789-796, ⟨10.1109/ICDMW.2012.20⟩, ICDM Workshops
Accession number :
edsair.doi.dedup.....ec04845a2dbc0a0644d2390aaad345ba
Full Text :
https://doi.org/10.1109/ICDMW.2012.20⟩