Evaluating Geospatial RDF Stores Using the Benchmark Geographica 2

Authors :
Theofilos Ioannidis
Manolis Koubarakis
George Garbis
Konstantina Bereta
Kostis Kyzirakos
Source :
Journal on Data Semantics
Publication Year :
2021
Publisher :
Springer Science and Business Media LLC, 2021.

Abstract

Geospatial extensions of SPARQL, such as GeoSPARQL and stSPARQL, have been defined since 2007. While several geospatial RDF stores have implemented a substantial part of these extensions, other stores have limited their support mostly to point geometries. In parallel, RDF frameworks have matured, offering a richer set of geospatial features, such as GeoSPARQL support, and incorporating the latest indexing technologies. As a consequence, a shift in the use of RDF frameworks is to be expected, from base platforms that users extend to create more complete geospatial RDF stores, to attractive finished RDF solutions for many geospatial applications. Together with the ever-increasing size of linked geospatial data that semantic stores need to handle, these developments motivated our group to improve Geographica, our benchmark for single-node systems, originally defined in 2013. Geographica 2 is more comprehensive: it includes new geospatial RDF stores and frameworks, large real-world datasets of many hundred million triples with up to fifty million features of complex geometries, and new tests and queries that reveal the scalability of these systems. The augmented and revised real-world workload of Geographica 2 tests the efficiency of primitive spatial functions in RDF stores and their performance in a geocoding scenario against the new Census dataset, in addition to many other real use case scenarios, and it also includes the computation of statistics for geospatial datasets. A more detailed and systematic evaluation is performed using the synthetic workload. The new scalability workload aims at discovering the limits of centralized geospatial RDF stores of various architectures. It employs a set of six well-balanced real-world datasets with highly complex geometries covering many European countries and compares three RDF stores in terms of storage space, bulk loading, and query response time.
In addition, a special version of the benchmark has been created for systems with limited geospatial functionality, and two more systems of this category are introduced alongside the six systems of the main benchmark, all stressed against point-only subsets of the workloads. Three of the eight systems use an RDBMS for the persistence layer, while some of them offer a variety of persistence options.

EU project ExtremeEarth (825258)

Details

ISSN :
1861-2040 and 1861-2032
Volume :
10
Database :
OpenAIRE
Journal :
Journal on Data Semantics
Accession number :
edsair.doi.dedup.....b485b798f98cb2cc7a1e03673fdfad56
Full Text :
https://doi.org/10.1007/s13740-021-00118-x