Back to Search Start Over

A Transformation of the RDF Mapping Language into a High-Level Data Analysis Language for Execution in a Distributed Computing Environment

Authors :
Sergey A. Stupnikov
Wenfei Tang
Source :
Communications in Computer and Information Science ISBN: 9783030811990, DAMDID/RCDL (Selected Papers)
Publication Year :
2021
Publisher :
Springer International Publishing, 2021.

Abstract

Nowadays scientific data should be FAIR that are Findable, Accessible, Interoperable and Reusable. Reference implementation of FAIR data management principles proposed recently considers RDF as unifying data model and RDF Mapping Language (RML) as the basic language for data integration. This paper is aimed at development of methods and tools for scalable data integration in the frame of this architecture. A mapping from RML into a high-level data analysis language Pig Latin that runs on Hadoop is considered. The mapping is implemented using model transformation technologies. These allows to execute RML programs in the Hadoop distributed computing environment. According to the experimental evaluation RML implementation developed scales w.r.t. data volume and outperforms related implementations.

Details

ISBN :
978-3-030-81199-0
ISBNs :
9783030811990
Database :
OpenAIRE
Journal :
Communications in Computer and Information Science ISBN: 9783030811990, DAMDID/RCDL (Selected Papers)
Accession number :
edsair.doi...........5f88468b4d7b1f9c265e98024a8c0b40
Full Text :
https://doi.org/10.1007/978-3-030-81200-3_6