Back to Search Start Over

Method on Multi-granularity Data Provenance for Data Fusion

Authors :
YANG Fei-fei, SHEN Si-yu, SHEN De-rong, NIE Tie-zheng, KOU Yue
Source :
Jisuanji kexue, Vol 49, Iss 5, Pp 120-128 (2022)
Publication Year :
2022
Publisher :
Editorial office of Computer Science, 2022.

Abstract

As the amount of data increases,correlates and crosses between data,the value of data needs to be maximized through data fusion.However,due to the complexity of the data fusion process,to clearly explain the data fusion process,it is necessary to establish a backtracking mechanism for data fusion.Although many researches are focused on data provenance,most of them are based on query and workflow,and few of them are for data fusion.This paper focuses on the provenance of data fusion,and proposes a method to support multi-granularity provenance.Firstly,the data fusion process is abstracted,and the semantic graphs of patterns,entities and attributes are constructed with the entity as the core,and an optimized model for storing storage provenance information is proposed.Secondly,on the basis of the semantic graph,the data provenance query algorithms at the entity level and the attribute level are proposed respectively,and the corresponding query optimization strategy are also proposed.Finally,experiments demonstrate the effectiveness of the proposed data provenance method.

Details

Language :
Chinese
ISSN :
1002137X
Volume :
49
Issue :
5
Database :
Directory of Open Access Journals
Journal :
Jisuanji kexue
Publication Type :
Academic Journal
Accession number :
edsdoj.9fc57a3cbb34450e8c6f2bdb47d9e0ef
Document Type :
article
Full Text :
https://doi.org/10.11896/jsjkx.210300092