Back to Search Start Over

Extracting OLAP Cubes From Document-Oriented NoSQL Database Based on Parallel Similarity Algorithms

Authors :
Kambiz Majidzadeh
Farnaz Davardoost
Amin Babazadeh Sangar
Source :
Canadian Journal of Electrical and Computer Engineering. 43:111-118
Publication Year :
2020
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2020.

Abstract

Today, the relational database is not suitable for data management due to the large variety and volume of data which are mostly untrusted. Therefore, NoSQL has attracted the attention of companies. Despite it being a proper choice for managing a variety of large volume data, there is a big challenge and difficulty in performing online analytical processing (OLAP) on NoSQL since it is schema-less. This article aims to introduce a model to overcome null value in converting document-oriented NoSQL databases into relational databases using parallel similarity techniques. The proposed model includes four phases, shingling, chunck, minhashing, and locality-sensitive hashing MapReduce (LSHMR). Each phase performs a proper process on input NoSQL databases. The main idea of LSHMR is based on the nature of both locality-sensitive hashing (LSH) and MapReduce (MR). In this article, the LSH similarity search technique is used on the MR framework to extract OLAP cubes. LSH is used to decrease the number of comparisons. Furthermore, MR enables efficient distributed and parallel computing. The proposed model is an efficient and suitable approach for extracting OLAP cubes from an NoSQL database.

Details

ISSN :
08408688
Volume :
43
Database :
OpenAIRE
Journal :
Canadian Journal of Electrical and Computer Engineering
Accession number :
edsair.doi...........6ac0ccf45fc5dcd5bcf8ed5e77b0c6a2
Full Text :
https://doi.org/10.1109/cjece.2019.2953049