1. 3iCubing: An Interval Inverted Index Approach to Data Cubes
- Author
-
Marco Domingues, Rodrigo Rocha Silva, and Jorge Bernardino
- Subjects
Big data ,data cube ,inverted index ,OLAP ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
The increase in the amounts of information used to analyze data is problematic since the memory necessary to store and process it is getting quite big. The interval inverted index representation was developed to reduce the required memory to store data, and Frag-Cubing is one of the most popular algorithms. In this paper, we propose two new data cubing algorithms: 3iCubing and M3iCubing. 3iCubing is a Frag-Cubing-based algorithm that uses the interval inverted index representation, while M3iCubing uses both a normal and interval inverted index data representation. The algorithms were compared using synthetic and real data sets in indexation and querying operations, both runtime and memory-wise. The experimental evaluation shows that 3iCubing can considerably reduce the memory needed to index a data set, reducing around 25% of the memory used by Frag-Cubing. Moreover, the results show that the interval inverted index representation is dependent on the data skewness to reduce the memory consumption, having positive results with highly skewed and real-world data sets.
- Published
- 2022
- Full Text
- View/download PDF