1. IDSA: An Efficient Algorithm for Skyline Queries Computation on Dynamic and Incomplete Data With Changing States
- Author
-
Sherzod Turaev, Sharyar Wani, Hamidah Ibrahim, Yasir Hamid, Yonis Gulzar, Ali Amer Alwan, and Arjumand Bano Soomo
- Subjects
General Computer Science ,Exploit ,Computer science ,02 engineering and technology ,computer.software_genre ,incomplete database ,Reduction (complexity) ,skyline queries ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,General Materials Science ,Pruning (decision trees) ,Electrical and Electronic Engineering ,Skyline ,Data manipulation language ,General Engineering ,Process (computing) ,InformationSystems_DATABASEMANAGEMENT ,Dynamic database ,TK1-9971 ,pairwise comparison ,020201 artificial intelligence & image processing ,Pairwise comparison ,Electrical engineering. Electronics. Nuclear engineering ,Data mining ,Tuple ,computer - Abstract
Skyline queries have been widely used as an effective query tool in many contemporary database applications. The main concept of skyline queries relies on retrieving the non-dominated tuples in the database which are known skylines. In most database applications, the contents of the databases are dynamic due to the continuous changes made towards the database. Typically, the changes in the contents of the database occur through data manipulation operations (INSERT and/or UPDATE). Performing these operations on the database results in invalidating the most recent skylines before changes are made on the database. Furthermore, the presence of incomplete data in databases becomes frequent phenomena in recent database applications. Data incompleteness causes several challenges on skyline queries such as losing the transitivity property of the skyline technique and the test dominance process between tuples being cyclic. Reapplying skyline technique on the entire updated incomplete database to determine the new skylines is unwise due to the exhaustive pairwise comparisons. Thus, this paper proposes an approach, named Incomplete Dynamic Skyline Algorithm (IDSA) which attempts to determine the skylines on dynamic and incomplete databases. Two optimization techniques have been incorporated in IDSA, namely: pruning and selecting superior local skylines. The pruning process attempts to exploit the derived skylines before the INSERT/UPDATE operation made on the database to identify the new skylines. Moreover, selecting superior local skylines process assists in further eliminating the remaining non-skylines from further processing. These two optimization techniques lead to a large reduction in the number of domination tests due to avoiding re-computing of skylines over the entire updated database to derive the new skylines. Extensive experiments have been accomplished on both real and synthetic datasets, and the results demonstrate that IDSA outperforms the existing solutions in terms of the number of domination tests and the processing time of the skyline operation.
- Published
- 2021