Back to Search Start Over

IoT streaming data integration from multiple sources.

Authors :
Tu, Doan Quang
Kayes, A. S. M.
Rahayu, Wenny
Nguyen, Kinh
Source :
Computing. Oct2020, Vol. 102 Issue 10, p2299-2329. 31p.
Publication Year :
2020

Abstract

The Internet of Things (IoT) has recently received considerable interest due to the development of smart technologies in today's interconnected world. With the rapid advancement in Internet technologies and the proliferation of IoT sensors, myriad systems and applications generate data of a massive volume, variety and velocity which traditional databases and systems are unable to manage effectively. Many organizations need to deal with these massive datasets that encounter different types of data (e.g., IoT streaming data, static data) in different formats (e.g., structured, semi-structured) coming from multiple sources. Several data integration mechanisms have been designed to process mostly static data. Unfortunately, these techniques are not able to deal with and integrate IoT streaming datasets from multiple sources. In this paper, we identify the challenges of IoT Streaming Data Integration (ISDI) and present a formal approach for the real-time integration of such IoT streaming datasets. We address one of the important issues of timing conflict/alignment among streaming data coming from multiple sources. A generic window-based ISDI approach is proposed to deal with IoT data in different formats and algorithms are developed to integrate IoT streaming data from multiple sources. In particular, we extend the basic windowing algorithm for real-time data integration and to deal with the timing alignment issue. We also introduce a de-duplication algorithm to deal with data redundancy and to demonstrate the useful fragments of the integrated data. We conduct several sets of experiments and quantify the performance of our proposed window-based approach. In particular, we compare our local experimental results with a real setup for streaming data, using Apache Spark. The results of the experiments, which are performed on several IoT datasets, show the efficiency of our proposed solution in terms of processing time. The results are also used to provide an integrated data view to the users. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0010485X
Volume :
102
Issue :
10
Database :
Academic Search Index
Journal :
Computing
Publication Type :
Academic Journal
Accession number :
145889055
Full Text :
https://doi.org/10.1007/s00607-020-00830-9