Multi-query processing of XML data streams on multicore
- Source :
- The Journal of Supercomputing. 73:2339-2368
- Publication Year :
- 2016
- Publisher :
- Springer Science and Business Media LLC, 2016.
Abstract
- Multicore architectures have become the norm for computing systems in recent years because they provide CPU-level support for parallelism. However, existing algorithms for processing XML streams do not take full advantage of this capability, since they were not designed to run in parallel. In this article, we propose several methods to efficiently parallelize the finite state automata (FSA)-based XML stream processing technique. We transform a large collection of XPath expressions into multiple FSA-based query indexes and then process XML streams in parallel by exploiting index-level parallelism. Each core works only with its own query index, so no synchronization issues arise while filtering XML streams against the multiple path patterns given by users. We also present an in-memory MapReduce model that makes it possible to process a large collection of twig pattern joins over XML streams simultaneously. Twig pattern joins in our approach are shared among multiple hardware threads in a balanced way. Extensive experiments show that, on an 8-core CPU, our algorithm outperforms conventional algorithms by up to a factor of ten when processing 10 million XPath expressions over XML streams.
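The index-level parallelism described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record gives no details of their FSA construction, so the sketch assumes simple child-axis paths with an optional `*` wildcard, a trie-shaped automaton per partition, and round-robin partitioning. The names `build_index`, `run_index`, and `filter_parallel` are hypothetical. Python threads are used only to show the structure (one private index per worker, no shared mutable state); they do not by themselves give the multicore speedup the article reports.

```python
from concurrent.futures import ThreadPoolExecutor


def build_index(queries):
    """Build one trie-shaped automaton from (query_id, path) pairs.

    Assumption: paths are simple, e.g. "/a/b" or "/a/*/c" (no "//", no predicates).
    """
    root = {"next": {}, "accept": []}
    for qid, path in queries:
        node = root
        for step in path.strip("/").split("/"):
            node = node["next"].setdefault(step, {"next": {}, "accept": []})
        node["accept"].append(qid)
    return root


def run_index(index, events):
    """Run a single index over a stream of ("start"|"end", tag) events."""
    matched = set()
    stack = [[index]]  # active automaton states for each open element depth
    for kind, tag in events:
        if kind == "start":
            active = []
            for state in stack[-1]:
                for label in (tag, "*"):  # exact tag or wildcard transition
                    nxt = state["next"].get(label)
                    if nxt is not None:
                        active.append(nxt)
                        matched.update(nxt["accept"])
            stack.append(active)
        else:
            stack.pop()  # leaving the element restores the parent's states
    return matched


def filter_parallel(queries, events, workers=4):
    """Partition queries round-robin, then run each private index in its own worker."""
    parts = [queries[i::workers] for i in range(workers)]
    indexes = [build_index(part) for part in parts]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(lambda idx: run_index(idx, events), indexes)
    return set().union(*results)


queries = [("q1", "/a/b"), ("q2", "/a/*/c"), ("q3", "/x")]
events = [("start", "a"), ("start", "b"), ("start", "c"),
          ("end", "c"), ("end", "b"), ("end", "a")]
matches = filter_parallel(queries, events, workers=2)
```

Because each worker reads the event list and writes only to its own result set, no locking is needed; the per-worker results are merged once at the end.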
- Subjects :
- Computer science
computer.internet_protocol
Data stream mining
Efficient XML Interchange
02 engineering and technology
computer.file_format
Parallel computing
computer.software_genre
Theoretical Computer Science
Twig
Simple API for XML
XML database
Parallel processing (DSP implementation)
Hardware and Architecture
020204 information systems
Streaming XML
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
computer
Software
XML
Information Systems
XPath
Details
- ISSN :
- 1573-0484 and 0920-8542
- Volume :
- 73
- Database :
- OpenAIRE
- Journal :
- The Journal of Supercomputing
- Accession number :
- edsair.doi...........24247eebd5db76b3e6427ced0ae37da1