Back to Search
Start Over
Streaming changepoint detection for transition matrices
- Source :
- Data Mining and Knowledge Discovery. 35:1287-1316
- Publication Year :
- 2021
- Publisher :
- Springer Science and Business Media LLC, 2021.
-
Abstract
- Sequentially detecting multiple changepoints in a data stream is a challenging task. Difficulties relate to both computational and statistical aspects, and in the latter, specifying control parameters is a particular problem. Choosing control parameters typically relies on unrealistic assumptions, such as the distributions generating the data, and their parameters, being known. This is implausible in the streaming paradigm, where several changepoints will exist. Further, current literature is mostly concerned with streams of continuous-valued observations, and focuses on detecting a single changepoint. There is a dearth of literature dedicated to detecting multiple changepoints in transition matrices, which arise from a sequence of discrete states. This paper makes the following contributions: a complete framework is developed for adaptively and sequentially estimating a Markov transition matrix in the streaming data setting. A change detection method is then developed, using a novel moment matching technique, which can effectively monitor for multiple changepoints in a transition matrix. This adaptive detection and estimation procedure for transition matrices, referred to as ADEPT-M, is compared to several change detectors on synthetic data streams, and is implemented on two real-world data streams – one consisting of over nine million HTTP web requests, and the other being a well-studied electricity market data set.
- Subjects :
- Data stream
Technology
Computer Networks and Communications
Computer science
Markov chain
02 engineering and technology
computer.software_genre
01 natural sciences
Computer Science, Artificial Intelligence
Synthetic data
Continuous monitoring
010104 statistics & probability
Moment matching
0801 Artificial Intelligence and Image Processing
0202 electrical engineering, electronic engineering, information engineering
Artificial Intelligence & Image Processing
0101 mathematics
ADEPT-M
Science & Technology
Computer Science, Information Systems
Data stream mining
Stochastic matrix
0804 Data Format
Computer Science Applications
Moment (mathematics)
Data set
Forgetting factor
0806 Information Systems
Computer Science
020201 artificial intelligence & image processing
Data mining
computer
Change detection
Information Systems
Subjects
Details
- ISSN :
- 1573756X and 13845810
- Volume :
- 35
- Database :
- OpenAIRE
- Journal :
- Data Mining and Knowledge Discovery
- Accession number :
- edsair.doi.dedup.....c23fe3b50a241bf3554dd35bbc041c1c
- Full Text :
- https://doi.org/10.1007/s10618-021-00747-7