Back to Search
Start Over
CDDM: Concept Drift Detection Model for Data Stream
- Source :
- International Journal of Interactive Mobile Technologies, Vol 14, Iss 10, Pp 90-106 (2020), International Journal of Interactive Mobile Technologies (iJIM); Vol. 14 No. 10 (2020); pp. 90-106
- Publication Year :
- 2020
- Publisher :
- International Association of Online Engineering (IAOE), 2020.
-
Abstract
- Data stream is the huge amount of data generated in various fields, including financial processes, social media activities, Internet of Things applications, and many others. Such data cannot be processed through traditional data mining algorithms due to several constraints, including limited memory, data speed, and dynamic environment. Concept Drift is known as the main constraint of data stream mining, mainly in the classification task. It refers to the change in the data stream underlining distribution over time. Thus, it results in accuracy deterioration of classification models and wrong predictions. Spam emails, consumer behavior changes, and adversary activates, are examples of Concept Drift. In this paper, a Concept Drift detection model is introduced, Concept Drift Detection Model (CDDM). It monitors the accuracy of the classification model over a sliding window, assuming the decline in accuracy indicates a drift occurrence. A modification over CDDM is a weighted version of the CDDM as W-CDDM.Both models have evaluated against two real datasets and four artificial datasets. The experimental results of abrupt drift show that CDDM, W-CDDM outperforms the other models in the dataset of 100K and 1M instances, respectively. Regarding gradual drift, the W-CDDM overtook the rest in terms of accuracy, run time, and detection delays in the dataset of 100 K instances. While in the dataset of 1M instances, CDDM has got the highest accuracy using the NB classifier. Moreover, W-CDDM achieves the highest accuracy on real datasets.
- Subjects :
- Data stream
Concept drift
Computer Networks and Communications
business.industry
Computer science
Data stream mining
data stream mining, concept drift, concept drift detection, data stream classification
Concept Drift
TK5101-6720
Concept Drift Detection
computer.software_genre
Data Stream Mining
Data mining algorithm
Computer Science Applications
ComputingMethodologies_PATTERNRECOGNITION
Sliding window protocol
Telecommunication
Data mining
Internet of Things
business
Data Stream Classification
computer
Classifier (UML)
Subjects
Details
- Language :
- English
- ISSN :
- 18657923
- Volume :
- 14
- Issue :
- 10
- Database :
- OpenAIRE
- Journal :
- International Journal of Interactive Mobile Technologies
- Accession number :
- edsair.doi.dedup.....ebc867673a6429f3494c247f1261dadb