Back to Search
Start Over
A Novel Drift Detection Algorithm Based on Features’ Importance Analysis in a Data Streams Environment
- Source :
- Journal of Artificial Intelligence and Soft Computing Research. 10:287-298
- Publication Year :
- 2020
- Publisher :
- Walter de Gruyter GmbH, 2020.
-
Abstract
- The training set consists of many features that influence the classifier in different degrees. Choosing the most important features and rejecting those that do not carry relevant information is of great importance to the operating of the learned model. In the case of data streams, the importance of the features may additionally change over time. Such changes affect the performance of the classifier but can also be an important indicator of occurring concept-drift. In this work, we propose a new algorithm for data streams classification, called Random Forest with Features Importance (RFFI), which uses the measure of features importance as a drift detector. The RFFT algorithm implements solutions inspired by the Random Forest algorithm to the data stream scenarios. The proposed algorithm combines the ability of ensemble methods for handling slow changes in a data stream with a new method for detecting concept drift occurrence. The work contains an experimental analysis of the proposed algorithm, carried out on synthetic and real data.
- Subjects :
- Random Forest
Drift detection
Computer science
Data stream mining
02 engineering and technology
computer.software_genre
Data Stream Mining
Random forest
Artificial Intelligence
Hardware and Architecture
020204 information systems
Modeling and Simulation
0202 electrical engineering, electronic engineering, information engineering
Computer science and engineering [Engineering]
020201 artificial intelligence & image processing
Computer Vision and Pattern Recognition
Data mining
computer
Information Systems
Subjects
Details
- ISSN :
- 20832567
- Volume :
- 10
- Database :
- OpenAIRE
- Journal :
- Journal of Artificial Intelligence and Soft Computing Research
- Accession number :
- edsair.doi.dedup.....00e1e618f608741aea1c3ae9c555f8b9