Back to Search
Start Over
SACCOS: A Semi-Supervised Framework for Emerging Class Detection and Concept Drift Adaption Over Data Streams
- Source :
- IEEE Transactions on Knowledge and Data Engineering. 34:1416-1426
- Publication Year :
- 2022
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2022.
-
Abstract
- In this paper, we address the challenges of detecting instances from emerging classes over a non-stationary data stream during classification. In particular, instances from an entirely unknown class may appear in a data stream over time. Existing classification techniques utilize unsupervised clustering to identify the emergence of such data instances. Unfortunately, they make strong assumptions which are typically invalid in practice; (i) Most instances associated with a class are closer to each other in feature space than instances associated with different classes, (ii) Covariates of data are normalized through an oracle to overcome the effect of a few data instances having large feature values, and (iii) Labels of instances from emerging classes are readily available soon after detection. To address the challenges that occur in practice when the above assumptions are weak, we propose a practical semi-supervised emerging class detection framework. Particularly, we aim to identify similar data instances within local regions in feature space by incorporating a mutual graph clustering mechanism. Our empirical evaluation of this framework on real-world datasets demonstrates its superiority of classification performance compared to existing methods while using significantly fewer labeled instances.
- Subjects :
- Data stream
Concept drift
Computer science
Data stream mining
business.industry
Feature vector
Machine learning
computer.software_genre
Class (biology)
Oracle
Computer Science Applications
Computational Theory and Mathematics
Feature (machine learning)
Artificial intelligence
business
computer
Information Systems
Clustering coefficient
Subjects
Details
- ISSN :
- 23263865 and 10414347
- Volume :
- 34
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Knowledge and Data Engineering
- Accession number :
- edsair.doi...........4f71b5964de7cc282fb0a0137917f821
- Full Text :
- https://doi.org/10.1109/tkde.2020.2993193