Back to Search Start Over

Semi-Supervised Anomaly Detection Algorithm Using Probabilistic Labeling (SAD-PL)

Authors :
Kibae Lee
Chong Hyun Lee
Jongkil Lee
Source :
IEEE Access, Vol 9, Pp 142972-142981 (2021)
Publication Year :
2021
Publisher :
IEEE, 2021.

Abstract

To detect abnormal data via semi-supervised learning, unlabeled data are generally assumed to be normal data. This assumption, however, causes inevitable performance degradation when a small fraction of abnormal data is included in the unlabeled dataset. To overcome the degradation and to maintain stable detection performance, we propose a semi-supervised anomaly detection algorithm using probabilistic labeling (SAD-PL) for unlabeled data. The proposed SAD-PL is composed of two steps: (1) estimating local outlier factor (LOF) scores of latent vectors from both labeled and unlabeled data and (2) estimating labeling probability on the unlabeled data by using the prior missing probability of the labeled data via the Neyman-Pearson (NP) criterion. The SAD-PL runs iteratively by using the proposed complementary learning functions until the rate of label changes is lower than the predefined threshold. Experimental results reveal that the SAD-PL shows superior detection probability over the existing algorithms and stable performance regardless of the normal to abnormal data ratio in unlabeled data and the ratio of change variation of unlabeled data statistics to labeled data statistics.

Details

Language :
English
ISSN :
21693536
Volume :
9
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.4811eb232c2640888111bce2565c716d
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2021.3120710