Back to Search Start Over

Outlier Detection with the Use of Isolation Forests

Authors :
Krystian Zieliński
Krzysztof Najman
Source :
Data Analysis and Classification ISBN: 9783030751890
Publication Year :
2021
Publisher :
Springer International Publishing, 2021.

Abstract

Appropriate preparation of data for analysis is a key element in empirical research. Considering the source of data or the nature of the phenomenon studied, some observations may differ significantly from others. Inclusion of such cases in a research may seriously distort the profile of the population under examination. Nevertheless, their omission can be equally disadvantageous. When analyzing dynamically changing phenomena, especially in case of big data, a relatively small amount of outliers may constitute a coherent and internally homogeneous group, which, along with the registration of subsequent observations, may grow into an independent cluster. Whether or not an outlier is removed from the dataset, researcher must be first aware of its existence. For this purpose, an appropriate method of anomaly detection should be used. Identification of such units allows the researcher to make an appropriate decision regarding the further steps in the analysis.

Details

ISBN :
978-3-030-75189-0
ISBNs :
9783030751890
Database :
OpenAIRE
Journal :
Data Analysis and Classification ISBN: 9783030751890
Accession number :
edsair.doi...........e938431a1ad778eb8a2cd745824f6bc5
Full Text :
https://doi.org/10.1007/978-3-030-75190-6_5