Back to Search Start Over

Density-based outlier scoring on Kepler data.

Authors :
Giles, Daniel K
Walkowicz, Lucianne
Source :
Monthly Notices of the Royal Astronomical Society. Nov2020, Vol. 499 Issue 1, p524-542. 19p.
Publication Year :
2020

Abstract

In the present era of large-scale surveys, big data present new challenges to the discovery process for anomalous data. Such data can be indicative of systematic errors, extreme (or rare) forms of known phenomena, or most interestingly, truly novel phenomena that exhibit as-of-yet unobserved behaviours. In this work, we present an outlier scoring methodology to identify and characterize the most promising unusual sources to facilitate discoveries of such anomalous data. We have developed a data mining method based on k -nearest neighbour distance in feature space to efficiently identify the most anomalous light curves. We test variations of this method including using principal components of the feature space, removing select features, the effect of the choice of k , and scoring to subset samples. We evaluate the performance of our scoring on known object classes and find that our scoring consistently scores rare (<1000) object classes higher than common classes. We have applied scoring to all long cadence light curves of Quarters 1–17 of Kepler's prime mission and present outlier scores for all 2.8 million light curves for the roughly 200k objects. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00358711
Volume :
499
Issue :
1
Database :
Academic Search Index
Journal :
Monthly Notices of the Royal Astronomical Society
Publication Type :
Academic Journal
Accession number :
146608299
Full Text :
https://doi.org/10.1093/mnras/staa2736