1. Transient-optimized real-bogus classification with Bayesian convolutional neural networks - sifting the GOTO candidate stream
- Author
-
P. T. O'Brien, David Mkrtichian, Kendall Ackley, U. Burhanudin, T. Heikkilä, R. Cutter, Andrew J. Levan, Paul Chote, Benjamin P. Gompertz, Justyn R. Maund, Supachai Awiphan, Y. L. Mong, Klaas Wiersema, E. J. Daw, James McCormac, G. Ramsay, Krzysztof Ulaczyk, S. Tooke, Enric Palle, D. Mata Sánchez, R. Eyles-Ferris, Christopher J. Duffy, T. Killestein, Saran Poshyachinda, Eric Thrane, Seppo Mattila, James Mullaney, S. Williams, E. Rol, Puji Irawati, S. Aukkaravittayapun, L. K. Nuttall, Don Pollacco, Rubina Kotak, Danny Steeghs, Rene P. Breton, Utane Sawangwit, R. L. C. Starling, A. Chrimes, J. D. Lyman, L. Makrygianni, Elizabeth R. Stanway, Mark Kennedy, S. P. Littlefair, P. A. Strøm, Duncan K. Galloway, Martin J. Dyer, and V. S. Dhillon
- Subjects
Goto ,Test data generation ,Active learning (machine learning) ,Astronomy ,Bayesian probability ,FOS: Physical sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Convolutional neural network ,photometric [techniques] ,surveys ,0103 physical sciences ,Classifier (linguistics) ,ST/T007184/1 ,data analysis [methods] ,Transient (computer programming) ,010306 general physics ,QA ,010303 astronomy & astrophysics ,Instrumentation and Methods for Astrophysics (astro-ph.IM) ,STFC ,QC ,Physics ,astro-ph.HE ,High Energy Astrophysical Phenomena (astro-ph.HE) ,business.industry ,RCUK ,Astronomy and Astrophysics ,ST/T003103/1 ,Space and Planetary Science ,Scalability ,Artificial intelligence ,ST/P000495/1 ,business ,Astrophysics - Instrumentation and Methods for Astrophysics ,Astrophysics - High Energy Astrophysical Phenomena ,computer ,astro-ph.IM - Abstract
Large-scale sky surveys have played a transformative role in our understanding of astrophysical transients, only made possible by increasingly powerful machine learning-based filtering to accurately sift through the vast quantities of incoming data generated. In this paper, we present a new real-bogus classifier based on a Bayesian convolutional neural network that provides nuanced, uncertainty-aware classification of transient candidates in difference imaging, and demonstrate its application to the datastream from the GOTO wide-field optical survey. Not only are candidates assigned a well-calibrated probability of being real, but also an associated confidence that can be used to prioritise human vetting efforts and inform future model optimisation via active learning. To fully realise the potential of this architecture, we present a fully-automated training set generation method which requires no human labelling, incorporating a novel data-driven augmentation method to significantly improve the recovery of faint and nuclear transient sources. We achieve competitive classification accuracy (FPR and FNR both below 1%) compared against classifiers trained with fully human-labelled datasets, whilst being significantly quicker and less labour-intensive to build. This data-driven approach is uniquely scalable to the upcoming challenges and data needs of next-generation transient surveys. We make our data generation and model training codes available to the community., 17 pages, 12 figures, resubmitted to MNRAS following reviewer comments
- Published
- 2021
- Full Text
- View/download PDF