Back to Search Start Over

Galaxy And Mass Assembly: automatic morphological classification of galaxies using statistical learning.

Authors :
Sreejith, Sreevarsha
Pereverzyev Jr., Sergiy
Kelvin, Lee S.
Marleau, Francine R.
Haltmeier, Markus
Ebner, Judith
Bland-Hawthorn, Joss
Driver, Simon P.
Graham, Alister W.
Holwerda, Benne W.
Hopkins, Andrew M.
Liske, Jochen
Loveday, Jon
Moffett, Amanda J.
Pimbblet, Kevin A.
Taylor, Edward N.
Lingyu Wang
Wright, Angus. H.
Source :
Monthly Notices of the Royal Astronomical Society. Mar2018, Vol. 474 Issue 4, p5232-5258. 27p. 6 Color Photographs, 3 Diagrams, 11 Charts, 12 Graphs.
Publication Year :
2018

Abstract

We apply four statistical learning methods to a sample of 7941 galaxies (z < 0.06) from the Galaxy And Mass Assembly survey to test the feasibility of using automated algorithms to classify galaxies. Using 10 features measured for each galaxy (sizes, colours, shape parameters, and stellar mass), we apply the techniques of Support Vector Machines, Classification Trees, Classification Trees with Random Forest (CTRF) and Neural Networks, and returning True Prediction Ratios (TPRs) of 75.8 per cent, 69.0 per cent, 76.2 per cent, and 76.0 per cent, respectively. Those occasions whereby all four algorithms agree with each other yet disagree with the visual classification (‘unanimous disagreement’) serves as a potential indicator of human error in classification, occurring in ∼ 9 per cent of ellipticals, ∼ 9 per cent of little blue spheroids, ∼ 14 per cent of early-type spirals, ∼ 21 per cent of intermediate-type spirals, and ∼ 4 per cent of late-type spirals and irregulars. We observe that the choice of parameters rather than that of algorithms is more crucial in determining classification accuracy. Due to its simplicity in formulation and implementation, we recommend the CTRF algorithm for classifying future galaxy data sets. Adopting the CTRF algorithm, the TPRs of the five galaxy types are : E, 70.1 per cent; LBS, 75.6 per cent; S0–Sa, 63.6 per cent; Sab–Scd, 56.4 per cent, and Sd–Irr, 88.9 per cent. Further, we train a binary classifier using this CTRF algorithm that divides galaxies into spheroid-dominated (E, LBS, and S0–Sa) and disc-dominated (Sab–Scd and Sd–Irr), achieving an overall accuracy of 89.8 per cent. This translates into an accuracy of 84.9 per cent for spheroid-dominated systems and 92.5 per cent for disc-dominated systems. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00358711
Volume :
474
Issue :
4
Database :
Academic Search Index
Journal :
Monthly Notices of the Royal Astronomical Society
Publication Type :
Academic Journal
Accession number :
128124038
Full Text :
https://doi.org/10.1093/mnras/stx2976