1. Application of all relevant feature selection for failure analysis of parameter-induced simulation crashes in climate models.
- Author
-
Paja, W., Wrzesień, M., Niemiec, R., and Rudnicki, W. R.
- Subjects
ATMOSPHERIC models ,MACHINE learning ,FAILURE analysis ,SIMULATION methods & models ,MACHINE theory - Abstract
The climate models are extremely complex pieces of software. They reflect best knowledge on physical components of the climate, nevertheless, they contain several parameters, which are too weakly constrained by observations, and can potentially lead to a crash of simulation. Recently a study by Lucas et al. (2013) has shown that machine learning methods can be used for predicting which combinations of parameters can lead to crash of simulation, and hence which processes described by these parameters need refined analyses. In the current study we reanalyse the dataset used in this research using different methodology. We confirm the main conclusion of the original study concerning suitability of machine learning for prediction of crashes. We show, that only three of the eight parameters indicated in the original study as relevant for prediction of the crash are indeed strongly relevant, three other are relevant but redundant, and two are not relevant at all. We also show that the variance due to split of data between training and validation sets has large influence both on accuracy of predictions and relative importance of variables, hence only cross-validated approach can deliver robust prediction of performance and relevance of variables. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF