1. From Point to probabilistic gradient boosting for claim frequency and severity prediction
- Author
-
Chevalier, Dominik and Côté, Marie-Pier
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning ,62P05, 68T05 ,I.2.6 ,I.5.1 ,G.3 ,A.1 - Abstract
Gradient boosting for decision tree algorithms are increasingly used in actuarial applications as they show superior predictive performance over traditional generalized linear models. Many improvements and sophistications to the first gradient boosting machine algorithm exist. We present in a unified notation, and contrast, all the existing point and probabilistic gradient boosting for decision tree algorithms: GBM, XGBoost, DART, LightGBM, CatBoost, EGBM, PGBM, XGBoostLSS, cyclic GBM, and NGBoost. In this comprehensive numerical study, we compare their performance on five publicly available datasets for claim frequency and severity, of various size and comprising different number of (high cardinality) categorical variables. We explain how varying exposure-to-risk can be handled with boosting in frequency models. We compare the algorithms on the basis of computational efficiency, predictive performance, and model adequacy. LightGBM and XGBoostLSS win in terms of computational efficiency. The fully interpretable EGBM achieves competitive predictive performance compared to the black box algorithms considered. We find that there is no trade-off between model adequacy and predictive accuracy: both are achievable simultaneously., Comment: 26 pages, 4 figures, 26 tables, 7 algorithms
- Published
- 2024