1. Systematic Evaluation of Land Use Regression Models for NO2
- Author
-
Kees Meliefste, Meng Wang, Bert Brunekreef, Gerard Hoek, Rob Beelen, and Marloes Eeftens
- Subjects
Mean squared error ,Training data sets ,Statistics ,Range (statistics) ,Environmental Chemistry ,Sampling (statistics) ,Contrast (statistics) ,Spatial variability ,General Chemistry ,Land use regression ,Mathematics ,Test data - Abstract
Land use regression (LUR) models have become popular to explain the spatial variation of air pollution concentrations. Independent evaluation is important. We developed LUR models for nitrogen dioxide (NO2) using measurements conducted at 144 sampling sites in The Netherlands. Sites were randomly divided into training data sets with a size of 24, 36, 48, 72, 96, 108, and 120 sites. LUR models were evaluated using (1) internal “leave-one-out-cross-validation (LOOCV)” within the training data sets and (2) external “hold-out” validation (HV) against independent test data sets. In addition, we calculated Mean Square Error based validation R2s. The mean adjusted model and LOOCV R2 slightly decreased from 0.87 to 0.82 and 0.83 to 0.79, respectively, with an increasing number of training sites. In contrast, the mean HV R2 was lowest (0.60) with the smallest training sets and increased to 0.74 with the largest training sets. Predicted concentrations were more accurate in sites with out of range values for predict...
- Published
- 2012