Back to Search Start Over

Simulation of dew point temperature in different time scales based on grasshopper algorithm optimized extreme gradient boosting.

Authors :
Dong, Jianhua
Zeng, Wenzhi
Lei, Guoqing
Wu, Lifeng
Chen, Haorui
Wu, Jingwei
Huang, Jiesheng
Gaiser, Thomas
Srivastava, Amit Kumar
Source :
Journal of Hydrology. Mar2022, Vol. 606, pN.PAG-N.PAG. 1p.
Publication Year :
2022

Abstract

• The GOA-XGBoost model was the best model for estimating T dew. • The most important meteorological input was actual vapor pressure. • The hybrid modelling effects for T dew under daily and hourly scales were assessed. Dew point temperature (T dew) plays an important role in hydrology, meteorology, and other related research. This study evaluated the ability of a new machine learning model (hybrid extreme gradient boosting with grasshopper optimization algorithm (GOA-XGBoost)) to estimate T dew and compared it with two other tree-based models (XGBoost and random forest (RF)). We collected meteorological data namely actual vapor pressure (e a), maximum air temperature (T max), minimum air temperature (T min), maximum relative humidity (RH max), minimum relative humidity (RH min), atmospheric pressure (P a), 2 m high wind speed (U d), during 2016–2019 on daily and hourly time scales from the Sijiqinglin station in China to train, test, and validate each model. The results showed that the GOA-XGBoost model performed best, and the RF model had severe over-fitting problems during the validation phase at daily time scale. The models showed the best accuracy and stability when the input was e a (on average R2 = 1.000, RMSE = 0.296℃, MBE = 0.001℃, MAE = 0.167℃, and KGE = 0.991). The models had more significant errors when the inputs were T max , T min (on average R2 = 0.721, RMSE = 6.756℃, MBE = -0.101℃, MAE = 5.071℃, and KGE = 0.771). The estimation loss exhibited by the models were similar for the hourly and daily scale patterns. T and RH were the most basic meteorological factors and adding extraneous factors would affect the estimation accuracy of the model. The variability of meteorological data varied less on an hourly scale than on a daily scale. Therefore, the accuracy of the models was higher, but the data set and the volume of operations became larger. This led to a possible reduction in model stability, but the hourly scales are better suited for assessing the effects of simulations in extreme situations. Taking accuracy and stability into account, the GOA-XGBoost model was the best model and the most practical input for both time scales was e a. Therefore, in subsequent studies, the GOA-XGBoost model can be combined with the input e a to estimate T dew accurately. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00221694
Volume :
606
Database :
Academic Search Index
Journal :
Journal of Hydrology
Publication Type :
Academic Journal
Accession number :
155206948
Full Text :
https://doi.org/10.1016/j.jhydrol.2022.127452