Back to Search Start Over

Identification of Factors Influencing Episodes of High PM 10 Concentrations in the Air in Krakow (Poland) Using Random Forest Method.

Authors :
Gorzelnik, Tomasz
Bogacki, Marek
Oleniacz, Robert
Source :
Sustainability (2071-1050); Oct2024, Vol. 16 Issue 20, p9015, 23p
Publication Year :
2024

Abstract

The episodes of elevated concentrations of different gaseous pollutants and particulate matter (PM) are of major concern worldwide, especially in city agglomerations. Krakow is an example of an urban–industrial agglomeration with constantly occurring PM<subscript>10</subscript> air limit value exceedances. In recent years, a number of legislative actions have been undertaken to improve air quality in this area. The multitude of factors affecting the emergence of cases of very high air pollutant concentrations makes it difficult to analyze them using simple statistical methods. Machine learning (ML) methods can be an adequate option, especially when proper amounts of credible data are available. The main aim of this paper was to examine the influence of various factors (including main gaseous pollutant concentrations and some meteorological factors) on the effect of high PM<subscript>10</subscript> concentration episodes in the ambient air in Krakow (Poland) using the random forest algorithm. The original methodology based on the PM<subscript>10</subscript> limit and binary classification of cases with and without the occurrence of high concentration episodes was developed. The data used were derived from routine public air quality monitoring and a local meteorological station. A range of random forest classification models with various predictor sets and for different subsets of the observations coupled with variable importance analysis were performed. The performance of the algorithm was assessed using confusion matrices. The variable importance rankings revealed, among other things, the dominant impact of the mixing layer height on elevated PM<subscript>10</subscript> concentration episode formation. This research work showed the usefulness of the random forest algorithm in identifying factors contributing to poor air quality, even in the absence of reliable emission data. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
20711050
Volume :
16
Issue :
20
Database :
Complementary Index
Journal :
Sustainability (2071-1050)
Publication Type :
Academic Journal
Accession number :
180488454
Full Text :
https://doi.org/10.3390/su16209015