Back to Search Start Over

Machine learning algorithms for predicting roadside fine particulate matter concentration level in Hong Kong Central.

Authors :
Yin Zhao
Abu Hasan, Yahya
Source :
Computational Ecology & Software; 9/1/2013, Vol. 3 Issue 3, p61-73, 13p
Publication Year :
2013

Abstract

Data mining is an approach to discover knowledge from large data. Pollutant forecasting is an important problem in the environmental sciences. This paper tries to use data mining methods to forecast fine particles (PM<subscript>2.5</subscript>) concentration level in Hong Kong Central, which is a famous business centre in Asia. There are several classification algorithms available in data mining, such as Artificial Neural Network (ANN) and Support Vector Machine (SVM). ANN and SVM are both machine learning algorithm used in variant area. This paper builds PM<subscript>2.5</subscript> concentration level predictive models based on ANN and SVM by using R packages. The data set includes 2008-2011 period meteorological data and PM<subscript>2.5</subscript> data. The PM<subscript>2.5</subscript> concentration is divided into 2 levels: low and high. The critical point is 40μg/m³ (24 hours mean), which is based on the standard of US Environmental Protection Agency (EPA). The parameters of both models are selected by multiple cross validation. According to 100 times 10-fold cross validation, the testing accuracy of SVM is around 0.803∼0.820, which is much better than ANN whose accuracy is around 0.746∼0.793. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
2220721X
Volume :
3
Issue :
3
Database :
Supplemental Index
Journal :
Computational Ecology & Software
Publication Type :
Academic Journal
Accession number :
89638154