Back to Search
Start Over
Multi-source Machine Learning for AQI Estimation
- Source :
- IEEE BigData
- Publication Year :
- 2020
- Publisher :
- IEEE, 2020.
-
Abstract
- In many countries worldwide, effectively estimating AQI values and levels is essential for better monitoring the air pollution around the living area. This problem has become one of the interesting research subjects for many years, and there are many applications developed for personal usages. In this work, we aim to investigate a multi-source machine learning approach to approximate the local AQI scores at users’ location in a big city. We conduct different experiments on three primary data sets: "SEPHLA-MediaEval 2019", "MNR-Air-HCM," and "MNR-HCM," collected in Ho Chi Minh City (Vietnam) and Fukuoka city (Japan). From the data sets provided, we extract different types of useful attributes for the problem: the timestamp information, the geographical data, sensor data (humidity and temperature), users’ emotion tags (such as greenness, calmness, etc.), the semantic features from images captured by users as well as the public weather data (including temperature, dew point, humidity, wind speed, and pressure) of the related cities. After that, we compare five distinct machine learning models for estimating the local AQI score and level, including Support Vector Machine [1], Random Forest [2], Extreme Gradient Boosting [3], LightGBM [4] and CatBoost [5]. We use RMSE, MAE, and R2 for measuring the performance of these approaches. The experimental results show that using random forest with sensor data, combined with public weather data, the results in AQI values regression and AQI ranks prediction can be the highest in many cases.
- Subjects :
- Computer science
business.industry
Big data
Feature extraction
02 engineering and technology
010501 environmental sciences
Machine learning
computer.software_genre
01 natural sciences
Random forest
Support vector machine
Dew point
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Artificial intelligence
Timestamp
business
Calmness
computer
Multi-source
0105 earth and related environmental sciences
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2020 IEEE International Conference on Big Data (Big Data)
- Accession number :
- edsair.doi...........261109432aece0af529639899cf81ea6
- Full Text :
- https://doi.org/10.1109/bigdata50022.2020.9378322