Back to Search Start Over

Large-scale assessment of PFAS compounds in drinking water sources using machine learning.

Authors :
Fernandez, Nicolas
Nejadhashemi, A. Pouyan
Loveall, Christian
Source :
Water Research. Sep2023, Vol. 243, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

• Monitoring PFAS in drinking water has increased due to health concerns. • Built models to detect & estimate PFAS in drinking water wells. • Random forest & autoregressive models enhance PFAS detection & estimation. • PFAS drivers include pollution sources, soil, geology, groundwater, & demography. The monitoring of Per and Polyfluoroalkyl substances (PFAS) in drinking water sources has significantly increased due to their recognition as a major public health concern. This information has been utilized to assess the importance of potential explanatory variables in determining the presence and concentration of PFAS in different regions. Nevertheless, the significance of these variables and the reliability of the methods in regions beyond where they were initially tested is still uncertain. Hence, our research pursues two main objectives: 1) to evaluate the validity of the aforementioned variables and methods for several PFAS species in a different area and 2) to build on existing modeling work; a new PFAS predictive model is introduced which is more reliable in determining the presence and concentration of PFAS at a regional level. To achieve these goals, we reconstructed four state-of-the-art models using a statewide dataset available for Michigan. These models involve spatial regression techniques, classification and regression random forest algorithms, and boosted regression trees. They also include numerous explanatory variables, such as features of local soil and hydrology and the number of nearby contamination sources. Then, we use a Bayesian selection approach to find the most relevant among these variables. Finally, we employ the most relevant covariates to assess PFAS occurrence and estimate their concentration using a novel combination of machine learning algorithms and conditional autoregressive (CAR) modeling. In the first case, PFAS occurrence was assessed with an accuracy comparable to the reconstructed models (>90%) while using significantly fewer variables. In the second case, by maintaining low data requirements, the estimated concentrations of most PFAS compounds were more closely aligned with available observations compared to previous methods, with correlation coefficients ρ > 0.90 and R 2 > 0.77. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00431354
Volume :
243
Database :
Academic Search Index
Journal :
Water Research
Publication Type :
Academic Journal
Accession number :
171339631
Full Text :
https://doi.org/10.1016/j.watres.2023.120307