Back to Search Start Over

Evaluating external generalizability of machine learning models for recycled aggregate concrete property prediction.

Authors :
Jadhav, Shreyas Pandurang
Selvam, M.
Zunain, Sheik Mohammad
Bugalia, Nikhil
Source :
Journal of Cleaner Production. Sep2024, Vol. 469, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Machine learning (ML) models have gained importance in predicting recycled aggregate concrete (RAC) properties, offering supposed benefits over conventional empirical and statistical techniques. However, whether ML models are externally generalizable for predicting RAC properties remains unanswered. This study addresses this gap by developing a systematic experimental framework for evaluating the external generalizability of ML models for predicting the compressive strength of RAC. Using a literature review, the authors created a primary dataset of 414 data points and sourced a secondary dataset comprising 330 data points from a previous paper. In Phase 1, prominent ML models like Random Forest (RF) and Extreme Gradient Boost (XGB) were tested for high-accuracy prediction on both primary and secondary datasets. A coefficient of determination (R2) as high as 0.76 for the primary dataset and 0.82 for the secondary dataset for testing sets was obtained for XGB. However, when the best-performing models of phase 1 were trained and tested with data sourced from different datasets in varying combinations, the ML model's performance significantly deteriorated (R2 < 0.25), demonstrating that ML models are not externally generalizable. The study's results highlight the trade-off between the ML model's prediction accuracy within the given dataset and its external generalizability. The study reveals that complex ML models, like XGB, may over-fit specific data, reducing their generalizability. These findings call for the intelligent usage of ML tools to identify nuanced hypotheses and promote rigorous science rather than unilaterally working on the accuracy of ML models. The study emphasizes the consideration of ML models' generalizability in the future. • External Generalizability of the ML modeling is evaluated. • A systematic methodological framework using multiple datasets is presented. • Current ML models for materials research are likely not generalizable. • These findings call for the intelligent usage of ML tools. • The study emphasizes the consideration of ML models' generalizability in the future. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09596526
Volume :
469
Database :
Academic Search Index
Journal :
Journal of Cleaner Production
Publication Type :
Academic Journal
Accession number :
178810391
Full Text :
https://doi.org/10.1016/j.jclepro.2024.143166