Back to Search Start Over

A convolutional neural network approach to classifying urban spaces using generative tools for data augmentation.

Authors :
Medel-Vera, Carlos
Vidal-Estévez, Pelayo
Mädler, Thomas
Source :
International Journal of Architectural Computing; Sep2024, Vol. 22 Issue 3, p392-411, 20p
Publication Year :
2024

Abstract

This article discusses an application for classifying urban spaces using convolutional neural networks (CNNs). A seed dataset was initially generated composed of 630 photographs of urban spaces from the Adobe Stock repository. This dataset was topped up with images produced by two generative artificial intelligence (AI) engines, namely, Deep Dream Generator and Midjourney, making two additional augmented datasets, each composed of 2200 images. The training process was carried out using four well-known CNNs, namely, GoogLeNet, ResNet-18, ShuffleNet, and MobileNet-v2. The results show an increase of roughly 30% in the predicting capabilities in both augmented datasets when compared to the seed dataset. Furthermore, performance metrics are generally higher when using ResNet-18 which may suggest that this CNN architecture is more applicable to urban classification projects. Finally, although both generative AI engines have similar performance, Midjourney seems to slightly outperform Deep Dream Generator as a data augmentation engine for urban spaces. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14780771
Volume :
22
Issue :
3
Database :
Complementary Index
Journal :
International Journal of Architectural Computing
Publication Type :
Academic Journal
Accession number :
180522616
Full Text :
https://doi.org/10.1177/14780771231225697