Back to Search
Start Over
Uncovering Regional Defaults from Photorealistic Forests in Text-to-Image Generation with DALL-E 2
- Publication Year :
- 2024
-
Abstract
- Regional defaults describe the emerging phenomenon that text-to-image (T2I) foundation models used in generative AI are prone to over-proportionally depicting certain geographic regions to the exclusion of others. In this work, we introduce a scalable evaluation for uncovering such regional defaults. The evaluation consists of region hierarchy--based image generation and cross-level similarity comparisons. We carry out an experiment by prompting DALL-E 2, a state-of-the-art T2I generation model capable of generating photorealistic images, to depict a forest. We select forest as an object class that displays regional variation and can be characterized using spatial statistics. For a region in the hierarchy, our experiment reveals the regional defaults implicit in DALL-E 2, along with their scale-dependent nature and spatial relationships. In addition, we discover that the implicit defaults do not necessarily correspond to the most widely forested regions in reality. Our findings underscore a need for further investigation into the geography of T2I generation and other forms of generative AI.<br />Comment: Accepted by the 16th Conference on Spatial Information Theory (COSIT 2024): https://cosit.ca
Details
- Database :
- arXiv
- Publication Type :
- Report
- Accession number :
- edsarx.2410.17255
- Document Type :
- Working Paper