Back to Search Start Over

Analyzing Geospatial and Socioeconomic Disparities in Breast Cancer Screening Among Populations in the United States: Machine Learning Approach

Authors :
Hashtarkhani, Soheil
Zhou, Yiwang
Kumsa, Fekede Asefa
White-Means, Shelley
Schwartz, David L
Shaban-Nejad, Arash
Source :
JMIR Cancer 2025;11:e59882
Publication Year :
2025

Abstract

Breast cancer screening plays a pivotal role in early detection and subsequent effective management of the disease, impacting patient outcomes and survival rates. This study aims to assess breast cancer screening rates nationwide in the United States and investigate the impact of social determinants of health on these screening rates. Data on mammography screening at the census tract level for 2018 and 2020 were collected from the Behavioral Risk Factor Surveillance System. We developed a large dataset of social determinants of health, comprising 13 variables for 72337 census tracts. Spatial analysis employing Getis-Ord Gi statistics was used to identify clusters of high and low breast cancer screening rates. To evaluate the influence of these social determinants, we implemented a random forest model, with the aim of comparing its performance to linear regression and support vector machine models. The models were evaluated using R2 and root mean squared error metrics. Shapley Additive Explanations values were subsequently used to assess the significance of variables and direction of their influence. Geospatial analysis revealed elevated screening rates in the eastern and northern United States, while central and midwestern regions exhibited lower rates. The random forest model demonstrated superior performance, with an R2=64.53 and root mean squared error of 2.06 compared to linear regression and support vector machine models. Shapley Additive Explanations values indicated that the percentage of the Black population, the number of mammography facilities within a 10-mile radius, and the percentage of the population with at least a bachelor's degree were the most influential variables, all positively associated with mammography screening rates.<br />Comment: 11 Pages, 4 Figures, 2 Tables

Details

Database :
arXiv
Journal :
JMIR Cancer 2025;11:e59882
Publication Type :
Report
Accession number :
edsarx.2502.06800
Document Type :
Working Paper
Full Text :
https://doi.org/10.2196/59882