Modeling the Bias of Digital Data: An Approach to Combining Digital With Official Statistics to Estimate and Predict Migration Trends

Authors :
Yuan Hsiao
Lee Fiorio
Jonathan Wakefield
Emilio Zagheni
Source :
Sociological Methods & Research. :004912412211401
Publication Year :
2023
Publisher :
SAGE Publications, 2023.

Abstract

Obtaining reliable and timely estimates of migration flows is critical for advancing migration theory and guiding policy decisions, but it remains a challenge. Digital data provide granular information on time and space, but do not draw from representative samples of the population, leading to biased estimates. We propose a method for combining digital data and official statistics by using the official statistics to model the spatial and temporal dependence structure of the biases of digital data. We use simulations to demonstrate the validity of the model, then empirically illustrate our approach by combining geo-located Twitter data with data from the American Community Survey (ACS) to estimate state-level out-migration probabilities in the United States. We show that our model, which combines unbiased and biased data, produces predictions that are more accurate than predictions based solely on unbiased data. Our approach demonstrates how digital data can be used to complement, rather than replace, official statistics.
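
The general idea of using official statistics to learn the bias of a digital source can be illustrated with a toy sketch. This is not the model from the article, which treats the spatial and temporal dependence structure of the biases; the sketch below assumes a single region whose digital estimates carry one constant bias on the logit scale. All function names and all numbers are hypothetical and chosen for illustration only.

```python
import math


def logit(p):
    """Map a probability in (0, 1) to the real line."""
    return math.log(p / (1.0 - p))


def inv_logit(x):
    """Map a real number back to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def estimate_bias(official_past, digital_past):
    """Average logit-scale gap between digital and official estimates.

    Assumption: the bias is constant over time, a strong simplification
    made only to keep the example short.
    """
    gaps = [logit(d) - logit(o) for o, d in zip(official_past, digital_past)]
    return sum(gaps) / len(gaps)


def corrected_prediction(official_past, digital_past, digital_now):
    """Subtract the learned bias from the newest digital estimate."""
    bias = estimate_bias(official_past, digital_past)
    return inv_logit(logit(digital_now) - bias)


# Hypothetical out-migration probabilities for one region over four years.
true_probs = [0.020, 0.022, 0.025, 0.030]

# Hypothetical digital source that overstates migration by 0.5 on the logit scale.
assumed_bias = 0.5
digital = [inv_logit(logit(p) + assumed_bias) for p in true_probs]

# Official statistics are available for the first three years only.
official_past = true_probs[:3]

# Prediction for year four using the bias-corrected digital estimate.
prediction = corrected_prediction(official_past, digital[:3], digital[3])

# Baseline that uses official data alone: carry the last value forward.
carry_forward = official_past[-1]
```

In this constructed case the digital series tracks the year-four change that the lagged official series cannot yet see, so correcting it for the learned bias lands closer to the true value than carrying the last official figure forward. Real data would add sampling noise and a bias that varies across regions and over time, which is the situation the article addresses.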

Details

ISSN :
1552-8294 and 0049-1241
Database :
OpenAIRE
Journal :
Sociological Methods & Research
Accession number :
edsair.doi...........5ab61ec171fa7ccadbec45c9ec01043b
Full Text :
https://doi.org/10.1177/00491241221140144