Back to Search
Start Over
A passive and inclusive strategy to impute missing values of a composite categorical variable with an application to determine HIV transmission categories
- Source :
- Annals of epidemiology. 51
- Publication Year :
- 2020
-
Abstract
- Purpose Multiple imputation (MI) is a widely acceptable approach to missing data problems in epidemiological studies. Composite variables are often used to summarize information from multiple, correlated items. This study aims to assess and compare different MI methods for handling missing categorical composite variables. Methods We investigate the problem in the context of a real application: estimating the prevalence of HIV transmission category, which is a composite variable generated by applying a hierarchical algorithm to a group of binary risk source variables from a national program data set. We use simulation studies to compare and assess the performance of alternative MI strategies. These methods include the active imputation, just another variable, and the passive imputation approaches. Results Our study suggests that the passive imputation approach performs better than the direct imputation approach and the inclusive and general imputation model (i.e. passive imputation with interactions) performs the best. There is no need to embed the information from the variable-combining algorithm in the passive imputation modeling. Conclusion We recommend practitioners adopting an inclusive and general passive imputation modeling strategy.
- Subjects :
- 2019-20 coronavirus outbreak
Epidemiology
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
HIV Infections
Risk source
Machine learning
computer.software_genre
01 natural sciences
03 medical and health sciences
0302 clinical medicine
Prevalence
Statistics::Methodology
Medicine
Humans
Computer Simulation
030212 general & internal medicine
Imputation (statistics)
0101 mathematics
Hiv transmission
Categorical variable
Models, Statistical
Statistics::Applications
business.industry
010102 general mathematics
Missing data
Hierarchical algorithm
Data_GENERAL
Data Interpretation, Statistical
Artificial intelligence
business
computer
Subjects
Details
- ISSN :
- 18732585
- Volume :
- 51
- Database :
- OpenAIRE
- Journal :
- Annals of epidemiology
- Accession number :
- edsair.doi.dedup.....d7970c90d175e65e152d2b1b5836e28a