Back to Search Start Over

Variable selection for multiply-imputed data with penalized generalized estimating equations.

Authors :
Geronimi, J.
Saporta, G.
Source :
Computational Statistics & Data Analysis. Jun2017, Vol. 110, p103-114. 12p.
Publication Year :
2017

Abstract

Generalized estimating equations (GEE) are useful tools for marginal regression analysis for longitudinal data. Having a high number of variables along with the presence of missing data presents complex issues when working in a longitudinal context. In variable selection for instance, penalized generalized estimating equations have not been systematically developed to integrate missing data. The MI-PGEE: multiple imputation-penalized generalized estimating equations, an extension of the multiple imputation-least absolute shrinkage and selection operator (MI-LASSO) is presented. MI-PGEE allows integration of missing data and within-subject correlation in variable selection procedures. Missing data are dealt with using multiple imputation, and variable selection is performed using a group LASSO penalty. Estimated coefficients for the same variable across multiply-imputed datasets are considered as a group while applying penalized generalized estimating equations, leading to a unique model across multiply-imputed datasets. In order to select the tuning parameter, a new BIC-like criterion is proposed. In a simulation study, the advantage of using MI-PGEE compared to simple imputation PGEE is shown. The usefulness of the new method is illustrated by an application to a subgroup of the placebo arm of the strontium ranelate efficacy in knee osteoarthritis trial study. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01679473
Volume :
110
Database :
Academic Search Index
Journal :
Computational Statistics & Data Analysis
Publication Type :
Periodical
Accession number :
121401518
Full Text :
https://doi.org/10.1016/j.csda.2017.01.001