Back to Search Start Over

Normal distribution based pseudo ML for missing data: With applications to mean and covariance structure analysis

Authors :
Ke-Hai Yuan
Source :
Journal of Multivariate Analysis. 100:1900-1918
Publication Year :
2009
Publisher :
Elsevier BV, 2009.

Abstract

When missing data are either missing completely at random (MCAR) or missing at random (MAR), the maximum likelihood (ML) estimation procedure preserves many of its properties. However, in any statistical modeling, the distribution specification for the likelihood function is at best only an approximation to the real world. In particular, since the normal-distribution-based ML is typically applied to data with heterogeneous marginal skewness and kurtosis, it is necessary to know whether such a practice still generates consistent parameter estimates. When the manifest variables are linear combinations of independent random components and missing data are MAR, this paper shows that the normal-distribution-based MLE is consistent regardless of the distribution of the sample. Examples also show that the consistency of the MLE is not guaranteed for all nonnormally distributed samples. When the population follows a confirmatory factor model, and data are missing due to the magnitude of the factors, the MLE may not be consistent even when data are normally distributed. When data are missing due to the magnitude of measurement errors/uniqueness, MLEs for many of the covariance parameters related to the missing variables are still consistent. This paper also identifies and discusses the factors that affect the asymptotic biases of the MLE when data are not missing at random. In addition, the paper also shows that, under certain data models and MAR mechanism, the MLE is asymptotically normally distributed and the asymptotic covariance matrix is consistently estimated by the commonly used sandwich-type covariance matrix. The results indicate that certain formulas and/or conclusions in the existing literature may not be entirely correct.

Details

ISSN :
0047259X
Volume :
100
Database :
OpenAIRE
Journal :
Journal of Multivariate Analysis
Accession number :
edsair.doi.dedup.....a860eec920948bc1251dd4cf0b8d04b9
Full Text :
https://doi.org/10.1016/j.jmva.2009.05.001