1. Interval estimation of genetic susceptibility for retrospective case-control studies
- Author
-
Dmitri Zaykin, Meng, Zhaoling, and Ghosh, Sujit K.
- Subjects
Molecular Epidemiology ,Models, Statistical ,lcsh:QH426-470 ,Methodology Article ,Bayes Theorem ,lcsh:Genetics ,Haplotypes ,Pharmacogenetics ,Case-Control Studies ,Sample Size ,Prevalence ,Humans ,Genetic Predisposition to Disease ,Alleles ,Retrospective Studies - Abstract
Background This article describes classical and Bayesian interval estimation of genetic susceptibility based on random samples with pre-specified numbers of unrelated cases and controls. Results Frequencies of genotypes in cases and controls can be estimated directly from retrospective case-control data. On the other hand, genetic susceptibility defined as the expected proportion of cases among individuals with a particular genotype depends on the population proportion of cases (prevalence). Given this design, prevalence is an external parameter and hence the susceptibility cannot be estimated based on only the observed data. Interval estimation of susceptibility that can incorporate uncertainty in prevalence values is explored from both classical and Bayesian perspective. Similarity between classical and Bayesian interval estimates in terms of frequentist coverage probabilities for this problem allows an appealing interpretation of classical intervals as bounds for genetic susceptibility. In addition, it is observed that both the asymptotic classical and Bayesian interval estimates have comparable average length. These interval estimates serve as a very good approximation to the "exact" (finite sample) Bayesian interval estimates. Extension from genotypic to allelic susceptibility intervals shows dependency on phenotype-induced deviations from Hardy-Weinberg equilibrium. Conclusions The suggested classical and Bayesian interval estimates appear to perform reasonably well. Generally, the use of exact Bayesian interval estimation method is recommended for genetic susceptibility, however the asymptotic classical and approximate Bayesian methods are adequate for sample sizes of at least 50 cases and controls.