Back to Search Start Over

Informed conditioning on clinical covariates increases power in case-control association studies.

Authors :
Noah Zaitlen
Sara Lindström
Bogdan Pasaniuc
Marilyn Cornelis
Giulio Genovese
Samuela Pollack
Anne Barton
Heike Bickeböller
Donald W Bowden
Steve Eyre
Barry I Freedman
David J Friedman
John K Field
Leif Groop
Aage Haugen
Joachim Heinrich
Brian E Henderson
Pamela J Hicks
Lynne J Hocking
Laurence N Kolonel
Maria Teresa Landi
Carl D Langefeld
Loic Le Marchand
Michael Meister
Ann W Morgan
Olaide Y Raji
Angela Risch
Albert Rosenberger
David Scherf
Sophia Steer
Martin Walshaw
Kevin M Waters
Anthony G Wilson
Paul Wordsworth
Shanbeh Zienolddiny
Eric Tchetgen Tchetgen
Christopher Haiman
David J Hunter
Robert M Plenge
Jane Worthington
David C Christiani
Debra A Schaumberg
Daniel I Chasman
David Altshuler
Benjamin Voight
Peter Kraft
Nick Patterson
Alkes L Price
Source :
PLoS Genetics, Vol 8, Iss 11, p e1003032 (2012)
Publication Year :
2012
Publisher :
Public Library of Science (PLoS), 2012.

Abstract

Genetic case-control association studies often include data on clinical covariates, such as body mass index (BMI), smoking status, or age, that may modify the underlying genetic risk of case or control samples. For example, in type 2 diabetes, odds ratios for established variants estimated from low-BMI cases are larger than those estimated from high-BMI cases. An unanswered question is how to use this information to maximize statistical power in case-control studies that ascertain individuals on the basis of phenotype (case-control ascertainment) or phenotype and clinical covariates (case-control-covariate ascertainment). While current approaches improve power in studies with random ascertainment, they often lose power under case-control ascertainment and fail to capture available power increases under case-control-covariate ascertainment. We show that an informed conditioning approach, based on the liability threshold model with parameters informed by external epidemiological information, fully accounts for disease prevalence and non-random ascertainment of phenotype as well as covariates and provides a substantial increase in power while maintaining a properly controlled false-positive rate. Our method outperforms standard case-control association tests with or without covariates, tests of gene x covariate interaction, and previously proposed tests for dealing with covariates in ascertained data, with especially large improvements in the case of case-control-covariate ascertainment. We investigate empirical case-control studies of type 2 diabetes, prostate cancer, lung cancer, breast cancer, rheumatoid arthritis, age-related macular degeneration, and end-stage kidney disease over a total of 89,726 samples. In these datasets, informed conditioning outperforms logistic regression for 115 of the 157 known associated variants investigated (P-value = 1 × 10(-9)). The improvement varied across diseases with a 16% median increase in χ(2) test statistics and a commensurate increase in power. This suggests that applying our method to existing and future association studies of these diseases may identify novel disease loci.

Subjects

Subjects :
Genetics
QH426-470

Details

Language :
English
ISSN :
15537390 and 15537404
Volume :
8
Issue :
11
Database :
Directory of Open Access Journals
Journal :
PLoS Genetics
Publication Type :
Academic Journal
Accession number :
edsdoj.708d0c6242ad40ada417d4c7a6f43814
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pgen.1003032