Stephen S. Rich, Xiuqing Guo, Bruce M. Psaty, Mina K. Chung, Aladdin H. Shadyab, Nicholas L. Smith, Dan M. Roden, Christine M. Albert, Braxton D. Mitchell, Y.-D. Ida Chen, Matthew A. Allison, Loic Yengo, Barbara McKnight, Jerome I. Rotter, Leslie A. Lange, Mariza de Andrade, Patrick T. Ellinor, Charles Kooperberg, Cathy C. Laurie, Merry-Lynn McDonald, Ramachandran S. Vasan, Ryan D. Hernandez, Kari E. North, Peter M. Visscher, Pierrick Wainschtein, Rasika A. Mathias, Zhili Zheng, Bruce S. Weir, L. Adrienne Cupples, Lisa R. Yanek, Donna K. Arnett, Stephen T. McGarvey, Elizabeth A. Regan, Jian Yang, Ching-Ti Liu, Dawood Darbar, Eric Boerwinkle, Deepti Jain, Susan R. Heckbert, Susan Redline, and Benjamin Shoemaker
Heritability, the proportion of phenotypic variance explained by genetic factors, can be estimated from pedigree data 1, but such estimates are uninformative with respect to the underlying genetic architecture. Analyses of data from genome-wide association studies (GWAS) on unrelated individuals have shown that for human traits and disease, approximately one-third to two-thirds of heritability is captured by common SNPs 2–5. It is not known whether the remaining heritability is due to the imperfect tagging of causal variants by common SNPs, in particular if the causal variants are rare, or other reasons such as overestimation of heritability from pedigree data. Here we show that pedigree heritability for height and body mass index (BMI) appears to be largely recovered from whole-genome sequence (WGS) data on 25,465 unrelated individuals of European ancestry. We assigned 33.7 million genetic variants to groups based upon their minor allele frequencies (MAF) and linkage disequilibrium (LD) with variants nearby, and estimated and partitioned genetic variance accordingly. The estimated heritability was 0.68 (SE 0.10) for height and 0.30 (SE 0.10) for BMI, with a range of ~0.60 – 0.71 for height and ~0.25 – 0.35 for BMI, depending on quality control and analysis strategies. Low-MAF variants in low LD with neighbouring variants were enriched for heritability, to a greater extent for protein-altering variants, consistent with negative selection thereon. Cumulatively variants with 0.0001 < MAF < 0.1 explained 0.47 (SE 0.07) and 0.30 (SE 0.10) of heritability for height and BMI, respectively. Our results imply that rare variants, in particular those in regions of low LD, is a major source of the still missing heritability of complex traits and disease.