Back to Search Start Over

Expression reflects population structure.

Authors :
Brown, Brielin C.
Bray, Nicolas L.
Pachter, Lior
Source :
PLoS Genetics; 12/19/2018, Vol. 14 Issue 12, p1-15, 15p
Publication Year :
2018

Abstract

Population structure in genotype data has been extensively studied, and is revealed by looking at the principal components of the genotype matrix. However, no similar analysis of population structure in gene expression data has been conducted, in part because a naïve principal components analysis of the gene expression matrix does not cluster by population. We identify a linear projection that reveals population structure in gene expression data. Our approach relies on the coupling of the principal components of genotype to the principal components of gene expression via canonical correlation analysis. Our method is able to determine the significance of the variance in the canonical correlation projection explained by each gene. We identify 3,571 significant genes, only 837 of which had been previously reported to have an associated eQTL in the GEUVADIS results. We show that our projections are not primarily driven by differences in allele frequency at known cis-eQTLs and that similar projections can be recovered using only several hundred randomly selected genes and SNPs. Finally, we present preliminary work on the consequences for eQTL analysis. We observe that using our projection co-ordinates as covariates results in the discovery of slightly fewer genes with eQTLs, but that these genes replicate in GTEx matched tissue at a slightly higher rate. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15537390
Volume :
14
Issue :
12
Database :
Complementary Index
Journal :
PLoS Genetics
Publication Type :
Academic Journal
Accession number :
133634482
Full Text :
https://doi.org/10.1371/journal.pgen.1007841