Back to Search Start Over

Population genomic analysis of 962 whole genome sequences of humans reveals natural selection in non-coding regions

Authors :
Eric Boerwinkle
Alon Keinan
Xiaoming Liu
Alanna C. Morrison
Haley Hunter-Zinck
Donna M. Muzny
Andrew G. Clark
Ran Blekhman
Andrew D. Johnson
Joshua C. Bis
Jin Yu
Diana Chang
Leonardo Arbiza
Srilakshmi M. Raj
Cris Van Hout
Jian Lu
Bruce M. Psaty
Richard A. Gibbs
Fuli Yu
Elodie Gazave
L. Adrienne Cupples
Source :
PLoS ONE, Vol 10, Iss 3, p e0121644 (2015), PLoS ONE
Publication Year :
2015
Publisher :
Public Library of Science (PLoS), 2015.

Abstract

Whole genome analysis in large samples from a single population is needed to provide adequate power to assess relative strengths of natural selection across different functional components of the genome. In this study, we analyzed next-generation sequencing data from 962 European Americans, and found that as expected approximately 60% of the top 1% of positive selection signals lie in intergenic regions, 33% in intronic regions, and slightly over 1% in coding regions. Several detailed functional annotation categories in intergenic regions showed statistically significant enrichment in positively selected loci when compared to the null distribution of the genomic span of ENCODE categories. There was a significant enrichment of purifying selection signals detected in enhancers, transcription factor binding sites, microRNAs and target sites, but not on lincRNA or piRNAs, suggesting different evolutionary constraints for these domains. Loci in "repressed or low activity regions" and loci near or overlapping the transcription start site were the most significantly over-represented annotations among the top 1% of signals for positive selection.

Details

Language :
English
ISSN :
19326203
Volume :
10
Issue :
3
Database :
OpenAIRE
Journal :
PLoS ONE
Accession number :
edsair.doi.dedup.....b78ebf7a2c2c4e1e3ee15b7932cf1c6c