Back to Search Start Over

An expansive human regulatory lexicon encoded in transcription factor footprints

Authors :
Michael J. MacCoss
Mark Groudine
Molly Weaver
Tanya Kutyavin
John A. Stamatoyannopoulos
Shane Neph
Daniel Bates
Eric Rynes
Benjamin Vernot
Douglas Dunn
Anthony Schafer
Joshua M. Akey
Peter J. Sabo
Jeff Vierstra
Robert E. Thurman
R. Scott Hansen
Andrew B. Stergachis
Rajinder Kaul
Shinny Vong
Alex Reynolds
Theresa K. Canfield
Richard Humbert
Erika Giste
Jun Neri
Michaël Bender
Miaohua Zhang
Kristen Lee
Rachel Byron
Richard Sandstrom
Sam John
Eric Haugen
Gayathri Balasundaram
Vaughn Roach
Morgan Diegel
Hao Wang
Audra K. Johnson
Matthew T. Maurano
Source :
Nature
Publication Year :
2012

Abstract

Regulatory factor binding to genomic DNA protects the underlying sequence from cleavage by DNase I, leaving nucleotide-resolution footprints. Using genomic DNase I footprinting across 41 diverse cell and tissue types, we detected 45 million transcription factor occupancy events within regulatory regions, representing differential binding to 8.4 million distinct short sequence elements. Here we show that this small genomic sequence compartment, roughly twice the size of the exome, encodes an expansive repertoire of conserved recognition sequences for DNA-binding proteins that nearly doubles the size of the human cis-regulatory lexicon. We find that genetic variants affecting allelic chromatin states are concentrated in footprints, and that these elements are preferentially sheltered from DNA methylation. High-resolution DNase I cleavage patterns mirror nucleotide-level evolutionary conservation and track the crystallographic topography of protein-DNA interfaces, indicating that transcription factor structure has been evolutionarily imprinted on the human genome sequence. We identify a stereotyped 50-base-pair footprint that precisely defines the site of transcript origination within thousands of human promoters. Finally, we describe a large collection of novel regulatory factor recognition motifs that are highly conserved in both sequence and function, and exhibit cell-selective occupancy patterns that closely parallel major regulators of development, differentiation and pluripotency.

Details

Language :
English
ISSN :
14764687 and 00280836
Volume :
489
Issue :
7414
Database :
OpenAIRE
Journal :
Nature
Accession number :
edsair.doi.dedup.....d2ebac24fcab42efcc9708a7089a2d6a