Back to Search
Start Over
Incorporating Chromatin Accessibility Data into Sequence-to-Expression Modeling
- Source :
- Biophysical Journal. (5):1257-1267
- Publisher :
- Biophysical Society. Published by Elsevier Inc.
-
Abstract
- Prediction of gene expression levels from regulatory sequences is one of the major challenges of genomic biology today. A particularly promising approach to this problem is that taken by thermodynamics-based models that interpret an enhancer sequence in a given cellular context specified by transcription factor concentration levels and predict precise expression levels driven by that enhancer. Such models have so far not accounted for the effect of chromatin accessibility on interactions between transcription factor and DNA and consequently on gene-expression levels. Here, we extend a thermodynamics-based model of gene expression, called GEMSTAT (Gene Expression Modeling Based on Statistical Thermodynamics), to incorporate chromatin accessibility data and quantify its effect on accuracy of expression prediction. In the new model, called GEMSTAT-A, accessibility at a binding site is assumed to affect the transcription factor’s binding strength at the site, whereas all other aspects are identical to the GEMSTAT model. We show that this modification results in significantly better fits in a data set of over 30 enhancers regulating spatial expression patterns in the blastoderm-stage Drosophila embryo. It is important to note that the improved fits result not from an overall elevated accessibility in active enhancers but from the variation of accessibility levels within an enhancer. With whole-genome DNA accessibility measurements becoming increasingly popular, our work demonstrates how such data may be useful for sequence-to-expression models. It also calls for future advances in modeling accessibility levels from sequence and the transregulatory context, so as to predict accurately the effect of cis and trans perturbations on gene expression.
- Subjects :
- Biophysics
Context (language use)
Computational biology
Biology
03 medical and health sciences
chemistry.chemical_compound
0302 clinical medicine
Gene expression
Animals
Binding site
Enhancer
Transcription factor
030304 developmental biology
Genetics
Systems Biophysics
0303 health sciences
Models, Genetic
Gene Expression Regulation, Developmental
Chromatin Assembly and Disassembly
Chromatin
chemistry
Regulatory sequence
Thermodynamics
Drosophila
030217 neurology & neurosurgery
DNA
Subjects
Details
- Language :
- English
- ISSN :
- 00063495
- Issue :
- 5
- Database :
- OpenAIRE
- Journal :
- Biophysical Journal
- Accession number :
- edsair.doi.dedup.....f58bf84dfddfdb9d76755c6258d60219
- Full Text :
- https://doi.org/10.1016/j.bpj.2014.12.037