Back to Search
Start Over
Multivariate Hawkes process models of the occurrence of regulatory elements
- Source :
- BMC Bioinformatics, Carstensen, L, Sandelin, A G, Winther, O & Hansen, N R 2010, ' Multivariate Hawkes process models of the occurrence of regulatory elements ', BMC Bioinformatics, vol. 11, pp. 456 . https://doi.org/10.1186/1471-2105-11-456, BMC Bioinformatics, Vol 11, Iss 1, p 456 (2010), Carstensen, L, Sandelin, A, Winther, O & Hansen, N R 2010, ' Multivariate Hawkes process models of the occurrence of regulatory elements ', BMC Bioinformatics, vol. 11 . https://doi.org/10.1186/1471-2105-11-456
- Publication Year :
- 2010
-
Abstract
- Background A central question in molecular biology is how transcriptional regulatory elements (TREs) act in combination. Recent high-throughput data provide us with the location of multiple regulatory regions for multiple regulators, and thus with the possibility of analyzing the multivariate distribution of the occurrences of these TREs along the genome. Results We present a model of TRE occurrences known as the Hawkes process. We illustrate the use of this model by analyzing two different publically available data sets. We are able to model, in detail, how the occurrence of one TRE is affected by the occurrences of others, and we can test a range of natural hypotheses about the dependencies among the TRE occurrences. In contrast to earlier efforts, pre-processing steps such as clustering or binning are not needed, and we thus retain information about the dependencies among the TREs that is otherwise lost. For each of the two data sets we provide two results: first, a qualitative description of the dependencies among the occurrences of the TREs, and second, quantitative results on the favored or avoided distances between the different TREs. Conclusions The Hawkes process is a novel way of modeling the joint occurrences of multiple TREs along the genome that is capable of providing new insights into dependencies among elements involved in transcriptional regulation. The method is available as an R package from http://www.math.ku.dk/~richard/ppstat/.
- Subjects :
- Multivariate statistics
Process modeling
Process (engineering)
Multivariate normal distribution
Computational biology
Biology
computer.software_genre
lcsh:Computer applications to medicine. Medical informatics
01 natural sciences
Biochemistry
Genome
010104 statistics & probability
03 medical and health sciences
Structural Biology
Regulatory Elements, Transcriptional
0101 mathematics
Cluster analysis
Molecular Biology
lcsh:QH301-705.5
030304 developmental biology
0303 health sciences
Models, Genetic
Applied Mathematics
Methodology Article
Contrast (statistics)
Computer Science Applications
Range (mathematics)
lcsh:Biology (General)
Gene Expression Regulation
lcsh:R858-859.7
Data mining
computer
Algorithms
Subjects
Details
- Language :
- English
- ISSN :
- 14712105
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....7dda46949fb99997ae54d488a09db237
- Full Text :
- https://doi.org/10.1186/1471-2105-11-456