Back to Search
Start Over
A unified framework for integrative study of heterogeneous gene regulatory mechanisms
- Source :
- Nature Machine Intelligence. 2:447-456
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- Gene expression is regulated by a large variety of mechanisms. Previous studies attempting to model the quantitative relationships between gene expression levels and regulatory mechanisms have considered only one or a few mechanisms at a time, which cannot provide a full picture of the complex interactions among different mechanisms. This was partially due to the heterogeneity of the mechanisms, which involve different types of biological objects and data representations, making it hard to study them in a unified way. Here, we describe a flexible framework that can integrate very different types of data for studying their joint effects on gene expression. In this framework, domain knowledge is represented by metapaths, while the manifestations of their effects in actual data are summarized by an embedding of the biological objects in a latent space. We demonstrate the use of our framework in integrating several diverse types of data that are related to gene expression in different ways, including DNA contacts in three-dimensional genome architecture, protein–protein interactions, genomic neighbourhoods and broad chromatin accessibility domains. The modelling results reveal that these several types of data are able to model gene expression fairly well individually, but even better when integrated. Gene expression is regulated by a variety of mechanisms, which have been difficult to study in a unified way. The authors propose a flexible framework that can integrate different types of data for studying their joint effects on gene expression. The framework uses a general network representation for data integration, metapaths for inputting prior knowledge of gene regulatory mechanisms, and embedding techniques for capturing complex structures in the data.
- Subjects :
- 0301 basic medicine
Computer Networks and Communications
Computer science
Systems biology
Computational biology
computer.software_genre
Data type
Variety (cybernetics)
Chromatin
Human-Computer Interaction
03 medical and health sciences
030104 developmental biology
0302 clinical medicine
Artificial Intelligence
Domain knowledge
Computer Vision and Pattern Recognition
Representation (mathematics)
computer
Gene
030217 neurology & neurosurgery
Software
Data integration
Subjects
Details
- ISSN :
- 25225839
- Volume :
- 2
- Database :
- OpenAIRE
- Journal :
- Nature Machine Intelligence
- Accession number :
- edsair.doi...........05b7cba3f8ca6565fd688edaac9b5d6f
- Full Text :
- https://doi.org/10.1038/s42256-020-0205-2