Back to Search
Start Over
PredTAD: A machine learning framework that models 3D chromatin organization alterations leading to oncogene dysregulation in breast cancer cell lines
- Source :
- Computational and Structural Biotechnology Journal, Vol 19, Iss, Pp 2870-2880 (2021), Computational and Structural Biotechnology Journal
- Publication Year :
- 2021
- Publisher :
- Elsevier, 2021.
-
Abstract
- Graphical abstract<br />Topologically associating domains, or TADs, play important roles in genome organization and gene regulation; however, they are often altered in diseases. High-throughput chromatin conformation capturing assays, such as Hi-C, can capture domains of increased interactions, and TADs and boundaries can be identified using well-established analytical tools. However, generating Hi-C data is expensive. In our study, we addressed the relationship between multi-omics data and higher-order chromatin structures using a newly developed machine-learning model called PredTAD. Our tool uses already-available and cost-effective datatypes such as transcription factor and histone modification ChIPseq data. Specifically, PredTAD utilizes both epigenetic and genetic features as well as neighboring information to classify the entire human genome as boundary or non-boundary regions. Our tool can predict boundary changes between normal and breast cancer genomes. Among the most important features for predicting boundary alterations were CTCF, subunits of cohesin (RAD21 and SMC3), and chromosome number, suggesting their roles in conserved and dynamic boundaries formation. Upon further analysis, we observed that genes near altered TAD boundaries were found to be involved in several important breast cancer signaling pathways such as Ras, Jak-STAT, and estrogen signaling pathways. We also discovered a TAD boundary alteration that contributes to RET oncogene overexpression. PredTAD can also successfully predict TAD boundary changes in other conditions and diseases. In conclusion, our newly developed machine learning tool allowed for a more complete understanding of the dynamic 3D chromatin structures involved in signaling pathway activation, altered gene expression, and disease state in breast cancer cells.
- Subjects :
- 3D chromatin architecture
Biophysics
Biology
Machine learning
computer.software_genre
Biochemistry
Genome
Computational biology
03 medical and health sciences
0302 clinical medicine
Breast cancer
Structural Biology
Chromatin organization
Genetics
Epigenetics
Transcription factor
ComputingMethodologies_COMPUTERGRAPHICS
030304 developmental biology
Regulation of gene expression
0303 health sciences
business.industry
TAD boundaries
Computer Science Applications
Chromatin
Estrogen signaling pathways
Histone
CTCF
030220 oncology & carcinogenesis
biology.protein
Human genome
Artificial intelligence
Topologically associated domains
business
computer
TP248.13-248.65
Research Article
Biotechnology
Subjects
Details
- Language :
- English
- ISSN :
- 20010370
- Volume :
- 19
- Database :
- OpenAIRE
- Journal :
- Computational and Structural Biotechnology Journal
- Accession number :
- edsair.doi.dedup.....d5dce49b981881a51f9091ce199b839a