Back to Search Start Over

Deep learning identifies and quantifies recombination hotspot determinants

Authors :
Yu Li
Trisevgeni Rapakoulia
Siyuan Chen
Xin Gao
Hiroyuki Kuwahara
Kevin Y. Yip
Source :
Bioinformatics
Publication Year :
2022
Publisher :
Oxford University Press (OUP), 2022.

Abstract

Motivation Recombination is one of the essential genetic processes for sexually reproducing organisms, which can happen more frequently in some regions, called recombination hotspots. Although several factors, such as PRDM9 binding motifs, are known to be related to the hotspots, their contributions to the recombination hotspots have not been quantified, and other determinants are yet to be elucidated. Here, we propose a computational method, RHSNet, based on deep learning and signal processing, to identify and quantify the hotspot determinants in a purely data-driven manner, utilizing datasets from various studies, populations, sexes and species. Results RHSNet can significantly outperform other sequence-based methods on multiple datasets across different species, sexes and studies. In addition to being able to identify hotspot regions and the well-known determinants accurately, more importantly, RHSNet can quantify the determinants that contribute significantly to the recombination hotspot formation in the relation between PRDM9 binding motif, histone modification and GC content. Further cross-sex, cross-population and cross-species studies suggest that the proposed method has the generalization power and potential to identify and quantify the evolutionary determinant motifs. Availability and implementation https://github.com/frankchen121212/RHSNet. Supplementary information Supplementary data are available at Bioinformatics online.

Details

ISSN :
13674811 and 13674803
Volume :
38
Database :
OpenAIRE
Journal :
Bioinformatics
Accession number :
edsair.doi.dedup.....d9d31a4c39e282e8802bc5d6a1a8e67c
Full Text :
https://doi.org/10.1093/bioinformatics/btac234