Back to Search
Start Over
Heuristics for Reversal Distance Between Genomes with Duplicated Genes
- Source :
- Algorithms for Computational Biology
- Publication Year :
- 2020
-
Abstract
- In comparative genomics, one goal is to find similarities between genomes of different organisms. Comparisons using genome features like genes, gene order, and regulatory sequences are carried out with this purpose in mind. Genome rearrangements are mutational events that affect large extensions of the genome. They are responsible for creating extant species with conserved genes in different positions across genomes. Close species—from an evolutionary point of view—tend to have the same set of genes or share most of them. When we consider gene order to compare two genomes, it is possible to use a parsimony criterion to estimate how close the species are. We are interested in the shortest sequence of genome rearrangements capable of transforming one genome into the other, which is named rearrangement distance. Reversal is one of the most studied genome rearrangements events. This event acts in a segment of the genome, inverting the position and possibly the orientation of genes in it. When the genome has no gene repetition, a common approach is to map it as a permutation such that each element represents a conserved block. When genomes have replicated genes, this mapping is usually performed using strings. The number of replicas depends on the organisms being compared, but in many scenarios, it tends to be small. In this work, we study the reversal distance between genomes with duplicated genes considering that the orientation of genes is unknown. We present three heuristics that use techniques like genetic algorithms and local search. We conduct experiments using a database of simulated genomes and compared our results with other algorithms from the literature.
- Subjects :
- Reversal
Heuristics
Genome rearrangement
Duplicated genes
Article
Subjects
Details
- Language :
- English
- Volume :
- 12099
- Database :
- OpenAIRE
- Journal :
- Algorithms for Computational Biology
- Accession number :
- edsair.pmc...........4b58726126a91b51b450be66dd99dfeb