Back to Search Start Over

HypercubeME: two hundred million combinatorially complete datasets from a single experiment.

Authors :
Esteban LA
Lonishin LR
Bobrovskiy D
Leleytner G
Bogatyreva NS
Kondrashov FA
Ivankov DN
Source :
Bioinformatics (Oxford, England) [Bioinformatics] 2019 Nov 19. Date of Electronic Publication: 2019 Nov 19.
Publication Year :
2019
Publisher :
Ahead of Print

Abstract

Motivation: Epistasis, the context-dependence of the contribution of an amino acid substitution to fitness, is common in evolution. To detect epistasis, fitness must be measured for at least four genotypes: the reference genotype, two different single mutants and a double mutant with both of the single mutations. For higher-order epistasis of the order n, fitness has to be measured for all 2n genotypes of an n-dimensional hypercube in genotype space forming a "combinatorially complete dataset". So far, only a handful of such datasets have been produced by manual curation. Concurrently, random mutagenesis experiments have produced measurements of fitness and other phenotypes in a high-throughput manner, potentially containing a number of combinatorially complete datasets.<br />Results: We present an effective recursive algorithm for finding all hypercube structures in random mutagenesis experimental data. To test the algorithm, we applied it to the data from a recent HIS3 protein dataset and found all 199,847,053 unique combinatorially complete genotype combinations of dimensionality ranging from two to twelve. The algorithm may be useful for researchers looking for higher-order epistasis in their high-throughput experimental data.<br />Availability: https://github.com/ivankovlab/HypercubeME.git.<br />Supplementary Information: Supplementary data are available at Bioinformatics online.<br /> (© The Author(s) 2019. Published by Oxford University Press.)

Details

Language :
English
ISSN :
1367-4811
Database :
MEDLINE
Journal :
Bioinformatics (Oxford, England)
Publication Type :
Academic Journal
Accession number :
31742320
Full Text :
https://doi.org/10.1093/bioinformatics/btz841