Author: "Alex Hawkins-Hooker" / Topic: function (mathematics) - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Alex Hawkins-Hooker"' showing total 3 results

Start Over Author "Alex Hawkins-Hooker" Topic function (mathematics)

3 results on '"Alex Hawkins-Hooker"'

1. Generating functional protein variants with variational autoencoders

Author: Alex Hawkins-Hooker, Arthur Chen, David Bikard, Guillaume Couairon, Florence Depardieu, Sebastien Baur, Biologie de Synthèse - Synthetic biology, Institut Pasteur [Paris] (IP), This work was supported by the French Government’s Investissement d’Avenir program and by Laboratoire d’Excellence ‘Integrative Biology of Emerging Infectious Diseases’ (ANR-10-LABX-62-IBEID) to D.B., ANR-10-LABX-0062,IBEID,Integrative Biology of Emerging Infectious Diseases(2010), and Institut Pasteur [Paris]
Subjects: Protein Structure Comparison, Luminescence, [SDV.BIO]Life Sciences [q-bio]/Biotechnology, Computer science, [SDV]Life Sciences [q-bio], Markov models, Variation (game tree), Protein Sequencing, Biochemistry, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Machine Learning, Database and Informatics Methods, 0302 clinical medicine, Protein sequencing, Macromolecular Structure Analysis, Hidden Markov models, Biology (General), chemistry.chemical_classification, 0303 health sciences, Sequence, Functional protein, Physics, Electromagnetic Radiation, Directed evolution, Recombinant Proteins, 3. Good health, Amino acid, Physical Sciences, Oxidoreductases, Photorhabdus, Sequence Analysis, Algorithms, Research Article, Multiple Alignment Calculation, Protein Structure, QH301-705.5, Bioinformatics, Materials Science, Material Properties, Computational biology, Research and Analysis Methods, 03 medical and health sciences, Computational Techniques, Escherichia coli, Computer Simulation, Molecular Biology Techniques, Sequencing Techniques, Molecular Biology, 030304 developmental biology, Structure (mathematical logic), Rational design, Computational Biology, Proteins, Reproducibility of Results, Biology and Life Sciences, Probability theory, Function (mathematics), Protein engineering, Split-Decomposition Method, chemistry, Solubility, Neural Networks, Computer, Sequence Alignment, 030217 neurology & neurosurgery, Mathematics
Abstract: The vast expansion of protein sequence databases provides an opportunity for new protein design approaches which seek to learn the sequence-function relationship directly from natural sequence variation. Deep generative models trained on protein sequence data have been shown to learn biologically meaningful representations helpful for a variety of downstream tasks, but their potential for direct use in the design of novel proteins remains largely unexplored. Here we show that variational autoencoders trained on a dataset of almost 70000 luciferase-like oxidoreductases can be used to generate novel, functional variants of the luxA bacterial luciferase. We propose separate VAE models to work with aligned sequence input (MSA VAE) and raw sequence input (AR-VAE), and offer evidence that while both are able to reproduce patterns of amino acid usage characteristic of the family, the MSA VAE is better able to capture long-distance dependencies reflecting the influence of 3D structure. To confirm the practical utility of the models, we used them to generate variants of luxA whose luminescence activity was validated experimentally. We further showed that conditional variants of both models could be used to increase the solubility of luxA without disrupting function. Altogether 6/12 of the variants generated using the unconditional AR-VAE and 9/11 generated using the unconditional MSA VAE retained measurable luminescence, together with all 23 of the less distant variants generated by conditional versions of the models; the most distant functional variant contained 35 differences relative to the nearest training set sequence. These results demonstrate the feasibility of using deep generative models to explore the space of possible protein sequences and generate useful variants, providing a method complementary to rational design and directed evolution approaches., Author summary The design of novel proteins with specified function and biochemical properties is a longstanding goal in bio-engineering with applications across medicine and nanotechnology. Despite the impressive achievements of traditional approaches, a great deal of scope remains for the development of data-driven methods capable of exploiting the record of natural sequence variation available in protein databases. Deep generative models such as variational autoencoders (VAEs) have shown remarkable success in synthesising realistic data samples across a range of modalities, driving recent interest in developing such models for proteins. However, experimental evidence for the viability of such techniques in practical protein design settings remains scarce. Here we show that VAEs trained on the family of luciferase-like oxidoreductases can be used to generate functional variants of the luxA bacterial luciferase. We compare the use of raw and aligned sequences as input to the model, providing evidence that models trained on aligned data are better able to learn functional constraints. Finally, we demonstrate the possibility of controlling desired properties of the designed sequences, by using conditional versions of the VAE models to increase the solubility of the wild-type luxA sequence from P. luminescens.
Published: 2021
Full Text: View/download PDF

2. Projection layers improve deep learning models of regulatory DNA function

Author: John E. Reid, Alex Hawkins-Hooker, and Henry Kenlay
Subjects: business.industry, Computer science, Feature vector, Deep learning, Artificial intelligence, Function (mathematics), Overfitting, Layer (object-oriented design), business, Representation (mathematics), Algorithm, Projection (linear algebra), Dropout (neural networks)
Abstract: With the increasing application of deep learning methods to the modelling of regulatory DNA sequences has come an interest in exploring what types of architecture are best suited to the domain. Networks designed to predict many functional characteristics of noncoding DNA in a multitask framework have to recognise a large number of motifs and as a result benefit from large numbers of convolutional filters in the first layer. The use of large first layers in turn motivates an exploration of strategies for addressing the sparsity of output and possibility for overfitting that result. To this end we propose the use of a dimensionality-reducing linear projection layer after the initial motif-recognising convolutions. In experiments with a reduced version of the DeepSEA dataset we find that inserting this layer in combination with dropout into convolutional and convolutional-recurrent architectures can improve predictive performance across a range of first layer sizes. We further validate our approach by incorporating the projection layer into a new convolutional-recurrent architecture which achieves state of the art performance on the full DeepSEA dataset. Analysis of the learned projection weights shows that the inclusion of this layer simplifies the network’s internal representation of the occurrence of motifs, notably by projecting features representing forward and reverse-complement motifs to similar positions in the lower dimensional feature space output by the layer.
Published: 2018
Full Text: View/download PDF

3. Projection layers improve deep learning models of regulatory DNA function

Author: John E. Reid, Henry Kenlay, and Alex Hawkins-Hooker
Subjects: 0301 basic medicine, General Immunology and Microbiology, Computer science, business.industry, Deep learning, Feature vector, General Medicine, Function (mathematics), Overfitting, General Biochemistry, Genetics and Molecular Biology, Projection (linear algebra), 03 medical and health sciences, 030104 developmental biology, 0302 clinical medicine, Artificial intelligence, General Pharmacology, Toxicology and Pharmaceutics, Layer (object-oriented design), Representation (mathematics), business, Algorithm, 030217 neurology & neurosurgery, Dropout (neural networks)
Abstract: With the increasing application of deep learning methods to the modelling of regulatory DNA sequences has come an interest in exploring what types of architecture are best suited to the domain. Networks designed to predict many functional characteristics of noncoding DNA in a multitask framework have to recognise a large number of motifs and as a result benefit from large numbers of convolutional filters in the first layer. The use of large first layers in turn motivates an exploration of strategies for addressing the sparsity of output and possibility for overfitting that result. To this end we propose the use of a dimensionality-reducing linear projection layer after the initial motif-recognising convolutions. In experiments with a reduced version of the DeepSEA dataset we find that inserting this layer in combination with dropout into convolutional and convolutional-recurrent architectures can improve predictive performance across a range of first layer sizes. We further validate our approach by incorporating the projection layer into a new convolutional-recurrent architecture which achieves state of the art performance on the full DeepSEA dataset. Analysis of the learned projection weights shows that the inclusion of this layer simplifies the network’s internal representation of the occurrence of motifs, notably by projecting features representing forward and reverse-complement motifs to similar positions in the lower dimensional feature space output by the layer.
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Alex Hawkins-Hooker"'

1. Generating functional protein variants with variational autoencoders

2. Projection layers improve deep learning models of regulatory DNA function

3. Projection layers improve deep learning models of regulatory DNA function

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

Publisher

3 results on '"Alex Hawkins-Hooker"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources