31 results on '"Hannah K. Wayment-Steele"'
Search Results
2. Deep learning models for predicting RNA degradation via dual crowdsourcing.
- Author
-
Hannah K. Wayment-Steele, Wipapat Kladwang, Andrew M. Watkins, Do Soon Kim, Bojan Tunguz, Walter Reade, Maggie Demkin, Jonathan Romano, Roger Wellington-Oguri, John J. Nicol, Jiayang Gao, Kazuki Onodera, Kazuki Fujikawa, Hanfei Mao, Gilles Vandewiele, Michele Tinti, Bram Steenwinckel, Takuya Ito, Taiga Noumi, Shujun He, Keiichiro Ishi, Youhan Lee, Fatih öztürk, King Yuen Chiu, Emin öztürk, Karim Amer, Mohamed Fares, and Rhiju Das
- Published
- 2022
- Full Text
- View/download PDF
3. Predictive models of RNA degradation through dual crowdsourcing.
- Author
-
Hannah K. Wayment-Steele, Wipapat Kladwang, Andrew M. Watkins, Do Soon Kim, Bojan Tunguz, Walter Reade, Maggie Demkin, Jonathan Romano, Roger Wellington-Oguri, John J. Nicol, Jiayang Gao, Kazuki Onodera, Kazuki Fujikawa, Hanfei Mao, Gilles Vandewiele, Michele Tinti, Bram Steenwinckel, Takuya Ito, Taiga Noumi, Shujun He, Keiichiro Ishi, Youhan Lee, Fatih öztürk, Anthony Chiu, Emin öztürk, Karim Amer, Mohamed Fares, Eterna Participants, and Rhiju Das
- Published
- 2021
4. RNA secondary structure packages evaluated and improved by high-throughput experiments
- Author
-
Hannah K, Wayment-Steele, Wipapat, Kladwang, Alexandra I, Strom, Jeehyung, Lee, Adrien, Treuille, Alex, Becka, and Rhiju, Das
- Subjects
Humans ,Nucleic Acid Conformation ,RNA ,Thermodynamics ,Cell Biology ,Molecular Biology ,Biochemistry ,Algorithms ,Protein Structure, Secondary ,Article ,Biotechnology - Abstract
Despite the popularity of computer-aided study and design of RNA molecules, little is known about the accuracy of commonly used structure modeling packages in tasks sensitive to ensemble properties of RNA. Here, we demonstrate that the EternaBench dataset, a set of more than 20,000 synthetic RNA constructs designed on the RNA design platform Eterna, provides incisive discriminative power in evaluating current packages in ensemble-oriented structure prediction tasks. We find that CONTRAfold and RNAsoft, packages with parameters derived through statistical learning, achieve consistently higher accuracy than more widely used packages in their standard settings, which derive parameters primarily from thermodynamic experiments. We hypothesized that training a multitask model with the varied data types in EternaBench might improve inference on ensemble-based prediction tasks. Indeed, the resulting model, named EternaFold, demonstrated improved performance that generalizes to diverse external datasets including complete messenger RNAs, viral genomes probed in human cells and synthetic designs modeling mRNA vaccines.
- Published
- 2022
- Full Text
- View/download PDF
5. Note: Variational Encoding of Protein Dynamics Benefits from Maximizing Latent Autocorrelation.
- Author
-
Hannah K. Wayment-Steele and Vijay S. Pande
- Published
- 2018
6. Prediction of multiple conformational states by combining sequence clustering with AlphaFold2
- Author
-
Hannah K. Wayment-Steele, Sergey Ovchinnikov, Lucy Colwell, and Dorothee Kern
- Abstract
AlphaFold2 (AF2) has revolutionized structural biology by accurately predicting single structures of proteins and protein-protein complexes. However, biological function is rooted in a protein’s ability to sample different conformational substates, and disease-causing point mutations are often due to population changes of these substates. This has sparked immense interest in expanding AF2’s capability to predict conformational substates. We demonstrate that clustering an input multiple sequence alignment (MSA) by sequence similarity enables AF2 to sample alternate states of known metamorphic proteins, including the circadian rhythm protein KaiB, the transcription factor RfaH, and the spindle checkpoint protein Mad2, and score these states with high confidence. Moreover, we use AF2 to identify a minimal set of two point mutations predicted to switch KaiB between its two states. Finally, we used our clustering method, AF-cluster, to screen for alternate states in protein families without known fold-switching, and identified a putative alternate state for the oxidoreductase DsbE. Similarly to KaiB, DsbE is predicted to switch between a thioredoxin-like fold and a novel fold. This prediction is the subject of future experimental testing. Further development of such bioinformatic methods in tandem with experiments will likely have profound impact on predicting protein energy landscapes, essential for shedding light into biological function.
- Published
- 2022
- Full Text
- View/download PDF
7. Theoretical basis for stabilizing messenger RNA through secondary structure design
- Author
-
Andrew M. Watkins, Po-Ssu Huang, Hannah K. Wayment-Steele, John J Nicol, Eterna Participants, Roger Wellington-Oguri, Do Soon Kim, Rhiju Das, Christian A Choe, and R Andres Parra Sperberg
- Subjects
Untranslated region ,RNA Stability ,Messenger RNA ,Computer science ,Base pair ,AcademicSubjects/SCI00010 ,Rational design ,RNA ,Translation (biology) ,Computational biology ,Biology ,RNA hydrolysis ,Article ,Narese/24 ,Genetics ,RNA and RNA-protein complexes ,Target protein ,Nucleic acid structure ,Protein secondary structure - Abstract
RNA hydrolysis presents problems in manufacturing, long-term storage, world-wide delivery, and in vivo stability of messenger RNA (mRNA)-based vaccines and therapeutics. A largely unexplored strategy to reduce mRNA hydrolysis is to redesign RNAs to form double-stranded regions, which are protected from in-line cleavage and enzymatic degradation, while coding for the same proteins. The amount of stabilization that this strategy can deliver and the most effective algorithmic approach to achieve stabilization remain poorly understood. Here, we present simple calculations for estimating RNA stability against hydrolysis, and a model that links the average unpaired probability of an mRNA, or AUP, to its overall hydrolysis rate. To characterize the stabilization achievable through structure design, we compare AUP optimization by conventional mRNA design methods to results from more computationally sophisticated algorithms and crowdsourcing through the OpenVaccine challenge on the Eterna platform. These computational tests were carried out on both model mRNAs and COVID-19 mRNA vaccine candidates. We find that rational design on Eterna and the more sophisticated algorithms lead to constructs with low AUP, which we term ‘superfolder’ mRNAs. These designs exhibit wide diversity of sequence and structure features that may be desirable for translation, biophysical size, and immunogenicity, and their folding is robust to temperature, choice of flanking untranslated regions, and changes in target protein sequence, as illustrated by rapid redesign of superfolder mRNAs for B.1.351, P.1, and B.1.1.7 variants of the prefusion-stabilized SARS-CoV-2 spike protein. Increases in in vitro mRNA half-life by at least two-fold appear immediately achievable.Significance statementMessenger RNA (mRNA) medicines that encode and promote translation of a target protein have shown promising use as vaccines in the current SARS-CoV-2 pandemic as well as infectious diseases due to their speed of design and manufacturing. However, these molecules are intrinsically prone to hydrolysis, leading to poor stability in aqueous buffer and major challenges in distribution. Here, we present a principled biophysical model for predicting RNA degradation, and demonstrate that the stability of any mRNA can be increased at least two-fold over conventional design techniques. Furthermore, the predicted stabilization is robust to post-design modifications. This conceptual framework and accompanying algorithm can be immediately deployed to guide re-design of mRNA vaccines and therapeutics to increase in vitro stability.
- Published
- 2021
8. Crowdsourced RNA design discovers diverse, reversible, efficient, self-contained molecular switches
- Author
-
Johan O L, Andreasson, Michael R, Gotrik, Michelle J, Wu, Hannah K, Wayment-Steele, Wipapat, Kladwang, Fernando, Portela, Roger, Wellington-Oguri, Rhiju, Das, and William J, Greenleaf
- Subjects
ComputingMethodologies_PATTERNRECOGNITION ,Multidisciplinary ,Crowdsourcing ,RNA - Abstract
Internet-based scientific communities promise a means to apply distributed, diverse human intelligence toward previously intractable scientific problems. However, current implementations have not allowed communities to propose experiments to test all emerging hypotheses at scale or to modify hypotheses in response to experiments. We report high-throughput methods for molecular characterization of nucleic acids that enable the large-scale video game–based crowdsourcing of RNA sensor design, followed by high-throughput functional characterization. Iterative design testing of thousands of crowdsourced RNA sensor designs produced near–thermodynamically optimal and reversible RNA switches that act as self-contained molecular sensors and couple five distinct small molecule inputs to three distinct protein binding and fluorogenic outputs. This work suggests a paradigm for widely distributed experimental bioscience.
- Published
- 2022
- Full Text
- View/download PDF
9. RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look
- Author
-
Ivan N Zheludev, Ramya Rangan, Hannah K. Wayment-Steele, Edward A. Pham, Rachel J. Hagey, Jeffrey S. Glenn, and Rhiju Das
- Subjects
Untranslated region ,0303 health sciences ,biology ,030302 biochemistry & molecular biology ,RNA ,Rfam ,Sequence alignment ,Computational biology ,Non-coding RNA ,biology.organism_classification ,Genome ,Nucleic acid secondary structure ,03 medical and health sciences ,Molecular Biology ,Betacoronavirus ,030304 developmental biology - Abstract
As the COVID-19 outbreak spreads, there is a growing need for a compilation of conserved RNA genome regions in the SARS-CoV-2 virus along with their structural propensities to guide development of antivirals and diagnostics. Here we present a first look at RNA sequence conservation and structural propensities in the SARS-CoV-2 genome. Using sequence alignments spanning a range of betacoronaviruses, we rank genomic regions by RNA sequence conservation, identifying 79 regions of length at least 15 nt as exactly conserved over SARS-related complete genome sequences available near the beginning of the COVID-19 outbreak. We then confirm the conservation of the majority of these genome regions across 739 SARS-CoV-2 sequences subsequently reported from the COVID-19 outbreak, and we present a curated list of 30 “SARS-related-conserved” regions. We find that known RNA structured elements curated as Rfam families and in prior literature are enriched in these conserved genome regions, and we predict additional conserved, stable secondary structures across the viral genome. We provide 106 “SARS-CoV-2-conserved-structured” regions as potential targets for antivirals that bind to structured RNA. We further provide detailed secondary structure models for the extended 5′ UTR, frameshifting stimulation element, and 3′ UTR. Lastly, we predict regions of the SARS-CoV-2 viral genome that have low propensity for RNA secondary structure and are conserved within SARS-CoV-2 strains. These 59 “SARS-CoV-2-conserved-unstructured” genomic regions may be most easily accessible by hybridization in primer-based diagnostic strategies.
- Published
- 2020
- Full Text
- View/download PDF
10. Deep learning models for predicting RNA degradation via dual crowdsourcing
- Author
-
Hannah K, Wayment-Steele, Wipapat, Kladwang, Andrew M, Watkins, Do Soon, Kim, Bojan, Tunguz, Walter, Reade, Maggie, Demkin, Jonathan, Romano, Roger, Wellington-Oguri, John J, Nicol, Jiayang, Gao, Kazuki, Onodera, Kazuki, Fujikawa, Hanfei, Mao, Gilles, Vandewiele, Michele, Tinti, Bram, Steenwinckel, Takuya, Ito, Taiga, Noumi, Shujun, He, Keiichiro, Ishi, Youhan, Lee, Fatih, Öztürk, King Yuen, Chiu, Emin, Öztürk, Karim, Amer, Mohamed, Fares, and Rhiju, Das
- Subjects
Human-Computer Interaction ,Artificial Intelligence ,Computer Networks and Communications ,Computer Vision and Pattern Recognition ,Software - Abstract
Medicines based on messenger RNA (mRNA) hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a key task in designing more stable RNA-based therapeutics. Here, we describe a crowdsourced machine learning competition (‘Stanford OpenVaccine’) on Kaggle, involving single-nucleotide resolution measurements on 6,043 diverse 102–130-nucleotide RNA constructs that were themselves solicited through crowdsourcing on the RNA design platform Eterna. The entire experiment was completed in less than 6 months, and 41% of nucleotide-level predictions from the winning model were within experimental error of the ground truth measurement. Furthermore, these models generalized to blindly predicting orthogonal degradation data on much longer mRNA molecules (504–1,588 nucleotides) with improved accuracy compared with previously published models. These results indicate that such models can represent in-line hydrolysis with excellent accuracy, supporting their use for designing stabilized messenger RNAs. The integration of two crowdsourcing platforms, one for dataset creation and another for machine learning, may be fruitful for other urgent problems that demand scientific discovery on rapid timescales.
- Published
- 2021
11. Redesigning the Eterna100 for the Vienna 2 folding engine
- Author
-
Eterna Structure Designers, Boris Rudolfs, Rohan V. Koodli, Hannah K. Wayment-Steele, and Rhiju Das
- Subjects
Artificial neural network ,business.industry ,Computer science ,media_common.quotation_subject ,Rational design ,RNA ,Benchmarking ,Folding (DSP implementation) ,Machine learning ,computer.software_genre ,Nucleic acid secondary structure ,Benchmark (computing) ,Artificial intelligence ,business ,Function (engineering) ,computer ,media_common - Abstract
The rational design of RNA is becoming important for rapidly developing technologies in medicine and biochemistry. Recent work has led to the development of several RNA secondary structure design algorithms and corresponding benchmarks to evaluate their performance. However, the performance of these algorithms is linked to the nature of the underlying algorithms for predicting secondary structure from sequences. Here, we show that an online community of RNA design experts is capable of modifying an existing RNA secondary structure design benchmark (Eterna100) with minimal alterations to address changes in the folding engine used (Vienna 1.8 updated to Vienna 2.4). We tested this new Eterna100-V2 benchmark with five RNA design algorithms, and found that neural network-based methods exhibited reduced performance in the folding engine they were evaluated on in their respective papers. We investigated this discrepancy, and determined that structural features, previously classified as difficult, may be dependent on parameters inherent to the RNA energy function itself. These findings suggest that for optimal performance, future algorithms should focus on finding strategies capable of solving RNA secondary structure design benchmarks independently of the free energy benchmark used. Eterna100-V1 and Eterna100-V2 benchmarks and example solutions are freely available at https://github.com/eternagame/eterna100-benchmarking.
- Published
- 2021
- Full Text
- View/download PDF
12. A modular DNA scaffold to study protein–protein interactions at single-molecule resolution
- Author
-
Vijay S. Pande, Jing L. Wang, Terence R. Strick, Dorota Kostrz, Hannah K. Wayment-Steele, Charlie Gosse, Maryne Follenfant, Institut de biologie de l'ENS Paris (UMR 8197/1024) (IBENS), Département de Biologie - ENS Paris, École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Institut de biologie de l'ENS Paris (IBENS), Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Département de Biologie - ENS Paris, and Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)
- Subjects
Models, Molecular ,Scaffold ,Biomedical Engineering ,Chemical biology ,DNA, Single-Stranded ,Bioengineering ,Tacrolimus Binding Protein 1A ,02 engineering and technology ,010402 general chemistry ,01 natural sciences ,Protein–protein interaction ,chemistry.chemical_compound ,Protein Interaction Mapping ,Tweezers ,Animals ,Humans ,Nanotechnology ,[CHIM]Chemical Sciences ,General Materials Science ,Electrical and Electronic Engineering ,Sirolimus ,Drug discovery ,TOR Serine-Threonine Kinases ,Force spectroscopy ,DNA ,021001 nanoscience & nanotechnology ,Condensed Matter Physics ,Atomic and Molecular Physics, and Optics ,Nanostructures ,0104 chemical sciences ,3. Good health ,Energy profile ,chemistry ,0210 nano-technology ,Biological system - Abstract
International audience; The residence time of a drug on its target has been suggested as a more pertinent metric of therapeutic efficacy than the traditionally used affinity constant. Here we introduce junctured-DNA (J-DNA) tweezers as a generic platform which enables real-time observation, at the single-molecule level, of biomolecular interactions. This tool corresponds to a double-strand DNA scaffold which can be nanomanipulated and on which proteins of interest can be engrafted thanks to widely-used genetic tagging strategies. Thus, J-DNA tweezers allow straightforward and robust access to single molecule force spectroscopy in drug discovery, and more generally in biophysics. Proof-of-principle experiments are provided for the rapamycin-mediated association between FKBP12 and FRB, a system relevant in both medicine and chemical biology. Individual interactions were monitored under a range of applied forces and temperatures, yielding after analysis the characteristic features of the energy profile along the dissociation landscape.
- Published
- 2019
- Full Text
- View/download PDF
13. Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics
- Author
-
Kathrin Leppek, Gun Woo Byeon, Wipapat Kladwang, Hannah K. Wayment-Steele, Craig H. Kerr, Adele F. Xu, Do Soon Kim, Ved V. Topkar, Christian Choe, Daphna Rothschild, Gerald C. Tiu, Roger Wellington-Oguri, Kotaro Fujii, Eesha Sharma, Andrew M. Watkins, John J. Nicol, Jonathan Romano, Bojan Tunguz, Fernando Diaz, Hui Cai, Pengbo Guo, Jiewei Wu, Fanyu Meng, Shuai Shi, Eterna Participants, Philip R. Dormitzer, Alicia Solórzano, Maria Barna, and Rhiju Das
- Subjects
mRNA translation ,RNA Stability ,Stability (learning theory) ,General Physics and Astronomy ,Computational biology ,polysome profiling ,Ribosome ,General Biochemistry, Genetics and Molecular Biology ,Article ,Pseudouridine ,chemistry.chemical_compound ,vaccine ,Humans ,RNA, Messenger ,RNA structure ,Eterna ,Messenger RNA ,Multidisciplinary ,Chemistry ,COVID-19 ,RNA ,Translation (biology) ,General Chemistry ,CDS sequence variants ,in-line RNA structure probing ,MRNA Instability ,Combinatorial optimization - Abstract
Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop an RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that highly structured “superfolder” mRNAs can be designed to improve both stability and expression with further enhancement through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.
- Published
- 2021
14. RNA secondary structure packages evaluated and improved by high-throughput experiments
- Author
-
Eterna Participants, Hannah K. Wayment-Steele, Rhiju Das, and Wipapat Kladwang
- Subjects
Riboswitch ,Messenger RNA ,business.industry ,Computer science ,RNA ,Machine learning ,computer.software_genre ,Nucleic acid secondary structure ,Range (mathematics) ,Molecule ,Artificial intelligence ,business ,Throughput (business) ,computer - Abstract
The computer-aided study and design of RNA molecules is increasingly prevalent across a range of disciplines, yet little is known about the accuracy of commonly used structure modeling packages in tasks sensitive to ensemble properties of RNA. Here, we demonstrate that the EternaBench dataset, a set of over 20,000 synthetic RNA constructs designed in iterative cycles on the RNA design platform Eterna, provides incisive discriminative power in evaluating current packages in ensemble-oriented structure prediction tasks. We find that CONTRAfold and RNAsoft, packages with parameters derived through statistical learning, achieve consistently higher accuracy than more widely used packages in their standard settings, which derive parameters primarily from thermodynamic experiments. Motivated by these results, we develop a multitask-learning-based model, EternaFold, which demonstrates improved performance that generalizes to diverse external datasets, including complete mRNAs and viral genomes probed in human cells and synthetic designs modeling mRNA vaccines.
- Published
- 2020
- Full Text
- View/download PDF
15. Correction to ‘Theoretical basis for stabilizing messenger RNA through secondary structure design’
- Author
-
Andrew M. Watkins, Roger Wellington-Oguri, Po-Ssu Huang, John J Nicol, Eterna Participants, Do Soon Kim, Rhiju Das, Hannah K. Wayment-Steele, Christian A Choe, and R Andres Parra Sperberg
- Subjects
Messenger RNA ,Base Sequence ,Basis (linear algebra) ,AcademicSubjects/SCI00010 ,SARS-CoV-2 ,Hydrolysis ,RNA Stability ,COVID-19 ,Computational biology ,Biology ,Spike Glycoprotein, Coronavirus ,Genetics ,Humans ,RNA, Viral ,Thermodynamics ,RNA, Messenger ,Corrigendum ,Base Pairing ,Protein secondary structure ,Algorithms ,RNA, Double-Stranded - Abstract
RNA hydrolysis presents problems in manufacturing, long-term storage, world-wide delivery and in vivo stability of messenger RNA (mRNA)-based vaccines and therapeutics. A largely unexplored strategy to reduce mRNA hydrolysis is to redesign RNAs to form double-stranded regions, which are protected from in-line cleavage and enzymatic degradation, while coding for the same proteins. The amount of stabilization that this strategy can deliver and the most effective algorithmic approach to achieve stabilization remain poorly understood. Here, we present simple calculations for estimating RNA stability against hydrolysis, and a model that links the average unpaired probability of an mRNA, or AUP, to its overall hydrolysis rate. To characterize the stabilization achievable through structure design, we compare AUP optimization by conventional mRNA design methods to results from more computationally sophisticated algorithms and crowdsourcing through the OpenVaccine challenge on the Eterna platform. We find that rational design on Eterna and the more sophisticated algorithms lead to constructs with low AUP, which we term 'superfolder' mRNAs. These designs exhibit a wide diversity of sequence and structure features that may be desirable for translation, biophysical size, and immunogenicity. Furthermore, their folding is robust to temperature, computer modeling method, choice of flanking untranslated regions, and changes in target protein sequence, as illustrated by rapid redesign of superfolder mRNAs for B.1.351, P.1 and B.1.1.7 variants of the prefusion-stabilized SARS-CoV-2 spike protein. Increases in in vitro mRNA half-life by at least two-fold appear immediately achievable.
- Published
- 2021
- Full Text
- View/download PDF
16. Crowdsourced RNA design discovers diverse, reversible, efficient, self-contained molecular sensors
- Author
-
Andreasson Jol, William J. Greenleaf, Wipapat Kladwang, Michael R. Gotrik, Fernando Portela, Eterna Participants, Rhiju Das, Roger Wellington-Oguri, Hannah K. Wayment-Steele, and Michelle J Wu
- Subjects
0303 health sciences ,Iterative design ,Human intelligence ,Computer science ,business.industry ,Scale (chemistry) ,RNA ,010402 general chemistry ,Machine learning ,computer.software_genre ,Non-coding RNA ,Crowdsourcing ,01 natural sciences ,Small molecule ,0104 chemical sciences ,03 medical and health sciences ,Nucleic acid ,Artificial intelligence ,business ,computer ,Implementation ,030304 developmental biology - Abstract
Internet-based scientific communities promise a means to apply distributed, diverse human intelligence towards previously intractable scientific problems. However, current implementations have not allowed communities to propose experiments to test all emerging hypotheses at scale or to modify hypotheses in response to experiments. We report high-throughput methods for molecular characterization of nucleic acids that enable the large-scale videogame-based crowdsourcing of functional RNA sensor design, followed by high-throughput functional characterization. Iterative design testing of thousands of crowdsourced RNA sensor designs produced near-thermodynamically optimal and reversible RNA switches that act as self-contained molecular sensors and couple five distinct small molecule inputs to three distinct protein binding and fluorogenic outputs—results that surpass computational and expert-based design. This work represents a new paradigm for widely distributed experimental bioscience.One Sentence SummaryOnline community discovers standalone RNA sensors.
- Published
- 2019
- Full Text
- View/download PDF
17. Evaluating riboswitch optimality
- Author
-
Rhiju Das, Michael R. Gotrik, Hannah K. Wayment-Steele, and Michelle J Wu
- Subjects
Riboswitch ,0303 health sciences ,RNA Folding ,Thermodynamic equilibrium ,Chemistry ,Aptamer ,030303 biophysics ,RNA ,Computational biology ,Biosensing Techniques ,Aptamers, Nucleotide ,Ligand (biochemistry) ,Ligands ,Article ,03 medical and health sciences ,Thermodynamic limit ,CRISPR ,Nucleic Acid Conformation ,Thermodynamics ,Limit (mathematics) - Abstract
Riboswitches are RNA elements that recognize diverse chemical and biomolecular inputs, and transduce this recognition process to genetic, fluorescent, and other engineered outputs using RNA conformational changes. These systems are pervasive in cellular biology and are a promising biotechnology with applications in genetic regulation and biosensing. Here, we derive a simple expression bounding the activation ratio—the proportion of RNA in the active vs. inactive states—for both ON and OFF riboswitches that operate near thermodynamic equilibrium: [Formula: see text] , where [I] is the input ligand concentration and [Formula: see text] is the intrinsic dissociation constant of the aptamer module toward the input ligand. A survey of published studies of natural and synthetic riboswitches confirms that the vast majority of empirically measured activation ratios have remained well below this thermodynamic limit. A few natural and synthetic riboswitches achieve activation ratios close to the limit, and these molecules highlight important principles for achieving high riboswitch performance. For several applications, including “light-up” fluorescent sensors and chemically-controlled CRISPR/Cas complexes, the thermodynamic limit has not yet been achieved, suggesting that current tools are operating at suboptimal efficiencies. Future riboswitch studies will benefit from comparing observed activation ratios to this simple expression for the optimal activation ratio. We present experimental and computational suggestions for how to make these quantitative comparisons and suggest new molecular mechanisms that may allow non-equilibrium riboswitches to surpass the derived limit.
- Published
- 2019
18. Effects of Al3+ on Phosphocholine and Phosphoglycerol Containing Solid Supported Lipid Bilayers
- Author
-
Angelika Kunze, Lewis E. Johnson, Malkiat S. Johal, Marcus J. Swann, Hannah K. Wayment-Steele, Yujia Jing, Björn Agnarsson, and Sofia Svedhem
- Subjects
Phase transition ,Phosphorylcholine ,Lipid Bilayers ,Molecular Conformation ,02 engineering and technology ,Molecular Dynamics Simulation ,01 natural sciences ,Diffusion ,chemistry.chemical_compound ,Molecular dynamics ,Phase (matter) ,0103 physical sciences ,Electrochemistry ,Fluorescence microscope ,General Materials Science ,Lipid bilayer ,Spectroscopy ,Phosphocholine ,Birefringence ,010304 chemical physics ,Surfaces and Interfaces ,Quartz crystal microbalance ,021001 nanoscience & nanotechnology ,Condensed Matter Physics ,Crystallography ,chemistry ,Glycerophosphates ,0210 nano-technology ,Aluminum - Abstract
Aluminum has attracted great attention recently as it has been suggested by several studies to be associated with increased risks for Alzheimer's and Parkinson's disease. The toxicity of the trivalent ion is assumed to derive from structural changes induced in lipid bilayers upon binding, though the mechanism of this process is still not well understood. In the present study we elucidate the effect of Al(3+) on supported lipid bilayers (SLBs) using fluorescence microscopy, the quartz crystal microbalance with dissipation (QCM-D) technique, dual-polarization interferometry (DPI), and molecular dynamics (MD) simulations. Results from these techniques show that binding of Al(3+) to SLBs containing negatively charged and neutral phospholipids induces irreversible changes such as domain formation. The measured variations in SLB thickness, birefringence, and density indicate a phase transition from a disordered to a densely packed ordered phase.
- Published
- 2016
- Full Text
- View/download PDF
19. A Modular DNA Scaffold to Study Protein-Protein Interactions at Single-Molecule Resolution
- Author
-
Maryne Follenfant, Christian G. Specht, Terence R. Strick, Vijay S. Pande, Charlie Gosse, Antoine Triller, Hannah K. Wayment-Steele, Wang Jinglong, and Dorota Kostrz
- Subjects
chemistry.chemical_compound ,Scaffold ,business.industry ,Chemistry ,Resolution (electron density) ,Biophysics ,Molecule ,Modular design ,business ,DNA ,Protein–protein interaction - Published
- 2020
- Full Text
- View/download PDF
20. Modelling Intrinsically Disordered Protein Dynamics as Networks of Transient Secondary Structure
- Author
-
Hannah K. Wayment-Steele, Carlos X. Hernández, and Vijay S. Pande
- Subjects
Molecular dynamics ,Markov chain ,Computer science ,Protein dynamics ,Feature (machine learning) ,Statistical physics ,Intrinsically disordered proteins ,Transcription factor ,Protein secondary structure ,Domain (software engineering) - Abstract
Describing the dynamics and conformational landscapes of Intrinsically Disordered Proteins (IDPs) is of paramount importance to understanding their functions. Markov State Models (MSMs) are often used to characterize the dynamics of more structured proteins, but models of IDPs built using conventional MSM modelling protocols can be difficult to interpret due to the inherent nature of IDPs, which exhibit fast transitions between disordered microstates. We propose a new method of determining MSM states from all-atom molecular dynamics simulation data of IDPs by using per-residue secondary structure assignments as input features in a MSM model. Because such secondary structure algorithms use a select set of features for assignment (dihedral angles, contact distances, etc.), they represent a knowledge-based refinement of feature sets used for model-building. This method adds interpretability to IDP conformational landscapes, which are increasingly viewed as composed of transient secondary structure, and allows us to readily use MSM analysis tools in this paradigm. We demonstrate the use of our method with the transcription factor p53 c-terminal domain (p53-CTD), a commonly-studied IDP. We are able to characterize the full secondary structure phase space observed for p53-CTD, and describe characteristics of p53-CTD as a network of transient helical and beta-hairpin structures with different network behaviors in different domains of secondary structure. This analysis provides a novel example of how IDPs can be studied and how researchers might better understand a disordered protein conformational landscape.
- Published
- 2018
- Full Text
- View/download PDF
21. Transferable neural networks for enhanced sampling of protein dynamics
- Author
-
Vijay S. Pande, Mohammad M. Sultan, and Hannah K. Wayment-Steele
- Subjects
0301 basic medicine ,FOS: Computer and information sciences ,Computer science ,Machine Learning (stat.ML) ,Molecular Dynamics Simulation ,01 natural sciences ,Force field (chemistry) ,03 medical and health sciences ,Statistics - Machine Learning ,0103 physical sciences ,Physical and Theoretical Chemistry ,Alanine ,010304 chemical physics ,Artificial neural network ,Proteins ,Biomolecules (q-bio.BM) ,Dipeptides ,Autoencoder ,Independent component analysis ,Simple extension ,Computer Science Applications ,Linear map ,Nonlinear system ,030104 developmental biology ,Quantitative Biology - Biomolecules ,FOS: Biological sciences ,Embedding ,Biological system - Abstract
Variational auto-encoder frameworks have demonstrated success in reducing complex nonlinear dynamics in molecular simulation to a single non-linear embedding. In this work, we illustrate how this non-linear latent embedding can be used as a collective variable for enhanced sampling, and present a simple modification that allows us to rapidly perform sampling in multiple related systems. We first demonstrate our method is able to describe the effects of force field changes in capped alanine dipeptide after learning a model using AMBER99. We further provide a simple extension to variational dynamics encoders that allows the model to be trained in a more efficient manner on larger systems by encoding the outputs of a linear transformation using time-structure based independent component analysis (tICA). Using this technique, we show how such a model trained for one protein, the WW domain, can efficiently be transferred to perform enhanced sampling on a related mutant protein, the GTT mutation. This method shows promise for its ability to rapidly sample related systems using a single transferable collective variable and is generally applicable to sets of related simulations, enabling us to probe the effects of variation in increasingly large systems of biophysical interest., Comment: 20 pages, 10 figures
- Published
- 2018
- Full Text
- View/download PDF
22. Note: Variational Encoding of Protein Dynamics Benefits from Maximizing Latent Autocorrelation
- Author
-
Vijay S. Pande and Hannah K. Wayment-Steele
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Scale (ratio) ,Computer science ,Protein Conformation ,General Physics and Astronomy ,FOS: Physical sciences ,Machine Learning (stat.ML) ,010402 general chemistry ,Space (mathematics) ,01 natural sciences ,Machine Learning (cs.LG) ,Statistics - Machine Learning ,Encoding (memory) ,Physics - Chemical Physics ,0103 physical sciences ,Physics - Biological Physics ,Physical and Theoretical Chemistry ,Chemical Physics (physics.chem-ph) ,010304 chemical physics ,Artificial neural network ,Protein dynamics ,Autocorrelation ,Proteins ,0104 chemical sciences ,Models, Chemical ,Biological Physics (physics.bio-ph) ,Neural Networks, Computer ,Algorithm - Abstract
As deep Variational Auto-Encoder (VAE) frameworks become more widely used for modeling biomolecular simulation data, we emphasize the capability of the VAE architecture to concurrently maximize the time scale of the latent space while inferring a reduced coordinate, which assists in finding slow processes as according to the variational approach to conformational dynamics. We provide evidence that the VDE framework [Hernandez et al., Phys. Rev. E 97, 062412 (2018)], which uses this autocorrelation loss along with a time-lagged reconstruction loss, obtains a variationally optimized latent coordinate in comparison with related loss functions. We thus recommend leveraging the autocorrelation of the latent space while training neural network models of biomolecular simulation data to better represent slow processes.
- Published
- 2018
- Full Text
- View/download PDF
23. A Minimum Variance Clustering Approach Produces Robust and Interpretable Coarse-Grained Models
- Author
-
Brooke E. Husic, Mohammad M. Sultan, Vijay S. Pande, Keri A. McKiernan, and Hannah K. Wayment-Steele
- Subjects
0301 basic medicine ,Quantitative Biology::Biomolecules ,Protein Folding ,010304 chemical physics ,Markov chain ,Computer science ,Proteins ,Observable ,Molecular Dynamics Simulation ,01 natural sciences ,Microstate (statistical mechanics) ,Markov Chains ,Computer Science Applications ,Hierarchical clustering ,03 medical and health sciences ,030104 developmental biology ,Minimum-variance unbiased estimator ,Similarity (network science) ,0103 physical sciences ,Physical and Theoretical Chemistry ,Cluster analysis ,Divergence (statistics) ,Algorithm ,Algorithms - Abstract
Markov state models (MSMs) are a powerful framework for the analysis of molecular dynamics data sets, such as protein folding simulations, because of their straightforward construction and statistical rigor. The coarse-graining of MSMs into an interpretable number of macrostates is a crucial step for connecting theoretical results with experimental observables. Here we present the minimum variance clustering approach (MVCA) for the coarse-graining of MSMs into macrostate models. The method utilizes agglomerative clustering with Ward's minimum variance objective function, and the similarity of the microstate dynamics is determined using the Jensen-Shannon divergence between the corresponding rows in the MSM transition probability matrix. We first show that MVCA produces intuitive results for a simple tripeptide system and is robust toward long-duration statistical artifacts. MVCA is then applied to two protein folding simulations of the same protein in different force fields to demonstrate that a different number of macrostates is appropriate for each model, revealing a misfolded state present in only one of the simulations. Finally, we show that the same method can be used to analyze a data set containing many MSMs from simulations in different force fields by aggregating them into groups and quantifying their dynamical similarity in the context of force field parameter choices. The minimum variance clustering approach with the Jensen-Shannon divergence provides a powerful tool to group dynamics by similarity, both among model states and among dynamical models themselves.
- Published
- 2017
24. Investigating the role of boundary bricks in DNA brick self-assembly
- Author
-
Hannah K, Wayment-Steele, Daan, Frenkel, and Aleks, Reinhardt
- Subjects
Models, Molecular ,Molecular Conformation ,DNA, Single-Stranded ,Monte Carlo Method - Abstract
In the standard DNA brick set-up, distinct 32-nucleotide strands of single-stranded DNA are each designed to bind specifically to four other such molecules. Experimentally, it has been demonstrated that the overall yield is increased if certain bricks which occur on the outer faces of target structures are merged with adjacent bricks. However, it is not well understood by what mechanism such 'boundary bricks' increase the yield, as they likely influence both the nucleation process and the final stability of the target structure. Here, we use Monte Carlo simulations with a patchy particle model of DNA bricks to investigate the role of boundary bricks in the self-assembly of complex multicomponent target structures. We demonstrate that boundary bricks lower the free-energy barrier to nucleation and that boundary bricks on edges stabilize the final structure. However, boundary bricks are also more prone to aggregation, as they can stabilize partially assembled intermediates. We explore some design strategies that permit us to benefit from the stabilizing role of boundary bricks whilst minimizing their ability to hinder assembly; in particular, we show that maximizing the total number of boundary bricks is not an optimal strategy.
- Published
- 2017
25. Variational Encoding of Complex Dynamics
- Author
-
Brooke E. Husic, Vijay S. Pande, Hannah K. Wayment-Steele, Mohammad M. Sultan, and Carlos X. Hernández
- Subjects
FOS: Computer and information sciences ,Computer science ,FOS: Physical sciences ,Machine Learning (stat.ML) ,010402 general chemistry ,01 natural sciences ,Article ,Statistics - Machine Learning ,Physics - Chemical Physics ,Encoding (memory) ,0103 physical sciences ,Physics - Biological Physics ,Chemical Physics (physics.chem-ph) ,010304 chemical physics ,business.industry ,Deep learning ,Biomolecules (q-bio.BM) ,Computational Physics (physics.comp-ph) ,Autoencoder ,0104 chemical sciences ,Nonlinear system ,Complex dynamics ,Quantitative Biology - Biomolecules ,Biological Physics (physics.bio-ph) ,FOS: Biological sciences ,Brownian dynamics ,Embedding ,Artificial intelligence ,business ,Physics - Computational Physics ,Algorithm ,Encoder - Abstract
Often the analysis of time-dependent chemical and biophysical systems produces high-dimensional time-series data for which it can be difficult to interpret which individual features are most salient. While recent work from our group and others has demonstrated the utility of time-lagged co-variate models to study such systems, linearity assumptions can limit the compression of inherently nonlinear dynamics into just a few characteristic components. Recent work in the field of deep learning has led to the development of variational autoencoders (VAE), which are able to compress complex datasets into simpler manifolds. We present the use of a time-lagged VAE, or variational dynamics encoder (VDE), to reduce complex, nonlinear processes to a single embedding with high fidelity to the underlying dynamics. We demonstrate how the VDE is able to capture nontrivial dynamics in a variety of examples, including Brownian dynamics and atomistic protein folding. Additionally, we demonstrate a method for analyzing the VDE model, inspired by saliency mapping, to determine what features are selected by the VDE model to describe dynamics. The VDE presents an important step in applying techniques from deep learning to more accurately model and interpret complex biophysics., Comment: Fixed typos and added references
- Published
- 2017
- Full Text
- View/download PDF
26. Surface and Stability Characterization of a Nanoporous ZIF-8 Thin Film
- Author
-
Fangyuan Tian, Elizabeth R. Webster, Andrew M. Cerro, Aileen Park, Amber M. Mosier, Ryan S. Shine, Lewis E. Johnson, Malkiat S. Johal, Hannah K. Wayment-Steele, and Lauren Benz
- Subjects
Materials science ,Nanoporous ,Scanning electron microscope ,Analytical chemistry ,Nanoparticle ,Surfaces, Coatings and Films ,Electronic, Optical and Magnetic Materials ,General Energy ,Adsorption ,Chemical engineering ,X-ray photoelectron spectroscopy ,Thermal stability ,Physical and Theoretical Chemistry ,Thin film ,Zeolitic imidazolate framework - Abstract
Zeolitic imidazolate frameworks (ZIFs) have been widely investigated for numerous applications including energy storage, heterogeneous catalysis, and greenhouse gas adsorption. Much of the early work has focused on the bulk properties of microcrystalline ZIFs. Herein, we focus on identifying the nature of the surface of ZIF-8 by studying a supported ZIF-8 nanoparticle film using surface characterization techniques. We have experimentally identified the presence of a zinc-rich surface terminated by carbonates and water/hydroxyl groups (in addition to the expected methylimidazole terminations) using X-ray photoelectron spectroscopy (XPS). The thermal stability of ZIF-8 thin films was also investigated using scanning electron microscopy (SEM) and temperature-programmed reaction spectroscopy (TPRS). We determined the onset of decomposition of ZIF-8 thin films to be approximately 630 K using TPRS in an ultrahigh vacuum (UHV) environment. This work presents the first characterization steps needed to study the e...
- Published
- 2014
- Full Text
- View/download PDF
27. Monitoring of single and double lipid membrane formation with high spatiotemporal resolution using evanescent light scattering microscopy
- Author
-
Angelika Kunze, Fredrik Höök, Björn Agnarsson, and Hannah K. Wayment-Steele
- Subjects
Chemistry ,Vesicle ,Resolution (electron density) ,Analytical chemistry ,02 engineering and technology ,010402 general chemistry ,021001 nanoscience & nanotechnology ,01 natural sciences ,Light scattering ,0104 chemical sciences ,Membrane ,Microscopy ,Biophysics ,lipids (amino acids, peptides, and proteins) ,General Materials Science ,Spatiotemporal resolution ,0210 nano-technology ,Lipid bilayer ,Electrostatic interaction - Abstract
Formation and quality of single solid supported lipid membranes and double lipid membranes were investigated with single vesicle resolution using label-free evanescence light scattering microscopy (EvSM). For the formation of double lipid membranes we made use of electrostatic interaction between charged lipids and oppositely charged cations.
- Published
- 2016
28. Hierarchical Clustering of Markov State Models Reveals Sequence Effects in p53-CTD Dynamic Behavior
- Author
-
Brooke E. Husic, Hannah K. Wayment-Steele, Vijay S. Pande, and Carlos X. Hernández
- Subjects
State model ,Markov chain ,Computer science ,Biophysics ,CTD ,Algorithm ,Hierarchical clustering ,Sequence (medicine) - Published
- 2018
- Full Text
- View/download PDF
29. On the Origins of Regulated Disorder within the C-Terminus of P53
- Author
-
Hannah K. Wayment-Steele, Carlos X. Hernández, and Vijay S. Pande
- Subjects
Stereochemistry ,C-terminus ,Biophysics - Published
- 2018
- Full Text
- View/download PDF
30. Monitoring N3 dye adsorption and desorption on TiO2 surfaces: a combined QCM-D and XPS study
- Author
-
Lewis E. Johnson, Malkiat S. Johal, Lauren Benz, Fangyuan Tian, Hannah K. Wayment-Steele, and Matthew C. Dixon
- Subjects
Materials science ,Tetrabutylammonium hydroxide ,Inorganic chemistry ,Oxide ,Langmuir adsorption model ,Quartz crystal microbalance ,chemistry.chemical_compound ,symbols.namesake ,Dye-sensitized solar cell ,Adsorption ,chemistry ,X-ray photoelectron spectroscopy ,Desorption ,symbols ,General Materials Science - Abstract
Understanding the kinetics of dye adsorption and desorption on semiconductors is crucial for optimizing the performance of dye-sensitized solar cells (DSSCs). Quartz crystal microbalance with dissipation monitoring (QCM-D) measures adsorbed mass in real time, allowing determination of binding kinetics. In this work, we characterize adsorption of the common RuBipy dye N3 to the native oxide layer of a planar, sputter-coated titanium surface, simulating the TiO2 substrate of a DSSC. We report adsorption equilibrium constants consistent with prior optical measurements of N3 adsorption. Dye binding and surface integrity were also verified by scanning electron microscopy, energy-dispersive X-ray spectroscopy, and X-ray photoelectron spectroscopy (XPS). We further study desorption of the dye from the native oxide layer on the QCM sensors using tetrabutylammonium hydroxide (TBAOH), a commonly used industrial desorbant. We find that using TBAOH as a desorbant does not fully regenerate the surface, though little ruthenium or nitrogen is observed by XPS after desorption, suggesting that carboxyl moieties of N3 remain bound. We demonstrate the native oxide layer of a titanium sensor as a valid and readily available planar TiO2 morphology to study dye adsorption and desorption and begin to investigate the mechanism of dye desorption in DSSCs, a system that requires further study.
- Published
- 2014
31. Characterization of N3 dye adsorption on TiO2using quartz-crystal microbalance with dissipation monitoring
- Author
-
Malkiat S. Johal, Lewis E. Johnson, Hannah K. Wayment-Steele, and Matthew C. Dixon
- Subjects
Materials science ,Inorganic chemistry ,Oxide ,chemistry.chemical_element ,Langmuir adsorption model ,Substrate (electronics) ,Quartz crystal microbalance ,Absorbance ,symbols.namesake ,Dye-sensitized solar cell ,chemistry.chemical_compound ,Adsorption ,chemistry ,symbols ,Titanium - Abstract
Understanding the kinetics of dye adsorption on semiconductors is crucial for designing dye-sensitized solar cells (DSSCs) with enhanced efficiency. Harms et al. recently applied the Quartz-Crystal Microbalance with Dissipation Monitoring (QCM-D) to study in situ dye adsorption on flat TiO2 surfaces. QCM-D measures adsorption in real time and therefore allows one to determine the kinetics of the process. In this work, we characterize the adsorption of N3, a commercial RuBipy dye, using the native oxide layer of a titanium sensor to simulate the TiO2 substrate of a DSSC. We report equilibrium constants that are in agreement with previous absorbance studies of N3 adsorption, and therefore demonstrate the native oxide layer of a titanium sensor as a valid and readily available planar TiO2 morphology to study dye adsorption.
- Published
- 2013
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.