41 results on '"Stombaugh J"'
Search Results
2. Motif prediction in ribosomal RNAs Lessons and prospects for automated motif prediction in homologous RNA molecules
- Author
-
Leontis, N.B, Stombaugh, J, and Westhof, E
- Published
- 2002
- Full Text
- View/download PDF
3. Sharing and archiving nucleic acid structure mapping data
- Author
-
Knight, R., Bellaousov, S., Duncan, C. D. S., Das, R., Cordero, P., Rocca-Serra, P., Weeks, K. M., Chen, C., Birmingham, A., Mathews, D. H., Zirbel, C. L., Davis-Neulander, L., Stombaugh, J., Ritz, J., Halvorsen, M., Leontis, N. B., and Laederach, A.
- Abstract
Nucleic acids are particularly amenable to structural characterization using chemical and enzymatic probes. Each individual structure mapping experiment reveals specific information about the structure and/or dynamics of the nucleic acid. Currently, there is no simple approach for making these data publically available in a standardized format. We therefore developed a standard for reporting the results of single nucleotide resolution nucleic acid structure mapping experiments, or SNRNASMs. We propose a schema for sharing nucleic acid chemical probing data that uses generic public servers for storing, retrieving, and searching the data. We have also developed a consistent nomenclature (ontology) within the Ontology of Biomedical Investigations (OBI), which provides unique identifiers (termed persistent URLs, or PURLs) for classifying the data. Links to standardized data sets shared using our proposed format along with a tutorial and links to templates can be found at http://snrnasm.bio.unc.edu.
- Published
- 2011
- Full Text
- View/download PDF
4. Understanding Sequence Variability of RNA Motifs Using Geometric Search and IsoDiscrepancy Matrices.
- Author
-
Petrov, A.I., Stombaugh, J., Zirbel, C.L., and Leontis, N.B.
- Published
- 2009
- Full Text
- View/download PDF
5. The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome
- Author
-
McDonald Daniel, Clemente Jose C, Kuczynski Justin, Rideout Jai, Stombaugh Jesse, Wendel Doug, Wilke Andreas, Huse Susan, Hufnagle John, Meyer Folker, Knight Rob, and Caporaso J
- Subjects
Microbial ecology ,Comparative genomics ,Metagenomics ,QIIME ,MG-RAST ,VAMPS ,BIOM ,Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
Abstract Background We present the Biological Observation Matrix (BIOM, pronounced “biome”) format: a JSON-based file format for representing arbitrary observation by sample contingency tables with associated sample and observation metadata. As the number of categories of comparative omics data types (collectively, the “ome-ome”) grows rapidly, a general format to represent and archive this data will facilitate the interoperability of existing bioinformatics tools and future meta-analyses. Findings The BIOM file format is supported by an independent open-source software project (the biom-format project), which initially contains Python objects that support the use and manipulation of BIOM data in Python programs, and is intended to be an open development effort where developers can submit implementations of these objects in other programming languages. Conclusions The BIOM file format and the biom-format project are steps toward reducing the “bioinformatics bottleneck” that is currently being experienced in diverse areas of biological sciences, and will help us move toward the next phase of comparative omics where basic science is translated into clinical and environmental applications. The BIOM file format is currently recognized as an Earth Microbiome Project Standard, and as a Candidate Standard by the Genomic Standards Consortium.
- Published
- 2012
- Full Text
- View/download PDF
6. An aptamer-mediated base editing platform for simultaneous knockin and multiple gene knockout for allogeneic CAR-T cells generation.
- Author
-
Porreca I, Blassberg R, Harbottle J, Joubert B, Mielczarek O, Stombaugh J, Hemphill K, Sumner J, Pazeraitis D, Touza JL, Francescatto M, Firth M, Selmi T, Collantes JC, Strezoska Z, Taylor B, Jin S, Wiggins CM, van Brabant Smith A, and Lambourne JJ
- Subjects
- Humans, Immunotherapy, Adoptive methods, Receptors, Chimeric Antigen genetics, Receptors, Chimeric Antigen metabolism, Gene Knock-In Techniques methods, Transgenes, Gene Editing methods, Gene Knockout Techniques, CRISPR-Cas Systems, Aptamers, Nucleotide genetics, T-Lymphocytes metabolism, T-Lymphocytes immunology
- Abstract
Gene editing technologies hold promise for enabling the next generation of adoptive cellular therapies. In conventional gene editing platforms that rely on nuclease activity, such as clustered regularly interspaced short palindromic repeats CRISPR-associated protein 9 (CRISPR-Cas9), allow efficient introduction of genetic modifications; however, these modifications occur via the generation of DNA double-strand breaks (DSBs) and can lead to unwanted genomic alterations and genotoxicity. Here, we apply a novel modular RNA aptamer-mediated Pin-point base editing platform to simultaneously introduce multiple gene knockouts and site-specific integration of a transgene in human primary T cells. We demonstrate high editing efficiency and purity at all target sites and significantly reduced frequency of chromosomal translocations compared with the conventional CRISPR-Cas9 system. Site-specific knockin of a chimeric antigen receptor and multiplex gene knockout are achieved within a single intervention and without the requirement for additional sequence-targeting components. The ability to perform complex genome editing efficiently and precisely highlights the potential of the Pin-point platform for application in a range of advanced cell therapies., Competing Interests: Declaration of interests M. Francescatto, M. Firth, J.L.T., D.P., J. Sumner, and B.T. are all current or past (while engaged in the research project) employees of AstraZeneca. I.P., R.B., J.H., B.J., O.M., J. Stombaugh, K.H., T.S., Z.S., C.W., A.v.B.S., and J.J.L. are current or past (while engaged in the research project) employees at Revvity. Revvity has an exclusive license from Rutgers University to certain base editing patents. Rutgers University and Horizon Discovery Limited have filed patent applications on this work., (Copyright © 2024 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
7. A Novel CRISPR Interference Effector Enabling Functional Gene Characterization with Synthetic Guide RNAs.
- Author
-
Mills C, Riching A, Keller A, Stombaugh J, Haupt A, Maksimova E, Dickerson SM, Anderson E, Hemphill K, Ebmeier C, Schiel JA, Levenga J, Perkett M, Smith AVB, and Strezoska Z
- Subjects
- Humans, Gene Editing, CRISPR-Associated Protein 9 genetics, RNA, Guide, CRISPR-Cas Systems, CRISPR-Cas Systems genetics, Induced Pluripotent Stem Cells
- Abstract
While CRISPR interference (CRISPRi) systems have been widely implemented in pooled lentiviral screening, there has been limited use with synthetic guide RNAs for the complex phenotypic readouts enabled by experiments in arrayed format. Here we describe a novel deactivated Cas9 fusion protein, dCas9-SALL1-SDS3, which produces greater target gene repression than first or second generation CRISPRi systems when used with chemically modified synthetic single guide RNAs (sgRNAs), while exhibiting high target specificity. We show that dCas9-SALL1-SDS3 interacts with key members of the histone deacetylase and Swi-independent three complexes, which are the endogenous functional effectors of SALL1 and SDS3. Synthetic sgRNAs can also be used with in vitro -transcribed dCas9-SALL1-SDS3 mRNA for short-term delivery into primary cells, including human induced pluripotent stem cells and primary T cells. Finally, we used dCas9-SALL1-SDS3 for functional gene characterization of DNA damage host factors, orthogonally to small interfering RNA, demonstrating the ability of the system to be used in arrayed-format screening.
- Published
- 2022
- Full Text
- View/download PDF
8. Development and Characterization of a Modular CRISPR and RNA Aptamer Mediated Base Editing System.
- Author
-
Collantes JC, Tan VM, Xu H, Ruiz-Urigüen M, Alasadi A, Guo J, Tao H, Su C, Tyc KM, Selmi T, Lambourne JJ, Harbottle JA, Stombaugh J, Xing J, Wiggins CM, and Jin S
- Subjects
- Animals, Bacteria genetics, Bacteria metabolism, CRISPR-Cas Systems, Green Fluorescent Proteins genetics, HEK293 Cells, Humans, INDEL Mutation, RNA, Guide, CRISPR-Cas Systems genetics, Recombinational DNA Repair, Exome Sequencing, Aptamers, Nucleotide, Clustered Regularly Interspaced Short Palindromic Repeats, Gene Editing, RNA Editing
- Abstract
Conventional CRISPR approaches for precision genome editing rely on the introduction of DNA double-strand breaks (DSB) and activation of homology-directed repair (HDR), which is inherently genotoxic and inefficient in somatic cells. The development of base editing (BE) systems that edit a target base without requiring generation of DSB or HDR offers an alternative. Here, we describe a novel BE system called Pin-point
TM that recruits a DNA base-modifying enzyme through an RNA aptamer within the gRNA molecule. Pin-point is capable of efficiently modifying base pairs in the human genome with precision and low on-target indel formation. This system can potentially be applied for correcting pathogenic mutations, installing premature stop codons in pathological genes, and introducing other types of genetic changes for basic research and therapeutic development.- Published
- 2021
- Full Text
- View/download PDF
9. CRISPR-mediated transcriptional activation with synthetic guide RNA.
- Author
-
Strezoska Ž, Dickerson SM, Maksimova E, Chou E, Gross MM, Hemphill K, Hardcastle T, Perkett M, Stombaugh J, Miller GW, Anderson EM, Vermeulen A, and Smith AVB
- Subjects
- Animals, Aptamers, Nucleotide genetics, HEK293 Cells, Humans, Mice, NIH 3T3 Cells, CRISPR-Cas Systems, Gene Editing methods, RNA, Guide, CRISPR-Cas Systems, Transcriptional Activation genetics
- Abstract
The CRISPR-Cas9 system has been adapted for transcriptional activation (CRISPRa) and several second-generation CRISPRa systems (including VPR, SunTag, and SAM) have been developed to recruit different transcriptional activators to a deactivated Cas9, which is guided to a transcriptional start site via base complementarity with a target guide RNA. Multiple studies have shown the benefit of CRISPRa using plasmid or lentiviral expressed guide RNA, but the use of synthetic guide RNA has not been reported. Here we demonstrate the effective use of synthetic guide RNA for gene activation via CRISPRa. CRISPRa crRNA may be used with a canonical tracrRNA using the VPR or SunTag activation systems or with an extended tracrRNA containing an aptamer sequence for the SAM system. Transcriptional activation with synthetic crRNA:tracrRNA is comparable to activation achieved with expression vectors and combining several crRNA sequences targeting the same gene can enhance transcriptional activation. The use of synthetic crRNA is also ideal for simultaneous activation of multiple genes or use with dCas9-VPR mRNA when viral transduction is not feasible. Here, we perform a proof-of-principle arrayed screen using a CRISPRa crRNA library consisting of 153 cytokine receptor targets to identify regulators of IL-6 cytokine secretion. Together, these results demonstrate the suitability of synthetic CRISPRa guide RNA for high throughput, arrayed screening applications which allow for more complex phenotypic readouts to complement viability and drug resistance assays typically used in a pooled screening format., Competing Interests: Declaration of Competing Interest The authors report no declarations of interest., (Copyright © 2020 Elsevier B.V. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
10. The Power Decoder Simulator for the Evaluation of Pooled shRNA Screen Performance.
- Author
-
Stombaugh J, Licon A, Strezoska Ž, Stahl J, Anderson SB, Banos M, van Brabant Smith A, Birmingham A, and Vermeulen A
- Subjects
- Cell Line, Computer Simulation, Gene Library, High-Throughput Nucleotide Sequencing, Humans, Reproducibility of Results, Models, Biological, RNA Interference, RNA, Small Interfering genetics, Software
- Abstract
RNA interference screening using pooled, short hairpin RNA (shRNA) is a powerful, high-throughput tool for determining the biological relevance of genes for a phenotype. Assessing an shRNA pooled screen's performance is difficult in practice; one can estimate the performance only by using reproducibility as a proxy for power or by employing a large number of validated positive and negative controls. Here, we develop an open-source software tool, the Power Decoder simulator, for generating shRNA pooled screening experiments in silico that can be used to estimate a screen's statistical power. Using the negative binomial distribution, it models both the relative abundance of multiple shRNAs within a single screening replicate and the biological noise between replicates for each individual shRNA. We demonstrate that this simulator can successfully model the data from an actual laboratory experiment. We then use it to evaluate the effects of biological replicates and sequencing counts on the performance of a pooled screen, without the necessity of gathering additional data. The Power Decoder simulator is written in R and Python and is available for download under the GNU General Public License v3.0., (© 2015 Society for Laboratory Automation and Screening.)
- Published
- 2015
- Full Text
- View/download PDF
11. Meta-analyses of studies of the human microbiota.
- Author
-
Lozupone CA, Stombaugh J, Gonzalez A, Ackermann G, Wendel D, Vázquez-Baeza Y, Jansson JK, Gordon JI, and Knight R
- Subjects
- Adult, Aging, Bacteria genetics, Biodiversity, Crohn Disease epidemiology, Crohn Disease genetics, Crohn Disease microbiology, Female, Humans, Infant, Metagenome, Pregnancy, Pregnancy Trimester, First, Sequence Analysis, DNA, Bacteria classification, DNA, Bacterial genetics, Feces microbiology, Metagenomics methods, Microbiota, RNA, Ribosomal, 16S genetics
- Abstract
Our body habitat-associated microbial communities are of intense research interest because of their influence on human health. Because many studies of the microbiota are based on the same bacterial 16S ribosomal RNA (rRNA) gene target, they can, in principle, be compared to determine the relative importance of different disease/physiologic/developmental states. However, differences in experimental protocols used may produce variation that outweighs biological differences. By comparing 16S rRNA gene sequences generated from diverse studies of the human microbiota using the QIIME database, we found that variation in composition of the microbiota across different body sites was consistently larger than technical variability across studies. However, samples from different studies of the Western adult fecal microbiota generally clustered by study, and the 16S rRNA target region, DNA extraction technique, and sequencing platform produced systematic biases in observed diversity that could obscure biologically meaningful compositional differences. In contrast, systematic compositional differences in the fecal microbiota that occurred with age and between Western and more agrarian cultures were great enough to outweigh technical variation. Furthermore, individuals with ileal Crohn's disease and in their third trimester of pregnancy often resembled infants from different studies more than controls from the same study, indicating parallel compositional attributes of these distinct developmental/physiological/disease states. Together, these results show that cross-study comparisons of human microbiota are valuable when the studied parameter has a large effect size, but studies of more subtle effects on the human microbiota require carefully selected control populations and standardized protocols.
- Published
- 2013
- Full Text
- View/download PDF
12. Widespread colonization of the lung by Tropheryma whipplei in HIV infection.
- Author
-
Lozupone C, Cota-Gomez A, Palmer BE, Linderman DJ, Charlson ES, Sodergren E, Mitreva M, Abubucker S, Martin J, Yao G, Campbell TB, Flores SC, Ackerman G, Stombaugh J, Ursell L, Beck JM, Curtis JL, Young VB, Lynch SV, Huang L, Weinstock GM, Knox KS, Twigg H, Morris A, Ghedin E, Bushman FD, Collman RG, Knight R, and Fontenot AP
- Subjects
- Cohort Studies, Humans, Longitudinal Studies, HIV Infections complications, Lung microbiology, Tropheryma, Whipple Disease complications, Whipple Disease microbiology
- Abstract
Rationale: Lung infections caused by opportunistic or virulent pathogens are a principal cause of morbidity and mortality in HIV infection. It is unknown whether HIV infection leads to changes in basal lung microflora, which may contribute to chronic pulmonary complications that increasingly are being recognized in individuals infected with HIV., Objectives: To determine whether the immunodeficiency associated with HIV infection resulted in alteration of the lung microbiota., Methods: We used 16S ribosomal RNA targeted pyrosequencing and shotgun metagenomic sequencing to analyze bacterial gene sequences in bronchoalveolar lavage (BAL) and mouths of 82 HIV-positive and 77 HIV-negative subjects., Measurements and Main Results: Sequences representing Tropheryma whipplei, the etiologic agent of Whipple's disease, were significantly more frequent in BAL of HIV-positive compared with HIV-negative individuals. T. whipplei dominated the community (>50% of sequence reads) in 11 HIV-positive subjects, but only 1 HIV-negative individual (13.4 versus 1.3%; P = 0.0018). In 30 HIV-positive individuals sampled longitudinally, antiretroviral therapy resulted in a significantly reduced relative abundance of T. whipplei in the lung. Shotgun metagenomic sequencing was performed on eight BAL samples dominated by T. whipplei 16S ribosomal RNA. Whole genome assembly of pooled reads showed that uncultured lung-derived T. whipplei had similar gene content to two isolates obtained from subjects with Whipple's disease., Conclusions: Asymptomatic subjects with HIV infection have unexpected colonization of the lung by T. whipplei, which is reduced by effective antiretroviral therapy and merits further study for a potential pathogenic role in chronic pulmonary complications of HIV infection.
- Published
- 2013
- Full Text
- View/download PDF
13. Lake microbial communities are resilient after a whole-ecosystem disturbance.
- Author
-
Shade A, Read JS, Youngblut ND, Fierer N, Knight R, Kratz TK, Lottig NR, Roden EE, Stanley EH, Stombaugh J, Whitaker RJ, Wu CH, and McMahon KD
- Subjects
- Bacteria classification, Bacteria genetics, Biota, Models, Statistical, Oxygen analysis, Phylogeny, RNA, Ribosomal, 16S genetics, Seasons, Sequence Analysis, DNA, Temperature, Water chemistry, Bacteria growth & development, Ecosystem, Lakes microbiology, Water Microbiology
- Abstract
Disturbances act as powerful structuring forces on ecosystems. To ask whether environmental microbial communities have capacity to recover after a large disturbance event, we conducted a whole-ecosystem manipulation, during which we imposed an intense disturbance on freshwater microbial communities by artificially mixing a temperate lake during peak summer thermal stratification. We employed environmental sensors and water chemistry analyses to evaluate the physical and chemical responses of the lake, and bar-coded 16S ribosomal RNA gene pyrosequencing and automated ribosomal intergenic spacer analysis (ARISA) to assess the bacterial community responses. The artificial mixing increased mean lake temperature from 14 to 20 °C for seven weeks after mixing ended, and exposed the microorganisms to very different environmental conditions, including increased hypolimnion oxygen and increased epilimnion carbon dioxide concentrations. Though overall ecosystem conditions remained altered (with hypolimnion temperatures elevated from 6 to 20 °C), bacterial communities returned to their pre-manipulation state as some environmental conditions, such as oxygen concentration, recovered. Recovery to pre-disturbance community composition and diversity was observed within 7 (epilimnion) and 11 (hypolimnion) days after mixing. Our results suggest that some microbial communities have capacity to recover after a major disturbance.
- Published
- 2012
- Full Text
- View/download PDF
14. Using QIIME to analyze 16S rRNA gene sequences from microbial communities.
- Author
-
Kuczynski J, Stombaugh J, Walters WA, González A, Caporaso JG, and Knight R
- Subjects
- Bacteria classification, Bacteria isolation & purification, Biodiversity, Computational Biology instrumentation, DNA, Bacterial genetics, Phylogeny, Sequence Analysis, DNA instrumentation, Bacteria genetics, Computational Biology methods, RNA, Ribosomal, 16S genetics, Sequence Analysis, DNA methods, Software
- Abstract
QIIME (canonically pronounced "chime") is a software application that performs microbial community analysis. It is an acronym for Quantitative Insights Into Microbial Ecology, and has been used to analyze and interpret nucleic acid sequence data from fungal, viral, bacterial, and archaeal communities. The following protocols describe how to install QIIME on a single computer and use it to analyze microbial 16S sequence data from nine distinct microbial communities., (© 2012 by John Wiley & Sons, Inc.)
- Published
- 2012
- Full Text
- View/download PDF
15. Microbiota regulate intestinal absorption and metabolism of fatty acids in the zebrafish.
- Author
-
Semova I, Carten JD, Stombaugh J, Mackey LC, Knight R, Farber SA, and Rawls JF
- Subjects
- Animals, Diet, Energy Metabolism, Fluorescence, Image Processing, Computer-Assisted, Intestinal Absorption, Intestinal Mucosa metabolism, Intestines microbiology, Liver metabolism, Fatty Acids metabolism, Metagenome, Zebrafish metabolism, Zebrafish microbiology
- Abstract
Regulation of intestinal dietary fat absorption is critical to maintaining energy balance. While intestinal microbiota clearly impact the host's energy balance, their role in intestinal absorption and extraintestinal metabolism of dietary fat is less clear. Using in vivo imaging of fluorescent fatty acid (FA) analogs delivered to gnotobiotic zebrafish hosts, we reveal that microbiota stimulate FA uptake and lipid droplet (LD) formation in the intestinal epithelium and liver. Microbiota increase epithelial LD number in a diet-dependent manner. The presence of food led to the intestinal enrichment of bacteria from the phylum Firmicutes. Diet-enriched Firmicutes and their products were sufficient to increase epithelial LD number, whereas LD size was increased by other bacterial types. Thus, different members of the intestinal microbiota promote FA absorption via distinct mechanisms. Diet-induced alterations in microbiota composition might influence fat absorption, providing mechanistic insight into how microbiota-diet interactions regulate host energy balance., (Copyright © 2012 Elsevier Inc. All rights reserved.)
- Published
- 2012
- Full Text
- View/download PDF
16. RNASTAR: an RNA STructural Alignment Repository that provides insight into the evolution of natural and artificial RNAs.
- Author
-
Widmann J, Stombaugh J, McDonald D, Chocholousova J, Gardner P, Iyer MK, Liu Z, Lozupone CA, Quinn J, Smit S, Wikman S, Zaneveld JR, and Knight R
- Subjects
- Algorithms, Base Sequence, Magnetic Resonance Spectroscopy, Models, Molecular, Molecular Sequence Data, Nucleotide Motifs, Databases, Nucleic Acid, Evolution, Molecular, Nucleic Acid Conformation, RNA chemistry, Sequence Alignment
- Abstract
Automated RNA alignment algorithms often fail to recapture the essential conserved sites that are critical for function. To assist in the refinement of these algorithms, we manually curated a set of 148 alignments with a total of 9600 unique sequences, in which each alignment was backed by at least one crystal or NMR structure. These alignments included both naturally and artificially selected molecules. We used principles of isostericity to improve the alignments from an average of 83%-94% isosteric base pairs. We expect that this alignment collection will assist in a wide range of benchmarking efforts and provide new insight into evolutionary principles governing change in RNA structural motifs. The improved alignments have been contributed to the Rfam database.
- Published
- 2012
- Full Text
- View/download PDF
17. Responses of gut microbiota to diet composition and weight loss in lean and obese mice.
- Author
-
Ravussin Y, Koren O, Spor A, LeDuc C, Gutman R, Stombaugh J, Knight R, Ley RE, and Leibel RL
- Subjects
- Adiposity, Animals, Diet, High-Fat, Energy Intake, Energy Metabolism, Male, Mice, Mice, Inbred C57BL, Adipose Tissue, White pathology, Cecum microbiology, Intestinal Mucosa microbiology, Leptin blood, Metagenome, Weight Loss
- Abstract
Maintenance of a reduced body weight is accompanied by a decrease in energy expenditure beyond that accounted for by reduced body mass and composition, as well as by an increased drive to eat. These effects appear to be due--in part--to reductions in circulating leptin concentrations due to loss of body fat. Gut microbiota have been implicated in the regulation of body weight. The effects of weight loss on qualitative aspects of gut microbiota have been studied in humans and mice, but these studies have been confounded by concurrent changes in diet composition, which influence microbial community composition. We studied the impact of 20% weight loss on the microbiota of diet-induced obese (DIO: 60% calories fat) mice on a high-fat diet (HFD). Weight-reduced DIO (DIO-WR) mice had the same body weight and composition as control (CON) ad-libitum (AL) fed mice being fed a control diet (10% calories fat), allowing a direct comparison of diet and weight-perturbation effects. Microbial community composition was assessed by pyrosequencing 16S rRNA genes derived from the ceca of sacrificed animals. There was a strong effect of diet composition on the diversity and composition of the microbiota. The relative abundance of specific members of the microbiota was correlated with circulating leptin concentrations and gene expression levels of inflammation markers in subcutaneous white adipose tissue in all mice. Together, these results suggest that both host adiposity and diet composition impact microbiota composition, possibly through leptin-mediated regulation of mucus production and/or inflammatory processes that alter the gut habitat.
- Published
- 2012
- Full Text
- View/download PDF
18. Comprehensive survey and geometric classification of base triples in RNA structures.
- Author
-
Abu Almakarem AS, Petrov AI, Stombaugh J, Zirbel CL, and Leontis NB
- Subjects
- Base Pairing, Cluster Analysis, Hydrogen Bonding, Models, Molecular, Nucleotide Motifs, RNA, Ribosomal chemistry, RNA chemistry
- Abstract
Base triples are recurrent clusters of three RNA nucleobases interacting edge-to-edge by hydrogen bonding. We find that the central base in almost all triples forms base pairs with the other two bases of the triple, providing a natural way to geometrically classify base triples. Given 12 geometric base pair families defined by the Leontis-Westhof nomenclature, combinatoric enumeration predicts 108 potential geometric base triple families. We searched representative atomic-resolution RNA 3D structures and found instances of 68 of the 108 predicted base triple families. Model building suggests that some of the remaining 40 families may be unlikely to form for steric reasons. We developed an on-line resource that provides exemplars of all base triples observed in the structure database and models for unobserved, predicted triples, grouped by triple family, as well as by three-base combination (http://rna.bgsu.edu/Triples). The classification helps to identify recurrent triple motifs that can substitute for each other while conserving RNA 3D structure, with applications in RNA 3D structure prediction and analysis of RNA sequence evolution.
- Published
- 2012
- Full Text
- View/download PDF
19. SitePainter: a tool for exploring biogeographical patterns.
- Author
-
Gonzalez A, Stombaugh J, Lauber CL, Fierer N, and Knight R
- Subjects
- Bacteria genetics, Humans, Bacteria classification, Hand microbiology, Metagenome, Metagenomics methods, Software
- Abstract
Unlabelled: As microbial ecologists take advantage of high-throughput analytical techniques to describe microbial communities across ever-increasing numbers of samples, the need for new analysis tools that reveal the intrinsic spatial patterns and structures of these populations is crucial. Here we present SitePainter, an interactive graphical tool that allows investigators to create or upload pictures of their study site, load diversity analyses data and display both diversity and taxonomy results in a spatial context. Features of SitePainter include: visualizing α -diversity, using taxonomic summaries; visualizing β -diversity, using results from multidimensional scaling methods; and animating relationships among microbial taxa or pathways overtime. SitePainter thus increases the visual power and ability to explore spatially explicit studies., Availability: https://sourceforge.net/projects/sitepainter, Supplementary Information: Supplementary data are available at Bioinformatics online., Contact: antoniog@colorado.edu, Rob.Knight@colorado.edu.
- Published
- 2012
- Full Text
- View/download PDF
20. Using QIIME to analyze 16S rRNA gene sequences from microbial communities.
- Author
-
Kuczynski J, Stombaugh J, Walters WA, González A, Caporaso JG, and Knight R
- Subjects
- Bacteria classification, Biodiversity, DNA, Bacterial chemistry, Phylogeny, Bacteria genetics, Genes, rRNA, RNA, Ribosomal, 16S genetics, Sequence Analysis, RNA methods, Software
- Abstract
QIIME (canonically pronounced "chime") is a software application that performs microbial community analysis. It is an acronym for Quantitative Insights Into Microbial Ecology, and has been used to analyze and interpret nucleic acid sequence data from fungal, viral, bacterial, and archaeal communities. The following protocols describe how to install QIIME on a single computer and use it to analyze microbial 16S sequence data from nine distinct microbial communities., (© 2011 by John Wiley & Sons, Inc.)
- Published
- 2011
- Full Text
- View/download PDF
21. TopiaryExplorer: visualizing large phylogenetic trees with environmental metadata.
- Author
-
Pirrung M, Kennedy R, Caporaso JG, Stombaugh J, Wendel D, and Knight R
- Subjects
- Environment, Proteobacteria classification, Proteobacteria isolation & purification, Phylogeny, Software
- Abstract
Motivation: Microbial community profiling is a highly active area of research, but tools that facilitate visualization of phylogenetic trees and associated environmental data have not kept up with the increasing quantity of data generated in these studies., Results: TopiaryExplorer supports the visualization of very large phylogenetic trees, including features such as the automated coloring of branches by environmental data, manipulation of trees and incorporation of per-tip metadata (e.g. taxonomic labels)., Availability: http://topiaryexplorer.sourceforge.net., Contact: rob.knight@colorado.edu.
- Published
- 2011
- Full Text
- View/download PDF
22. Combined phylogenetic and genomic approaches for the high-throughput study of microbial habitat adaptation.
- Author
-
Zaneveld JR, Parfrey LW, Van Treuren W, Lozupone C, Clemente JC, Knights D, Stombaugh J, Kuczynski J, and Knight R
- Subjects
- Adaptation, Physiological genetics, Archaea genetics, Bacteria genetics, Sequence Analysis, DNA, Archaea physiology, Bacterial Physiological Phenomena, Ecosystem, Genomics methods, High-Throughput Nucleotide Sequencing methods, Phylogeny
- Abstract
High-throughput sequencing technologies provide new opportunities to address longstanding questions about habitat adaptation in microbial organisms. How have microbes managed to adapt to such a wide range of environments, and what genomic features allow for such adaptation? We review recent large-scale studies of habitat adaptation, with emphasis on those that utilize phylogenetic techniques. On the basis of current trends, we summarize methodological challenges faced by investigators, and the tools, techniques and analytical approaches available to overcome them. Phylogenetic approaches and detailed information about each environmental sample will be crucial as the ability to collect genome sequences continues to expand., (Copyright © 2011 Elsevier Ltd. All rights reserved.)
- Published
- 2011
- Full Text
- View/download PDF
23. Sharing and archiving nucleic acid structure mapping data.
- Author
-
Rocca-Serra P, Bellaousov S, Birmingham A, Chen C, Cordero P, Das R, Davis-Neulander L, Duncan CD, Halvorsen M, Knight R, Leontis NB, Mathews DH, Ritz J, Stombaugh J, Weeks KM, Zirbel CL, and Laederach A
- Subjects
- Algorithms, Archives, Base Sequence, Chromosome Mapping classification, Humans, Molecular Sequence Data, Nucleic Acids analysis, Nucleic Acids chemistry, RNA analysis, Research Design, Validation Studies as Topic, Chromosome Mapping methods, Databases, Nucleic Acid, Information Dissemination, Nucleic Acid Conformation, RNA chemistry
- Abstract
Nucleic acids are particularly amenable to structural characterization using chemical and enzymatic probes. Each individual structure mapping experiment reveals specific information about the structure and/or dynamics of the nucleic acid. Currently, there is no simple approach for making these data publically available in a standardized format. We therefore developed a standard for reporting the results of single nucleotide resolution nucleic acid structure mapping experiments, or SNRNASMs. We propose a schema for sharing nucleic acid chemical probing data that uses generic public servers for storing, retrieving, and searching the data. We have also developed a consistent nomenclature (ontology) within the Ontology of Biomedical Investigations (OBI), which provides unique identifiers (termed persistent URLs, or PURLs) for classifying the data. Links to standardized data sets shared using our proposed format along with a tutorial and links to templates can be found at http://snrnasm.bio.unc.edu.
- Published
- 2011
- Full Text
- View/download PDF
24. Boulder ALignment Editor (ALE): a web-based RNA alignment tool.
- Author
-
Stombaugh J, Widmann J, McDonald D, and Knight R
- Subjects
- Base Pairing, Internet, Nucleic Acid Conformation, RNA chemistry, Sequence Alignment methods, Sequence Analysis, RNA, Software
- Abstract
Summary: The explosion of interest in non-coding RNAs, together with improvements in RNA X-ray crystallography, has led to a rapid increase in RNA structures at atomic resolution from 847 in 2005 to 1900 in 2010. The success of whole-genome sequencing has led to an explosive growth of unaligned homologous sequences. Consequently, there is a compelling and urgent need for user-friendly tools for producing structure-informed RNA alignments. Most alignment software considers the primary sequence alone; some specialized alignment software can also include Watson-Crick base pairs, but none adequately addresses the needs introduced by the rapid influx of both sequence and structural data. Therefore, we have developed the Boulder ALignment Editor (ALE), which is a web-based RNA alignment editor, designed for editing and assessing alignments using structural information. Some features of BoulderALE include the annotation and evaluation of an alignment based on isostericity of Watson-Crick and non-Watson-Crick base pairs, along with the collapsing (horizontally and vertically) of the alignment, while maintaining the ability to edit the alignment., Availability: http://www.microbio.me/boulderale.
- Published
- 2011
- Full Text
- View/download PDF
25. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications.
- Author
-
Yilmaz P, Kottmann R, Field D, Knight R, Cole JR, Amaral-Zettler L, Gilbert JA, Karsch-Mizrachi I, Johnston A, Cochrane G, Vaughan R, Hunter C, Park J, Morrison N, Rocca-Serra P, Sterk P, Arumugam M, Bailey M, Baumgartner L, Birren BW, Blaser MJ, Bonazzi V, Booth T, Bork P, Bushman FD, Buttigieg PL, Chain PS, Charlson E, Costello EK, Huot-Creasy H, Dawyndt P, DeSantis T, Fierer N, Fuhrman JA, Gallery RE, Gevers D, Gibbs RA, San Gil I, Gonzalez A, Gordon JI, Guralnick R, Hankeln W, Highlander S, Hugenholtz P, Jansson J, Kau AL, Kelley ST, Kennedy J, Knights D, Koren O, Kuczynski J, Kyrpides N, Larsen R, Lauber CL, Legg T, Ley RE, Lozupone CA, Ludwig W, Lyons D, Maguire E, Methé BA, Meyer F, Muegge B, Nakielny S, Nelson KE, Nemergut D, Neufeld JD, Newbold LK, Oliver AE, Pace NR, Palanisamy G, Peplies J, Petrosino J, Proctor L, Pruesse E, Quast C, Raes J, Ratnasingham S, Ravel J, Relman DA, Assunta-Sansone S, Schloss PD, Schriml L, Sinha R, Smith MI, Sodergren E, Spo A, Stombaugh J, Tiedje JM, Ward DV, Weinstock GM, Wendel D, White O, Whiteley A, Wilke A, Wortman JR, Yatsunenko T, and Glöckner FO
- Subjects
- Checklist, Databases, Genetic, Genes, rRNA, Genetic Variation, Humans, Information Storage and Retrieval standards, Internet, Programming Languages, Software, Biomarkers, Environment, Metagenomics standards, Sequence Analysis, DNA standards
- Abstract
Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences--the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The 'environmental packages' apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and to provide a single point of entry for the scientific community to access and learn about GSC checklists, we present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere.
- Published
- 2011
- Full Text
- View/download PDF
26. Meeting report of the RNA Ontology Consortium January 8-9, 2011.
- Author
-
Birmingham A, Clemente JC, Desai N, Gilbert J, Gonzalez A, Kyrpides N, Meyer F, Nawrocki E, Sterk P, Stombaugh J, Weinberg Z, Wendel D, Leontis NB, Zirbel C, Knight R, and Laederach A
- Abstract
This report summarizes the proceedings of the structure mapping working group meeting of the RNA Ontology Consortium (ROC), held in Kona, Hawaii on January 8-9, 2011. The ROC hosted this workshop to facilitate collaborations among those researchers formalizing concepts in RNA, those developing RNA-related software, and those performing genome annotation and standardization. The workshop included three software presentations, extended round-table discussions, and the constitution of two new working groups, the first to address the need for better software integration and the second to discuss standardization and benchmarking of existing RNA annotation pipelines. These working groups have subsequently pursued concrete implementation of actions suggested during the discussion. Further information about the ROC and its activities can be found at http://roc.bgsu.edu/.
- Published
- 2011
- Full Text
- View/download PDF
27. Technology and data-intensive science in the beginning of the 21st century.
- Author
-
Bernstein PA, Wecker D, Krishnamurthy A, Manocha D, Gardner J, Kolker N, Reschke C, Stombaugh J, Vagata P, Stewart E, Welch D, and Kolker E
- Subjects
- Biological Science Disciplines methods, Technology methods
- Abstract
This article is a summary of the technology issues and challenges of data-intensive science and cloud computing as discussed in the Data-Intensive Science (DIS) workshop in Seattle, September 19-20, 2010., (© Mary Ann Liebert, Inc.)
- Published
- 2011
- Full Text
- View/download PDF
28. Succession of microbial consortia in the developing infant gut microbiome.
- Author
-
Koenig JE, Spor A, Scalfone N, Fricker AD, Stombaugh J, Knight R, Angenent LT, and Ley RE
- Subjects
- Age Factors, Base Sequence, Cluster Analysis, DNA Barcoding, Taxonomic, DNA Primers genetics, Feces microbiology, Humans, Infant, Infant, Newborn, Molecular Sequence Data, RNA, Ribosomal, 16S genetics, Sequence Analysis, DNA, Species Specificity, Gastrointestinal Tract microbiology, Metagenome genetics
- Abstract
The colonization process of the infant gut microbiome has been called chaotic, but this view could reflect insufficient documentation of the factors affecting the microbiome. We performed a 2.5-y case study of the assembly of the human infant gut microbiome, to relate life events to microbiome composition and function. Sixty fecal samples were collected from a healthy infant along with a diary of diet and health status. Analysis of >300,000 16S rRNA genes indicated that the phylogenetic diversity of the microbiome increased gradually over time and that changes in community composition conformed to a smooth temporal gradient. In contrast, major taxonomic groups showed abrupt shifts in abundance corresponding to changes in diet or health. Community assembly was nonrandom: we observed discrete steps of bacterial succession punctuated by life events. Furthermore, analysis of ≈ 500,000 DNA metagenomic reads from 12 fecal samples revealed that the earliest microbiome was enriched in genes facilitating lactate utilization, and that functional genes involved in plant polysaccharide metabolism were present before the introduction of solid food, priming the infant gut for an adult diet. However, ingestion of table foods caused a sustained increase in the abundance of Bacteroidetes, elevated fecal short chain fatty acid levels, enrichment of genes associated with carbohydrate utilization, vitamin biosynthesis, and xenobiotic degradation, and a more stable community composition, all of which are characteristic of the adult microbiome. This study revealed that seemingly chaotic shifts in the microbiome are associated with life events; however, additional experiments ought to be conducted to assess how different infants respond to similar life events.
- Published
- 2011
- Full Text
- View/download PDF
29. Human oral, gut, and plaque microbiota in patients with atherosclerosis.
- Author
-
Koren O, Spor A, Felin J, Fåk F, Stombaugh J, Tremaroli V, Behre CJ, Knight R, Fagerberg B, Ley RE, and Bäckhed F
- Subjects
- Aged, Base Sequence, Cluster Analysis, Female, Humans, Male, Molecular Sequence Data, Phylogeny, Polymerase Chain Reaction, RNA, Ribosomal, 16S genetics, Sequence Analysis, DNA, Species Specificity, Sweden, Atherosclerosis microbiology, Bacteria genetics, Gastrointestinal Tract microbiology, Metagenome genetics, Mouth microbiology, Plaque, Atherosclerotic microbiology
- Abstract
Periodontal disease has been associated with atherosclerosis, suggesting that bacteria from the oral cavity may contribute to the development of atherosclerosis and cardiovascular disease. Furthermore, the gut microbiota may affect obesity, which is associated with atherosclerosis. Using qPCR, we show that bacterial DNA was present in the atherosclerotic plaque and that the amount of DNA correlated with the amount of leukocytes in the atherosclerotic plaque. To investigate the microbial composition of atherosclerotic plaques and test the hypothesis that the oral or gut microbiota may contribute to atherosclerosis in humans, we used 454 pyrosequencing of 16S rRNA genes to survey the bacterial diversity of atherosclerotic plaque, oral, and gut samples of 15 patients with atherosclerosis, and oral and gut samples of healthy controls. We identified Chryseomonas in all atherosclerotic plaque samples, and Veillonella and Streptococcus in the majority. Interestingly, the combined abundances of Veillonella and Streptococcus in atherosclerotic plaques correlated with their abundance in the oral cavity. Moreover, several additional bacterial phylotypes were common to the atherosclerotic plaque and oral or gut samples within the same individual. Interestingly, several bacterial taxa in the oral cavity and the gut correlated with plasma cholesterol levels. Taken together, our findings suggest that bacteria from the oral cavity, and perhaps even the gut, may correlate with disease markers of atherosclerosis.
- Published
- 2011
- Full Text
- View/download PDF
30. UniFrac: an effective distance metric for microbial community comparison.
- Author
-
Lozupone C, Lladser ME, Knights D, Stombaugh J, and Knight R
- Subjects
- Animals, Bacteria classification, Bacteria genetics, Humans, Phylogeny, Data Interpretation, Statistical, Ecology methods, Ecosystem, Software standards
- Published
- 2011
- Full Text
- View/download PDF
31. Microbial biogeography of public restroom surfaces.
- Author
-
Flores GE, Bates ST, Knights D, Lauber CL, Stombaugh J, Knight R, and Fierer N
- Subjects
- Bacteria classification, Female, Humans, Male, Surface Properties, Bacteria genetics, Bacteria isolation & purification, Phylogeography, Toilet Facilities
- Abstract
We spend the majority of our lives indoors where we are constantly exposed to bacteria residing on surfaces. However, the diversity of these surface-associated communities is largely unknown. We explored the biogeographical patterns exhibited by bacteria across ten surfaces within each of twelve public restrooms. Using high-throughput barcoded pyrosequencing of the 16 S rRNA gene, we identified 19 bacterial phyla across all surfaces. Most sequences belonged to four phyla: Actinobacteria, Bacteriodetes, Firmicutes and Proteobacteria. The communities clustered into three general categories: those found on surfaces associated with toilets, those on the restroom floor, and those found on surfaces routinely touched with hands. On toilet surfaces, gut-associated taxa were more prevalent, suggesting fecal contamination of these surfaces. Floor surfaces were the most diverse of all communities and contained several taxa commonly found in soils. Skin-associated bacteria, especially the Propionibacteriaceae, dominated surfaces routinely touched with our hands. Certain taxa were more common in female than in male restrooms as vagina-associated Lactobacillaceae were widely distributed in female restrooms, likely from urine contamination. Use of the SourceTracker algorithm confirmed many of our taxonomic observations as human skin was the primary source of bacteria on restroom surfaces. Overall, these results demonstrate that restroom surfaces host relatively diverse microbial communities dominated by human-associated bacteria with clear linkages between communities on or in different body sites and those communities found on restroom surfaces. More generally, this work is relevant to the public health field as we show that human-associated microbes are commonly found on restroom surfaces suggesting that bacterial pathogens could readily be transmitted between individuals by the touching of surfaces. Furthermore, we demonstrate that we can use high-throughput analyses of bacterial communities to determine sources of bacteria on indoor surfaces, an approach which could be used to track pathogen transmission and test the efficacy of hygiene practices.
- Published
- 2011
- Full Text
- View/download PDF
32. The mind-body-microbial continuum.
- Author
-
Gonzalez A, Stombaugh J, Lozupone C, Turnbaugh PJ, Gordon JI, and Knight R
- Subjects
- Gastrointestinal Tract microbiology, Genome, Bacterial genetics, Humans, Metagenomics methods, Neurotransmitter Agents therapeutic use, Behavioral Symptoms genetics, Behavioral Symptoms microbiology, Behavioral Symptoms therapy, Mental Disorders genetics, Mental Disorders microbiology, Mental Disorders therapy, Metagenome
- Abstract
Our understanding of the vast collection of microbes that live on and inside us (microbiota) and their collective genes (microbiome) has been revolutionized by culture-independent "metagenomic" techniques and DNA sequencing technologies. Most of our microbes live in our gut, where they function as a metabolic organ and provide attributes not encoded in our human genome. Metagenomic studies are revealing shared and distinctive features of microbial communities inhabiting different humans. A central question in psychiatry is the relative role of genes and environment in shaping behavior. The human microbiome serves as the interface between our genes and our history of environmental exposures; explorations of our microbiomes thus offer the possibility of providing new insights into our neurodevelopment and our behavioral phenotypes by affecting complex processes such as inter- and intra personal variations in cognition, personality, mood, sleep, and eating behavior, and perhaps even a variety of neuropsychiatric diseases ranging from affective disorders to autism. Better understanding of microbiome-encoded pathways for xenobiotic metabolism also has important implications for improving the efficacy of pharmacologic interventions with neuromodulatory agents.
- Published
- 2011
33. Moving pictures of the human microbiome.
- Author
-
Caporaso JG, Lauber CL, Costello EK, Berg-Lyons D, Gonzalez A, Stombaugh J, Knights D, Gajer P, Ravel J, Fierer N, Gordon JI, and Knight R
- Subjects
- Bacteria classification, Computer Simulation, High-Throughput Nucleotide Sequencing, Humans, Inflammatory Bowel Diseases microbiology, Obesity microbiology, Organ Specificity, RNA, Ribosomal, 16S genetics, Seasons, Time Factors, Bacteria genetics, Gastrointestinal Tract microbiology, Genomics methods, Metagenome genetics
- Abstract
Background: Understanding the normal temporal variation in the human microbiome is critical to developing treatments for putative microbiome-related afflictions such as obesity, Crohn’s disease, inflammatory bowel disease and malnutrition. Sequencing and computational technologies, however, have been a limiting factor in performing dense time series analysis of the human microbiome. Here, we present the largest human microbiota time series analysis to date, covering two individuals at four body sites over 396 timepoints., Results: We find that despite stable differences between body sites and individuals, there is pronounced variability in an individual’s microbiota across months, weeks and even days. Additionally, only a small fraction of the total taxa found within a single body site appear to be present across all time points, suggesting that no core temporal microbiome exists at high abundance (although some microbes may be present but drop below the detection threshold). Many more taxa appear to be persistent but non-permanent community members., Conclusions: DNA sequencing and computational advances described here provide the ability to go beyond infrequent snapshots of our human-associated microbial ecology to high-resolution assessments of temporal variations over protracted periods, within and between body habitats and individuals. This capacity will allow us to define normal variation and pathologic states, and assess responses to therapeutic interventions.
- Published
- 2011
- Full Text
- View/download PDF
34. QIIME allows analysis of high-throughput community sequencing data.
- Author
-
Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, Fierer N, Peña AG, Goodrich JK, Gordon JI, Huttley GA, Kelley ST, Knights D, Koenig JE, Ley RE, Lozupone CA, McDonald D, Muegge BD, Pirrung M, Reeder J, Sevinsky JR, Turnbaugh PJ, Walters WA, Widmann J, Yatsunenko T, Zaneveld J, and Knight R
- Subjects
- Animals, Feces microbiology, Humans, Mice, Twins, Dizygotic genetics, Twins, Monozygotic genetics, RNA, Ribosomal, 16S genetics, Sequence Analysis, RNA methods, Software
- Published
- 2010
- Full Text
- View/download PDF
35. The RNA structure alignment ontology.
- Author
-
Brown JW, Birmingham A, Griffiths PE, Jossinet F, Kachouri-Lafond R, Knight R, Lang BF, Leontis N, Steger G, Stombaugh J, and Westhof E
- Subjects
- Animals, Base Sequence, Humans, Models, Biological, Molecular Sequence Data, Nucleic Acid Conformation, Phylogeny, RNA chemistry, Sequence Alignment trends, Sequence Analysis, RNA methods, Sequence Homology, Nucleic Acid, RNA analysis, Sequence Alignment methods, Software
- Abstract
Multiple sequence alignments are powerful tools for understanding the structures, functions, and evolutionary histories of linear biological macromolecules (DNA, RNA, and proteins), and for finding homologs in sequence databases. We address several ontological issues related to RNA sequence alignments that are informed by structure. Multiple sequence alignments are usually shown as two-dimensional (2D) matrices, with rows representing individual sequences, and columns identifying nucleotides from different sequences that correspond structurally, functionally, and/or evolutionarily. However, the requirement that sequences and structures correspond nucleotide-by-nucleotide is unrealistic and hinders representation of important biological relationships. High-throughput sequencing efforts are also rapidly making 2D alignments unmanageable because of vertical and horizontal expansion as more sequences are added. Solving the shortcomings of traditional RNA sequence alignments requires explicit annotation of the meaning of each relationship within the alignment. We introduce the notion of "correspondence," which is an equivalence relation between RNA elements in sets of sequences as the basis of an RNA alignment ontology. The purpose of this ontology is twofold: first, to enable the development of new representations of RNA data and of software tools that resolve the expansion problems with current RNA sequence alignments, and second, to facilitate the integration of sequence data with secondary and three-dimensional structural information, as well as other experimental information, to create simultaneously more accurate and more exploitable RNA alignments.
- Published
- 2009
- Full Text
- View/download PDF
36. Classification and energetics of the base-phosphate interactions in RNA.
- Author
-
Zirbel CL, Sponer JE, Sponer J, Stombaugh J, and Leontis NB
- Subjects
- Adenine chemistry, Base Pairing, Base Sequence, Binding Sites, Conserved Sequence, Cytosine chemistry, Guanine chemistry, Hydrogen Bonding, Models, Molecular, Molecular Sequence Data, Nucleic Acid Conformation, Quantum Theory, RNA, Ribosomal chemistry, Sequence Homology, Nucleic Acid, Uracil chemistry, Phosphates chemistry, RNA chemistry
- Abstract
Structured RNA molecules form complex 3D architectures stabilized by multiple interactions involving the nucleotide base, sugar and phosphate moieties. A significant percentage of the bases in structured RNA molecules in the Protein Data Bank (PDB) hydrogen-bond with phosphates of other nucleotides. By extracting and superimposing base-phosphate (BPh) interactions from a reduced-redundancy subset of 3D structures from the PDB, we identified recurrent phosphate-binding sites on the RNA bases. Quantum chemical calculations were carried out on model systems representing each BPh interaction. The calculations show that the centers of each cluster obtained from the structure superpositions correspond to energy minima on the potential energy hypersurface. The calculations also show that the most stable phosphate-binding sites occur on the Watson-Crick edge of guanine and the Hoogsteen edge of cytosine. We modified the 'Find RNA 3D' (FR3D) software suite to automatically find and classify BPh interactions. Comparison of the 3D structures of the 16S and 23S rRNAs of Escherichia coli and Thermus thermophilus revealed that most BPh interactions are phylogenetically conserved and they occur primarily in hairpin, internal or junction loops or as part of tertiary interactions. Bases that form BPh interactions, which are conserved in the rRNA 3D structures are also conserved in homologous rRNA sequence alignments.
- Published
- 2009
- Full Text
- View/download PDF
37. Frequency and isostericity of RNA base pairs.
- Author
-
Stombaugh J, Zirbel CL, Westhof E, and Leontis NB
- Subjects
- Base Pairing, Base Sequence, Models, Molecular, Nucleic Acid Conformation, RNA, Bacterial chemistry, RNA, Ribosomal chemistry, Sequence Alignment, Sequence Analysis, RNA, RNA chemistry
- Abstract
Most of the hairpin, internal and junction loops that appear single-stranded in standard RNA secondary structures form recurrent 3D motifs, where non-Watson-Crick base pairs play a central role. Non-Watson-Crick base pairs also play crucial roles in tertiary contacts in structured RNA molecules. We previously classified RNA base pairs geometrically so as to group together those base pairs that are structurally similar (isosteric) and therefore able to substitute for each other by mutation without disrupting the 3D structure. Here, we introduce a quantitative measure of base pair isostericity, the IsoDiscrepancy Index (IDI), to more accurately determine which base pair substitutions can potentially occur in conserved motifs. We extract and classify base pairs from a reduced-redundancy set of RNA 3D structures from the Protein Data Bank (PDB) and calculate centroids (exemplars) for each base combination and geometric base pair type (family). We use the exemplars and IDI values to update our online Basepair Catalog and the Isostericity Matrices (IM) for each base pair family. From the database of base pairs observed in 3D structures we derive base pair occurrence frequencies for each of the 12 geometric base pair families. In order to improve the statistics from the 3D structures, we also derive base pair occurrence frequencies from rRNA sequence alignments.
- Published
- 2009
- Full Text
- View/download PDF
38. FR3D: finding local and composite recurrent structural motifs in RNA 3D structures.
- Author
-
Sarver M, Zirbel CL, Stombaugh J, Mokdad A, and Leontis NB
- Subjects
- Algorithms, RNA, Ribosomal, 23S chemistry, RNA, Ribosomal, 5S chemistry, Ricin chemistry, Software, Computational Biology methods, Nucleic Acid Conformation, RNA chemistry
- Abstract
New methods are described for finding recurrent three-dimensional (3D) motifs in RNA atomic-resolution structures. Recurrent RNA 3D motifs are sets of RNA nucleotides with similar spatial arrangements. They can be local or composite. Local motifs comprise nucleotides that occur in the same hairpin or internal loop. Composite motifs comprise nucleotides belonging to three or more different RNA strand segments or molecules. We use a base-centered approach to construct efficient, yet exhaustive search procedures using geometric, symbolic, or mixed representations of RNA structure that we implement in a suite of MATLAB programs, "Find RNA 3D" (FR3D). The first modules of FR3D preprocess structure files to classify base-pair and -stacking interactions. Each base is represented geometrically by the position of its glycosidic nitrogen in 3D space and by the rotation matrix that describes its orientation with respect to a common frame. Base-pairing and base-stacking interactions are calculated from the base geometries and are represented symbolically according to the Leontis/Westhof basepairing classification, extended to include base-stacking. These data are stored and used to organize motif searches. For geometric searches, the user supplies the 3D structure of a query motif which FR3D uses to find and score geometrically similar candidate motifs, without regard to the sequential position of their nucleotides in the RNA chain or the identity of their bases. To score and rank candidate motifs, FR3D calculates a geometric discrepancy by rigidly rotating candidates to align optimally with the query motif and then comparing the relative orientations of the corresponding bases in the query and candidate motifs. Given the growing size of the RNA structure database, it is impossible to explicitly compute the discrepancy for all conceivable candidate motifs, even for motifs with less than ten nucleotides. The screening algorithm that we describe finds all candidate motifs whose geometric discrepancy with respect to the query motif falls below a user-specified cutoff discrepancy. This technique can be applied to RMSD searches. Candidate motifs identified geometrically may be further screened symbolically to identify those that contain particular basepair types or base-stacking arrangements or that conform to sequence continuity or nucleotide identity constraints. Purely symbolic searches for motifs containing user-defined sequence, continuity and interaction constraints have also been implemented. We demonstrate that FR3D finds all occurrences, both local and composite and with nucleotide substitutions, of sarcin/ricin and kink-turn motifs in the 23S and 5S ribosomal RNA 3D structures of the H. marismortui 50S ribosomal subunit and assigns the lowest discrepancy scores to bona fide examples of these motifs. The search algorithms have been optimized for speed to allow users to search the non-redundant RNA 3D structure database on a personal computer in a matter of minutes.
- Published
- 2008
- Full Text
- View/download PDF
39. Tertiary structure and function of an RNA motif required for plant vascular entry to initiate systemic trafficking.
- Author
-
Zhong X, Tao X, Stombaugh J, Leontis N, and Ding B
- Subjects
- DNA Mutational Analysis, Models, Molecular, Molecular Sequence Data, Molecular Structure, Plant Viruses genetics, RNA metabolism, Nicotiana anatomy & histology, Nicotiana genetics, Nicotiana metabolism, Nicotiana virology, Water chemistry, Base Sequence, Biological Transport physiology, Nucleic Acid Conformation, Plant Viruses metabolism, Plants anatomy & histology, Plants genetics, Plants metabolism, RNA genetics
- Abstract
Vascular entry is a decisive step for the initiation of long-distance movement of infectious and endogenous RNAs, silencing signals and developmental/defense signals in plants. However, the mechanisms remain poorly understood. We used Potato spindle tuber viroid (PSTVd) as a model to investigate the direct role of the RNA itself in vascular entry. We report here the identification of an RNA motif that is required for PSTVd to traffic from nonvascular into the vascular tissue phloem to initiate systemic infection. This motif consists of nucleotides U/C that form a water-inserted cis Watson-Crick/Watson-Crick base pair flanked by short helices that comprise canonical Watson-Crick/Watson-Crick base pairs. This tertiary structural model was inferred by comparison with X-ray crystal structures of similar motifs in rRNAs and is supported by combined mutagenesis and covariation analyses. Hydration pattern analysis suggests that water insertion induces a widened minor groove conducive to protein and/or RNA interactions. Our model and approaches have broad implications to investigate the RNA structural motifs in other RNAs for vascular entry and to study the basic principles of RNA structure-function relationships.
- Published
- 2007
- Full Text
- View/download PDF
40. Computer identification of snoRNA genes using a Mammalian Orthologous Intron Database.
- Author
-
Fedorov A, Stombaugh J, Harr MW, Yu S, Nasalean L, and Shepelev V
- Subjects
- Algorithms, Animals, Base Sequence, Computational Biology, Conserved Sequence, Genes, Humans, Mice, Molecular Sequence Data, RNA chemistry, RNA, Small Nucleolar chemistry, Rats, Sequence Alignment, Databases, Nucleic Acid, Genomics, Introns, RNA, Small Nucleolar genetics, Software
- Abstract
Based on comparative genomics, we created a bioinformatic package for computer prediction of small nucleolar RNA (snoRNA) genes in mammalian introns. The core of our approach was the use of the Mammalian Orthologous Intron Database (MOID), which contains all known introns within the human, mouse and rat genomes. Introns from orthologous genes from these three species, that have the same position relative to the reading frame, are grouped in a special orthologous intron table. Our program SNO.pl searches for conserved snoRNA motifs within MOID and reports all cases when characteristic snoRNA-like structures are present in all three orthologous introns of human, mouse and rat sequences. Here we report an example of the SNO.pl usage for searching a particular pattern of conserved C/D-box snoRNA motifs (canonical C- and D-boxes and the 6 nt long terminal stem). In this computer analysis, we detected 57 triplets of snoRNA-like structures in three mammals. Among them were 15 triplets that represented known C/D-box snoRNA genes. Six triplets represented snoRNA genes that had only been partially characterized in the mouse genome. One case represented a novel snoRNA gene, and another three cases, putative snoRNAs. Our programs are publicly available and can be easily adapted and/or modified for searching any conserved motifs within mammalian introns.
- Published
- 2005
- Full Text
- View/download PDF
41. The non-Watson-Crick base pairs and their associated isostericity matrices.
- Author
-
Leontis NB, Stombaugh J, and Westhof E
- Subjects
- Algorithms, Base Sequence, Hydrogen Bonding, Models, Molecular, RNA classification, RNA genetics, Sequence Homology, Terminology as Topic, Base Pairing, RNA chemistry
- Abstract
RNA molecules exhibit complex structures in which a large fraction of the bases engage in non-Watson-Crick base pairing, forming motifs that mediate long-range RNA-RNA interactions and create binding sites for proteins and small molecule ligands. The rapidly growing number of three-dimensional RNA structures at atomic resolution requires that databases contain the annotation of such base pairs. An unambiguous and descriptive nomenclature was proposed recently in which RNA base pairs were classified by the base edges participating in the interaction (Watson-Crick, Hoogsteen/CH or sugar edge) and the orientation of the glycosidic bonds relative to the hydrogen bonds (cis or trans). Twelve basic geometric families were identified and all 12 have been observed in crystal structures. For each base pairing family, we present here the 4 x 4 'isostericity matrices' summarizing the geometric relationships between the 16 pairwise combinations of the four standard bases, A, C, G and U. Whenever available, a representative example of each observed base pair from X-ray crystal structures (3.0 A resolution or better) is provided or, otherwise, theoretically plausible models. This format makes apparent the recurrent geometric patterns that are observed and helps identify isosteric pairs that co-vary or interchange in sequences of homologous molecules while maintaining conserved three-dimensional motifs.
- Published
- 2002
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.