357 results on '"Snipen Lars"'
Search Results
102. Mining for genotype-phenotype relations in Saccharomyces using partial least squares
- Author
-
Mehmood, Tahir, primary, Martens, Harald, additional, Sæbø, Solve, additional, Warringer, Jonas, additional, and Snipen, Lars, additional
- Published
- 2011
- Full Text
- View/download PDF
103. Complete Genome Sequence of the Commensal Enterococcus faecalis 62, Isolated from a Healthy Norwegian Infant
- Author
-
Brede, Dag Anders, primary, Snipen, Lars Gustav, additional, Ussery, David Wayne, additional, Nederbragt, Alexander J., additional, and Nes, Ingolf F., additional
- Published
- 2011
- Full Text
- View/download PDF
104. Smoothing Gene Expression Data with Network Information Improves Consistency of Regulated Genes
- Author
-
Dørum, Guro, primary, Snipen, Lars, additional, Solheim, Margrete, additional, and Saebo, Solve, additional
- Published
- 2011
- Full Text
- View/download PDF
105. Global Transcriptional Analysis of Spontaneous Sakacin P-Resistant Mutant Strains of Listeria monocytogenes during Growth on Different Sugars
- Author
-
Tessema, Girum Tadesse, primary, Møretrø, Trond, additional, Snipen, Lars, additional, Axelsson, Lars, additional, and Naterstad, Kristine, additional
- Published
- 2011
- Full Text
- View/download PDF
106. Comparative genomic analysis reveals significant enrichment of mobile genetic elements and genes encoding surface structure-proteins in hospital-associated clonal complex 2 Enterococcus faecalis
- Author
-
Solheim, Margrete, primary, Brekke, Mari C, additional, Snipen, Lars G, additional, Willems, Rob JL, additional, Nes, Ingolf F, additional, and Brede, Dag A, additional
- Published
- 2011
- Full Text
- View/download PDF
107. Global transcriptome response in Lactobacillus sakei during growth on ribose
- Author
-
McLeod, Anette, primary, Snipen, Lars, additional, Naterstad, Kristine, additional, and Axelsson, Lars, additional
- Published
- 2011
- Full Text
- View/download PDF
108. Comparative Genomic Analysis of Pathogenic and Probiotic Enterococcus faecalis Isolates, and Their Transcriptional Responses to Growth in Human Urine
- Author
-
Vebø, Heidi C., primary, Solheim, Margrete, additional, Snipen, Lars, additional, Nes, Ingolf F., additional, and Brede, Dag A., additional
- Published
- 2010
- Full Text
- View/download PDF
109. Genomic Characterization of Campylobacter jejuni Strain M1
- Author
-
Friis, Carsten, primary, Wassenaar, Trudy M., additional, Javed, Muhammad A., additional, Snipen, Lars, additional, Lagesen, Karin, additional, Hallin, Peter F., additional, Newell, Diane G., additional, Toszeghy, Monique, additional, Ridley, Anne, additional, Manning, Georgina, additional, and Ussery, David W., additional
- Published
- 2010
- Full Text
- View/download PDF
110. Standard operating procedure for computing pangenome trees
- Author
-
Snipen, Lars, primary and Ussery, David W., additional
- Published
- 2010
- Full Text
- View/download PDF
111. The Response ofEnterococcus faecalisV583 to Chloramphenicol Treatment
- Author
-
Aakra, Ågot, primary, Vebø, Heidi, additional, Indahl, Ulf, additional, Snipen, Lars, additional, Gjerstad, Øystein, additional, Lunde, Merete, additional, and Nes, Ingolf F., additional
- Published
- 2010
- Full Text
- View/download PDF
112. The Transcriptome of the Nosocomial Pathogen Enterococcus faecalis V583 Reveals Adaptive Responses to Growth in Blood
- Author
-
Vebø, Heidi C., primary, Snipen, Lars, additional, Nes, Ingolf F., additional, and Brede, Dag A., additional
- Published
- 2009
- Full Text
- View/download PDF
113. On the Origins of a Vibrio Species
- Author
-
Vesth, Tammi, primary, Wassenaar, Trudy M., additional, Hallin, Peter F., additional, Snipen, Lars, additional, Lagesen, Karin, additional, and Ussery, David W., additional
- Published
- 2009
- Full Text
- View/download PDF
114. Improved analysis of bacterial CGH data beyond the log-ratio paradigm
- Author
-
Snipen, Lars, primary, Nyquist, Otto L, additional, Solheim, Margrete, additional, Aakra, Ågot, additional, and Nes, Ingolf F, additional
- Published
- 2009
- Full Text
- View/download PDF
115. Rotation Testing in Gene Set Enrichment Analysis for Small Direct Comparison Experiments
- Author
-
Dørum, Guro, primary, Snipen, Lars, additional, Solheim, Margrete, additional, and Sæbø, Solve, additional
- Published
- 2009
- Full Text
- View/download PDF
116. Comparative genomics of Enterococcus faecalis from healthy Norwegian infants
- Author
-
Solheim, Margrete, primary, Aakra, Ågot, additional, Snipen, Lars G, additional, Brede, Dag A, additional, and Nes, Ingolf F, additional
- Published
- 2009
- Full Text
- View/download PDF
117. Microbial comparative pan-genomics using binomial mixture models
- Author
-
Snipen, Lars, primary, Almøy, Trygve, additional, and Ussery, David W, additional
- Published
- 2009
- Full Text
- View/download PDF
118. Diversity of Lactobacillus sakei strains investigated by phenotypic and genotypic methods
- Author
-
McLeod, Anette, primary, Nyquist, O. Ludvig, additional, Snipen, Lars, additional, Naterstad, Kristine, additional, and Axelsson, Lars, additional
- Published
- 2008
- Full Text
- View/download PDF
119. Mixture Models as a Method to Find Present and Divergent Genes in Comparative Genomic Hybridization Studies on Bacteria
- Author
-
Feten, Guri, primary, Almøy, Trygve, additional, Snipen, Lars, additional, Aakra, Ågot, additional, Nyquist, O. Ludvig, additional, and Aastveit, Are H., additional
- Published
- 2007
- Full Text
- View/download PDF
120. A Discussion concerning the Inclusion of Variety Effect when Analysis of Variance is Used to Detect Differentially Expressed Genes
- Author
-
Feten, Guri, primary, Aastveit, Are Halvor, additional, Snipen, Lars, additional, and Almøy, Trygve, additional
- Published
- 2007
- Full Text
- View/download PDF
121. Regression as a Method to Predict Copy Numbers in Comparative Genomic Hybridization Studies on Bacteria
- Author
-
Feten, Guri, primary, Almøy, Trygve, additional, Snipen, Lars, additional, Aakra, Ågot, additional, and Aastveit, Are H., additional
- Published
- 2006
- Full Text
- View/download PDF
122. Detection of divergent genes in microbial aCGH experiments
- Author
-
Snipen, Lars, primary, Repsilber, Dirk, additional, Nyquist, Ludvig, additional, Ziegler, Andreas, additional, Aakra, Ågot, additional, and Aastveit, Are, additional
- Published
- 2006
- Full Text
- View/download PDF
123. Transcriptional Response of Enterococcus faecalis V583 to Erythromycin
- Author
-
Aakra, Ågot, primary, Vebø, Heidi, additional, Snipen, Lars, additional, Hirt, Helmut, additional, Aastveit, Are, additional, Kapur, Vivek, additional, Dunny, Gary, additional, Murray, Barbara E., additional, and Nes, Ingolf F., additional
- Published
- 2005
- Full Text
- View/download PDF
124. Adjusted Prediction of Plant Height with Application to Greenhouse Grown Poinsettias
- Author
-
Sæbø, Solve, primary, Almøy, Trygve, additional, and Snipen, Lars, additional
- Published
- 2001
- Full Text
- View/download PDF
125. Transcriptomic and Functional Analysis of NaCl-Induced Stress in Enterococcus faecalis.
- Author
-
Solheim, Margrete, La Rosa, Sabina Leanti, Mathisen, Thomas, Snipen, Lars G., Nes, Ingolf F., and Brede, Dag Anders
- Subjects
GENETIC transcription ,ENTEROCOCCUS faecalis ,POLYSACCHARIDES ,ANTIGEN analysis ,ENTEROCOCCAL infections ,CELL envelope (Biology) - Abstract
The robust physiology of Enterococcus faecalis facilitates tolerance to various stresses. We here report the transcriptional response of E. faecalis V583 to growth in the presence of 6.5% NaCl. Among the early responses observed was an immediate down-regulation of mscL, accompanied by an up-regulation of genes predicted to be involved in uptake of extracellular potassium and glycine betaine. The high NaCl concentration also induced expression of chaperons and cell envelope related traits, such as the enterococcal polysaccharide antigen (epa) locus. Functional genetic analysis revealed reduced salt stress resistance in both epaB and epaE mutants. The reduced salt resistance phenotype associated with the epaB mutant was restored by complementation, hence demonstrating a role of Epa in the physiological robustness of E. faecalis. Furthermore, we demonstrate that Epa confers increased resistance towards multiple cell envelope stress-inducing factors. Accordingly, these findings delineate a potential link between the robust nature of E. faecalis and its ability to perform as a human pathogen, and provide a new perspective on the mechanisms by which Epa contributes to virulence. Notably, the high NaCl concentration also resulted in strict repression of the gelE-sprE operon and impaired gelatinase activity. We demonstrate that NaCl antagonize the GBAP-pheromone dependent induction in a concentration dependent manner. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
126. Predicting Plant Height of Greenhouse Grown Crops with a Polynomial Growth Rate Model
- Author
-
Snipen, Lars Gustav, primary
- Published
- 1998
- Full Text
- View/download PDF
127. A Genomic Virulence Reference Map of Enterococcus faecalisReveals an Important Contribution of Phage03-Like Elements in Nosocomial Genetic Lineages to Pathogenicity in a Caenorhabditis elegansInfection Model
- Author
-
La Rosa, Sabina Leanti, Snipen, Lars-Gustav, Murray, Barbara E., Willems, Rob J. L., Gilmore, Michael S., Diep, Dzung B., Nes, Ingolf F., and Brede, Dag Anders
- Abstract
ABSTRACTIn the present study, the commensal and pathogenic host-microbe interaction of Enterococcus faecaliswas explored using a Caenorhabditis elegansmodel system. The virulence of 28 E. faecalisisolates representing 24 multilocus sequence types (MLSTs), including human commensal and clinical isolates as well as isolates from animals and of insect origin, was investigated using C. elegansstrain glp-4(bn2ts); sek-1(km4). This revealed that 6 E. faecalisisolates behaved in a commensal manner with no nematocidal effect, while the remaining strains showed a time to 50% lethality ranging from 47 to 120 h. Principal component analysis showed that the difference in nematocidal activity explained 94% of the variance in the data. Assessment of known virulence traits revealed that gelatinase and cytolysin production accounted for 40.8% and 36.5% of the observed pathogenicity, respectively. However, coproduction of gelatinase and cytolysin did not increase virulence additively, accounting for 50.6% of the pathogenicity and therefore indicating a significant (26.7%) saturation effect. We employed a comparative genomic analysis approach using the 28 isolates comprising a collection of 82,356 annotated coding sequences (CDS) to identify 2,325 patterns of presence or absence among the investigated strains. Univariate statistical analysis of variance (ANOVA) established that individual patterns positively correlated (n= 61) with virulence. The patterns were investigated to identify potential new virulence traits, among which we found five patterns consisting of the phage03-like gene clusters. Strains harboring phage03 showed, on average, 17% higher killing of C. elegans(P= 4.4e−6). The phage03 gene cluster was also present in gelatinase-and-cytolysin-negative strain E. faecalisJH2-2. Deletion of this phage element from the JH2-2 clinical strain rendered the mutant apathogenic in C. elegans, and a similar mutant of the nosocomial V583 isolate showed significantly attenuated virulence. Bioinformatics investigation indicated that, unlike other E. faecalisvirulence traits, phage03-like elements were found at a higher frequency among nosocomial isolates. In conclusion, our report provides a valuable virulence map that explains enhancement in E. faecalisvirulence and contributes to a deeper comprehension of the genetic mechanism leading to the transition from commensalism to a pathogenic lifestyle.
- Published
- 2015
- Full Text
- View/download PDF
128. Interaction between phytoplankton and zooplankton in lakes
- Author
-
Seip, Knut L., primary and Snipen, Lars G., additional
- Published
- 1993
- Full Text
- View/download PDF
129. The Response of Enterococcus faecalis V583 to Chloramphenicol Treatment.
- Author
-
Aakra, Ågot, Vebø, Heidi, Indahl, Ulf, Snipen, Lars, Gjerstad, Øystein, Lunde, Merete, and Nes, Ingolf F.
- Subjects
ENTEROCOCCUS faecalis ,CHLORAMPHENICOL ,ANTIBIOTICS ,DRUG resistance in microorganisms ,DRUG tolerance ,PROTEIN synthesis ,AMINO acids ,BIOSYNTHESIS ,ENERGY metabolism - Abstract
Many Enterococcus faecalis strains display tolerance or resistance to many antibiotics, but genes that contribute to the resistance cannot be specified. The multiresistant E. faecalis V583, for which the complete genome sequence is available, survives and grows in media containing relatively high levels of chloramphenicol. No specific genes coding for chloramphenicol resistance has been recognized in V583. We used microarrays to identify genes and mechanisms behind the tolerance to chloramphenicol in V583, by comparison of cells treated with subinhibitory concentrations of chloramphenicol and untreated V583 cells. During a time course experiment, more than 600 genes were significantly differentially transcribed. Since chloramphenicol affects protein synthesis in bacteria, many genes involved in protein synthesis, for example, genes for ribosomal proteins, were induced. Genes involved in amino acid biosynthesis, for example, genes for tRNA synthetases and energy metabolism were downregulated, mainly. Among the upregulated genes were EF1732 and EF1733, which code for potential chloramphenicol transporters. Efflux of drug out of the cells may be one mechanism used by V583 to overcome the effect of chloramphenicol. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
130. Transcriptional Response of Enterococcus faecalisV583 to Erythromycin
- Author
-
Aakra, Ågot, Vebø, Heidi, Snipen, Lars, Hirt, Helmut, Aastveit, Are, Kapur, Vivek, Dunny, Gary, Murray, Barbara, and Nes, Ingolf F.
- Abstract
ABSTRACTA transcriptional profile of Enterococcus faecalisV583 (V583) treated with erythromycin is presented. This is the first study describing a complete transcriptional profile of Enterococcus. E. faecalisis a common and nonvirulent bacterium in many natural environments, but also an important cause of nosocomial infections. We have used a genome-wide microarray based on the genome sequence of V583 to study gene expression in cells exposed to erythromycin. V583 is resistant to relatively high concentrations of erythromycin, but growth is retarded by the treatment. The effect of erythromycin treatment on V583 was studied by a time course experiment; samples were extracted at five time points over a period of 90 min. A drastic change in gene transcription was seen with the erythromycin-treated cells compared to the untreated cells. Altogether, 260 genes were down-regulated at one or more time points, while 340 genes were up-regulated. Genes encoding hypothetical proteins and genes encoding transport and binding proteins were the two most dominating groups of differentially expressed genes. The gene encoding ermB(EFA0007) was expressed, but not differentially, which indicated that other genes are important for the survival and growth maintenance of V583 treated with erythromycin. One of these genes is a putative MsrC-like protein, which was up-regulated at all time points studied. Other specific genes that were found to be up-regulated were genes encoding ABC transporters and two-component regulatory systems, and these may be genes that are important for the specific response of V583 to erythromycin.
- Published
- 2005
- Full Text
- View/download PDF
131. Assessing time dependent changes in microbial composition of biological crime scene traces using microbial RNA markers.
- Author
-
Salzmann, Andrea Patrizia, Arora, Natasha, Russo, Giancarlo, Kreutzer, Susanne, Snipen, Lars, and Haas, Cordula
- Subjects
CRIME scenes ,RNA ,BODY fluids ,RNA sequencing ,SALIVA ,REGRESSION analysis - Abstract
Current body fluid identification methods do not reveal any information about the time since deposition (TsD) of biological traces, even though determining the age of traces could be crucial for the investigative process. To determine the utility of microbial RNA markers for TsD estimation, we examined RNA sequencing data from five forensically relevant body fluids (blood, menstrual blood, saliva, semen, and vaginal secretion) over seven time points, ranging from fresh to 1.5 years. One set of samples was stored indoors while another was exposed to outdoor conditions. In outdoor samples, we observed a consistent compositional shift, occurring after 4 weeks: this shift was characterized by an overall increase in non-human eukaryotic RNA and an overall decrease in prokaryotic RNA. In depth analyses showed a high fraction of tree, grass and fungal signatures, which are characteristic for the environment the samples were exposed to. When examining the prokaryotic fraction in more detail, three bacterial phyla were found to exhibit the largest changes in abundance, namely Actinobacteria, Proteobacteria and Firmicutes. More detailed analyses at the order level were done using a Lasso regression analysis to find a predictive subset of bacterial taxa. We found 26 bacterial orders to be indicative of sample age. Indoor samples did not reveal such a clear compositional change at the domain level: eukaryotic and prokaryotic abundance remained relatively stable across the assessed time period. Nonetheless, a Lasso regression analysis identified 32 bacterial orders exhibiting clear changes over time, enabling the prediction of TsD. For both indoor and outdoor samples, a larger number (around 60%) of the bacterial orders identified as indicative of TsD are part of the Actinobacteria, Proteobacteria and Firmicutes. In summary, we found that the observed changes across time are not primarily due to changes associated with body fluid specific bacteria but mostly due to accumulation of bacteria from the environment. Orders of these environmental bacteria could be evaluated for TsD prediction , considering the location and environment of the crime scene. However, further studies are needed to verify these findings, determine the applicability across samples, replicates, donors, and other variables, and also to further assess the effect of different seasons and locations on the samples. • We present the first study evaluating microbial RNA markers for TsD estimation. • We tracked time dependent changes in microbial composition of biological crime scene traces. • RNA sequencing data from forensically relevant body fluids exposed to indoor/outdoor conditions and aged up to 1.5 years were analyzed. • Changes across time were mostly due to accumulation of bacteria from the environment. • Candidate bacterial taxa suitable to predict sample age were identified. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
132. Assessing time dependent changes in microbial composition of biological crime scene traces using microbial RNA markers
- Author
-
Salzmann, Andrea P., Arora, Natasha, Russo, Giancarlo, Kreutzer, Susanne, Snipen, Lars, and Haas, Cordula
- Subjects
Body fluids ,13. Climate action ,Microbial composition ,Time since deposition (TsD) ,Transcriptome analysis ,16. Peace & justice ,Forensics - Abstract
Current body fluid identification methods do not reveal any information about the time since deposition (TsD) of biological traces, even though determining the age of traces could be crucial for the investigative process. To determine the utility of microbial RNA markers for TsD estimation, we examined RNA sequencing data from five forensically relevant body fluids (blood, menstrual blood, saliva, semen, and vaginal secretion) over seven time points, ranging from fresh to 1.5 years. One set of samples was stored indoors while another was exposed to outdoor conditions. In outdoor samples, we observed a consistent compositional shift, occurring after 4 weeks: this shift was characterized by an overall increase in non-human eukaryotic RNA and an overall decrease in prokaryotic RNA. In depth analyses showed a high fraction of tree, grass and fungal signatures, which are characteristic for the environment the samples were exposed to. When examining the prokaryotic fraction in more detail, three bacterial phyla were found to exhibit the largest changes in abundance, namely Actinobacteria, Proteobacteria and Firmicutes. More detailed analyses at the order level were done using a Lasso regression analysis to find a predictive subset of bacterial taxa. We found 26 bacterial orders to be indicative of sample age. Indoor samples did not reveal such a clear compositional change at the domain level: eukaryotic and prokaryotic abundance remained relatively stable across the assessed time period. Nonetheless, a Lasso regression analysis identified 32 bacterial orders exhibiting clear changes over time, enabling the prediction of TsD. For both indoor and outdoor samples, a larger number (around 60%) of the bacterial orders identified as indicative of TsD are part of the Actinobacteria, Proteobacteria and Firmicutes. In summary, we found that the observed changes across time are not primarily due to changes associated with body fluid specific bacteria but mostly due to accumulation of bacteria from the environment. Orders of these environmental bacteria could be evaluated for TsD prediction, considering the location and environment of the crime scene. However, further studies are needed to verify these findings, determine the applicability across samples, replicates, donors, and other variables, and also to further assess the effect of different seasons and locations on the samples., Forensic Science International: Genetics, 53
133. METASEED: a novel approach to full-length 16S rRNA gene reconstruction from short read data.
- Author
-
Philip, Melcy, Rudi, Knut, Ormaasen, Ida, Angell, Inga Leena, Pettersen, Ragnhild, Keeley, Nigel B., and Snipen, Lars-Gustav
- Subjects
- *
RIBOSOMAL RNA , *SHOTGUN sequencing , *DATABASES , *GENES , *DATA analysis - Abstract
Background: With the emergence of Oxford Nanopore technology, now the on-site sequencing of 16S rRNA from environments is available. Due to the error level and structure, the analysis of such data demands some database of reference sequences. However, many taxa from complex and diverse environments, have poor representation in publicly available databases. In this paper, we propose the METASEED pipeline for the reconstruction of full-length 16S sequences from such environments, in order to improve the reference for the subsequent use of on-site sequencing. Results: We show that combining high-precision short-read sequencing of both 16S and full metagenome from the same samples allow us to reconstruct high-quality 16S sequences from the more abundant taxa. A significant novelty is the carefully designed collection of metagenome reads that matches the 16S amplicons, based on a combination of uniqueness and abundance. Compared to alternative approaches this produces superior results. Conclusion: Our pipeline will facilitate numerous studies associated with various unknown microorganisms, thus allowing the comprehension of the diverse environments. The pipeline is a potential tool in generating a full length 16S rRNA gene database for any environment. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
134. Gut bacteria at 6 months of age are associated with immune cell status in 1‐year‐old children.
- Author
-
Nilsen, Morten, Nygaard, Unni Cecilie, Brodin, Petter, Carlsen, Karin Cecilie Lødrup, Fredheim, Cecilie, Haugen, Guttorm, Hedlin, Gunilla, Jonassen, Christine Monceyron, Jonsmoen, Unni Lise Albertsdottir, Lakshmikanth, Tadepally, Nordlund, Björn, Olin, Axel, Rehbinder, Eva Maria, Skjerven, Håvard O., Snipen, Lars, Staff, Anne Cathrine, Söderhäll, Cilla, Vettukattil, Riyas, and Rudi, Knut
- Subjects
- *
IMMUNITY , *SHORT-chain fatty acids , *BACTEROIDES fragilis , *GUT microbiome , *BIFIDOBACTERIUM longum , *BACTERIA - Abstract
Age‐related gut bacterial changes during infancy have been widely studied, but it remains still unknown how these changes are associated with immune cell composition. This study's aim was to explore if the temporal development of gut bacteria during infancy prospectively affects immune cell composition. Faecal bacteria and short‐chain fatty acids were analysed from 67 PreventADALL study participants at four timepoints (birth to 12 months) using reduced metagenome sequencing and gas chromatography. Immune cell frequencies were assessed using mass cytometry in whole blood samples at 12 months. The infants clustered into four groups based on immune cell composition: clusters 1 and 2 showed a high relative abundance of naïve cells, cluster 3 exhibited increased abundance of classical‐ and non‐classical monocytes and clusters 3 and 4 had elevated neutrophil levels. At all age groups, we did observe significant associations between the gut microbiota and immune cell clusters; however, these were generally from low abundant species. Only at 6 months of age we observed significant associations between abundant (>8%) species and immune cell clusters. Bifidobacterium adolescentis and Porphyromonadaceae are associated with cluster 1, while Bacteroides fragilis and Bifidobacterium longum are associated with clusters 3 and 4 respectively. These species have been linked to T‐cell polarization and maturation. No significant correlations were found between short‐chain fatty acids and immune cell composition. Our findings suggest that abundant gut bacteria at 6 months may influence immune cell frequencies at 12 months, highlighting the potential role of gut microbiota in shaping later immune cell composition. Our manuscript's main contribution to the field is the novel finding of associations between gut bacteria at different age categories to immune cell composition at 1 year, which supports the notion that gut microbiota composition affects immune cell development during the first year of life. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
135. Metagenome-mining indicates an association between bacteriocin presence and strain diversity in the infant gut.
- Author
-
Ormaasen, Ida, Rudi, Knut, Diep, Dzung B., and Snipen, Lars
- Subjects
- *
INFANTS , *GUT microbiome , *HUMAN microbiota , *ROLE conflict , *ANTIMICROBIAL peptides - Abstract
Background: Our knowledge about the ecological role of bacterial antimicrobial peptides (bacteriocins) in the human gut is limited, particularly in relation to their role in the diversification of the gut microbiota during early life. The aim of this paper was therefore to address associations between bacteriocins and bacterial diversity in the human gut microbiota. To investigate this, we did an extensive screening of 2564 healthy human gut metagenomes for the presence of predicted bacteriocin-encoding genes, comparing bacteriocin gene presence to strain diversity and age. Results: We found that the abundance of bacteriocin genes was significantly higher in infant-like metagenomes (< 2 years) compared to adult-like metagenomes (2–107 years). By comparing infant-like metagenomes with and without a given bacteriocin, we found that bacteriocin presence was associated with increased strain diversities. Conclusions: Our findings indicate that bacteriocins may play a role in the strain diversification during the infant gut microbiota establishment. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
136. Human gut microbiome : GA-map coverage and genomic categories in HumGut
- Author
-
Johansen, Iben Amalie Lund, Snipen, Lars Gustav, Hiseni, Pranvera, Lars Snipen, and Pranvera Hiseni
- Abstract
Microorganisms in the gut are proven to affect health in multiple ways, and they have been studied extensively for the last decade. HumGut is a database containing microorganisms from the healthy human gut. The company Genetic Analysis (GA) has a GA-map dysbiosis test that aims to characterize microorganisms in the human gut to identify dysbiosis patients. This thesis provides an overview of the taxonomic and functional categorization of the human gut, gained by analyzing HumGut, and how well the GA-map spans the healthy human gut by investigating how it overlaps the database. Prodigal, Tax4FUN, and Diamond were used to acquire functional profiles for the HumGut genomes. GA-map identifies microorganisms by matching GA-map probes to the 16S sequences in their genomes. Most genomes in HumGut do not have an identified 16S sequence. Still, the results show that the GA-map spans the majority of higher taxonomic ranks in HumGut. There are both taxonomic and functional parts missed by the map. The functional space missed by the GA-map seems to be relative similar to matched functional regions. The genome categories in the HumGut collection are also evaluated in this thesis. The genomes in HumGut come from two sources: RefSeq and UHGG. Most of the UHGG genomes are Metagenome Assembled Genomes (MAGs). The findings show that MAGs may have lower quality on the 16S sequence than RefSeq-genomes, of which the latter is expected to have higher quality. The results also suggest that UHGG-genomes might have functional differences from other genomes. Det er bevist at mikroorganismer i tarmen påvirker helsen på flere måter, og det har blitt forsket mye på disse organismene det siste tiåret. HumGut er en database over mikroorganismer fra frisk human tarm. Bedriften Genetic Analysis (GA) har en GA-map dysbiose test med mål om å karakterisere mikroorganismer i human tarm for å identifisere dysbiose pasienter. Denne masteroppgaven presenterer en oversikt over human tarm gjennom taksonomisk og funksjonell kategorisering, oppnådd ved å analysere HumGut, og hvor godt GA-map spenner frisk human tarm ved å undersøke hvordan den identifiserer HumGutgenomene. Prodigal, Tax4FUN og Diamond ble brukt for å få funksjonelle profiler. GA-map identifiserer mikroorganismer ved å matche med 16S-sekvensen til genomet. De fleste HumGut-genome har ikke en identifisert 16S-sekvens. Likevel viser resultatene at GAmap dekker over de fleste høyere taksonomiske nivåene i HumGut. Det er både taksonomiske og funksjonelle regioner som ikke dekkes. De funksjonelle regionene som ikke blir dekket av GA-kartet, ser ut til å være hovedsakelig like funksjonelle regioner som blir dekket. En annen del av oppgaven er å evaluere genomkategoriene i HumGut. Genomene i HumGut kommer fra to kilder: RefSeq, som har validerte genomer, og UHGG. De fleste genomene i UHGG er Metagenom sammensatte genomer (MAGs). Funnene kan tyde på at disse genomene har lavere kvalitet på 16S sekvensene enn RefSeq-genomene, som er forventet å ha høyere kvalitet. Resultatene kan også indikere at UHGG-genomer kan ha funksjonelle forskjeller fra andre genomer. M-KB
- Published
- 2023
137. Comparison of reduced metagenome and 16S rRNA gene sequencing for determination of genetic diversity and mother-child overlap of the gut associated microbiota.
- Author
-
Ravi, Anuradha, Avershina, Ekaterina, Angell, Inga Leena, Ludvigsen, Jane, Manohar, Prasanth, Padmanaban, Sumathi, Nachimuthu, Ramesh, Snipen, Lars, and Rudi, Knut
- Subjects
- *
RIBOSOMAL RNA , *METAGENOMICS , *GUT microbiome , *GENOME size , *MOTHER-infant relationship - Abstract
Use of the 16S rRNA gene in microbiota studies is limited by the lack of taxonomic and functional resolution. High resolution analyses are particularly important for understanding transmission and persistence of bacteria. The aim of our work was therefore to compare a novel reduced metagenome sequencing (RMS) approach with 16S rRNA gene sequencing to determine both the metagenome genetic diversity and the mother-to-child sharing of the microbiota in a cohort of 17 mother-child pairs. We found that although both approaches gave comparable results with respect to sample separation and taxonomy, RMS gave higher resolution and the potential for genomic-/functional assignment. Using RMS we estimated that the metagenome size increased from about 60 Mbp for 4-day-old children to about 225 Mbp for mothers. The 4-day-old children shared 7% of the metagenome sequences with the mothers, while the metagenome sequence sharing was >30% among the mothers. We found 15 genomes shared across >50% of the mothers, of which 10 belonged to Clostridia . Only Bacteroides showed a direct mother-child association, with B. vulgatus being abundant in both 4-day-old children and mothers. For the functional assignments, we identified a significant association between antibiotic usage during labor, and quantity of Fosfomycin resistance genes. In conclusion, our results show a higher functional and taxonomic resolution for RMS compared to 16S rRNA gene sequencing, where RMS enabled a detailed description of mother to child gut microbiota transmission – supporting a late recruitment of most gut bacteria and an effect of antibiotic treatment during labor on infant antibiotic resistance gene patterns. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
138. Classification and quantification of the contents of fishmeal using hash-based classification algorithms on metagenomic sequence data
- Author
-
Ekeland, Halvor, Vinje, Hilde, Snipen, Lars, and Ortiz, Jamie
- Abstract
With the abundance of sequence data and the development of effective computer algorithms, new practical applications using sequence data are possible, among the authentication of the contents of food and feed. This study aims to explore if metagenomic sequence data can be used to classify and quantify the contents of fishmeal samples. In this study the hash-based classification software Kraken2+Bracken was used to classify and quantify the contents of fishmeal samples. The study utilized simulated sequences as a best-case scenario and real DNA sequence data and assess the ability of Kraken2+Bracken to classify and quantify the contents with the measurements purity, completeness, Bray-Curtis dissimilarity and Principal component analysis. The results indicate that Kraken2+Bracken is able to classify the DNA sequence in the samples, but that the accuracy of the classifications and quantifications are dependent on the threshold and settings used, and for the samples based on real meatgenomic data, sample preparation likely affect the accuracy. Med den store mengden sekvensdata og utviklingen av effektive data algoritmer, nye praktiske anvendelser av sekvensdata er mulige, blant dem autentisering av innholdet i mat og fòr. Denne studien har som mål å utforske om metagenomisk sekvensdata kan brukes til å klassifisere og kvantifisere innholdet i fiskemel blandinger. I denne studien ble den hash-baserte klassifiserings programvaren Kraken2+Bracken som ble brukt til å klassifisere og kvantifisere innholdet i fiskemel blandinger. Studien benyttet simulert data som et ideal-tilfelle og ekte DNA sekvensdata og vurderte Kraken3+Bracken sin kvantifiserings og kvantifiseringsevne ved å bruke målene purity, completeness, Bray-Curtis ulikhet og principal component analysis. Resultatene antyder at Kraken2+Bracken er i stand til å klassifisere DNA sekvensene i blandingene, men at nøyaktigheten til klassifiseringen og kvantifiseringen er avhengig av terskelverdier og innstilingene brukt, og for blandingene basert på ekte metagenomisk data er det sannsynlig at hvordan prøvene tilberedes påvirker nøyaktigheten. M-KB
- Published
- 2022
139. Genomanalyse av Salmonella enterica subsp. enterica serovar Typhimurium i Norge
- Author
-
Obstfelder, Andrea, Snipen, Lars Gustav, Lagesen, Karin, and Sekse, Camilla
- Abstract
Salmonella enterica subspecies enterica serovar Typhimurium, referred to as S. Typhimurium in this master`s thesis, is one of the most common pathogenic serovariants of Salmonella spp. S. Typhimurium has been isolated in a wide range of hosts from the wild fauna. These hosts are believed to be responsible for several of the human cases of Salmonella infections in Norway. The main goal of this thesis is to test the hypothesis that wild animals are a potential source of infection from S. Typhimurium among production animals in Norway. In this project a totalt of 127 S. Typhimurium isolates taken between 2001 and 2021 from wild animals, production animals and sport- and family animals were selected from the laboratory inventory at the Norwegian Veterinary Institute. The information present in the genome sequences of the S. Typhimurium isolates were examined using whole genome sequencing (WGS). WGS of pathogens is an important tool to improve the understanding of how contagious diseases spread between hosts. Due to limitations in the sequencing technology, the DNA are fragmented and only up to 300 bp are sequenced, this is then called reads. Bioinformatic tools were used for quality check a large number of the raw data from the sequencing before the original genom was attempted reconstructed using de novo SPAdes assembly. ” Multilocus Sequence Typing” (MLST) were used in further analysis to determine allele- variations based on 7 housekeeping genes, and each bacterial isolate were assigned one sequence type where ST19 and ST34 were the most common. Then the evolutionary similarity between isolates within the same ST was assessed by SNP analysis. Using phylogenetic analysis we were able to observe clusters with evolutionary similarity between wild animals and production animals, which indicates that transmission of S. Typhimurium had taken place. Based on the results from the applied dataset we found several of these cases, which gives a strong indication that wild animals can be the source of infection for S. Typhimurium among production animals in Norway. Salmonella enterica subspecies enterica serovar Typhimurium, omtalt som S. Typhimurium i denne masteroppgaven, er en av de vanligste patogene serovariantene av Salmonella spp. S. Typhimurium har blitt isolert hos et bredt spekter av verter fra villfaunaen som er antatt ansvarlig for flere av de humane smittetilfellene av Salmonella i Norge. Hovedmålet med oppgaven var å teste hypotesen om at ville dyr er en potensiell smittekilde for S. Typhimurium blant produksjonsdyr i Norge. Det ble i dette prosjektet plukket ut totalt 127 S. Typhimurium isolater tatt mellom 2001 og 2021 fra ville dyr, produksjonsdyr og sport- og familiedyr fra laboratoriebeholdningen hos Veterinærinstituttet. Ved hjelp av helgenomsekvensering (WGS) ble informasjon som fantes i genomsekvensen til S. Typhimurium isolatene undersøkt. WGS av patogener er et viktig verktøy for å forbedre forståelsen av smittsomme sykdommer som sprer seg mellom verter. Begrensninger i sekvenseringsteknologien gjør at DNAet kuttes opp i kortere fragmenter, og kun korte fragmenter opp til 300 bp sekvenseres, også kalt reads. Ved hjelp av bioinformatiske verktøy ble store mengder rådata fra sekvenseringen kvalitetsjekket før det opprinnelige genomet ble forsøkt rekonstruert ved hjelp av de novo SPAdes assembly. For videre analyser ble ” Multilocus Sequence Typing” (MLST) brukt for bestemmelse av allele-variasjoner basert på 7 husholdningsgener, og hvert bakterieisolat fikk tildelt en sekvenstype der ST19 og ST34 var mest vanlige. Deretter ble evolusjonær likhet mellom isolatene innenfor samme ST vurdert ved SNP analyse. For fylogenetiske analyser ble det observert klustre med en evolusjonær likhet mellom ville dyr og produksjonsdyr som indikerer smitteoverføring. Basert på resultatene fra det brukte datasettet var det i flere tilfeller sterke indikasjoner på at ville dyr kan være smittekilden for S. Typhimurium blant produksjonsdyr i Norge. M-KB
- Published
- 2022
140. Bruk av Liquid Array Diagnostics (LAD) som verktøy for analyse av sammensetning og funksjon av tarmens mikrobiota
- Author
-
Hiseni, Pranvera, Rudi, Knut, Wilson, Robert C., Snipen, Lars, Hegge, Finn Terje, and Furu, Kari
- Subjects
LAD ,HumGut ,Gut microbiota - Abstract
The microbial species residing in the human gut exercise vital functions for the host. They produce different metabolites that are crucial for human wellbeing. A variety of such molecules mediate signalling along the gut-brain axis, regulate host gene expression, develop and maintain intestinal and blood-brain barriers, are involved in lipogenesis and gluconeogenesis, in addition to taking part in a wide range of other functions. A deviation in the intestinal flora composition is mechanistically linked to various health disorders, including inflammatory bowel disease (IBD), irritable bowel syndrome (IBS), type 2 diabetes, Parkinson’s and Alzheimer’s disease. Such a deviation, known as dysbiosis, represents an unbalanced composition where certain microbial groups are promoted in the expense of others. These species are considered as promising biomarkers, valuable for disease diagnosis, monitoring and treatment. Of particular interest are those markers that can additionally unveil phenotypical characteristics, such as the overall level of short-chain fatty acids (SCFA) in human gut samples. The prospect of discovering additional markers is high, considering that the content of healthy human guts worldwide is not fully characterized. The field of gut microbiota is at a stage of switching focus to clinically relevant species, particularly to their rapid detection, as a means of offering simple diagnostic solutions with increased availability and accessibility. This affords putting biological findings to practical clinical use, which is often not feasible with current species identification platforms. With the intention of filling this need, the main aim of this thesis was to develop a targeted approach for rapid gut microbiota testing based on the novel Liquid Array Diagnostics (LAD) technology. LAD is adopted to target 16S rRNA gene sites unique for specific microbial groups. Requiring only commonplace qPCR instrumentation, it can detect up to 30 distinct microbial markers in a single-tube multiplex reaction within a working day. LAD’s utility in microbiome studies was validated by testing the prevalence and abundance of 15 microbial markers in 541 samples collected from mothers and their children, as reported in Paper I. Paper II, on the other hand, describes a comprehensive human gut prokaryotic genome collection, HumGut. It was built after screening thousands of human gut metagenome samples, collected from healthy people worldwide, for the presence of any high quality publicly available prokaryote genome. The main rationale for creating it was to enable functional studies through LAD-based 16S targeting. It was demonstrated that HumGut, as a reference database, aids whole genome sequencing studies by significantly increasing the number of mapped sequencing reads, thus elevating the potential for an improved taxonomic classification. However, as it is, HumGut exhibits limited practical use for 16S rRNA gene targeted approaches like LAD. This because most of the representative genomes either lack this gene, or the quality of 16S sequences is compromised (addressed in Paper III). Nonetheless, LAD was exploited to infer a segment of human gut microbiota functionality by targeting the 16S rRNA gene. This was performed based on data retrieved from 16S rDNA sequencing and short-chain fatty acid (SCFA) measurements. LAD’s value in classifying samples with disturbed SCFA ratios (namely high propionate-to-butyrate ratio) - an indication of functional dysbiosis - is presented in Paper IV. Taken together, this thesis introduces two tools, LAD and HumGut, both pointing at the direction of simplified human gut functional analysis via gut microbial composition detection. De mikrobielle artene som bor i menneskets tarm utøver vitale funksjoner for verten. De produserer forskjellige metabolitter avgjørende for menneskers helse. En rekke av disse molekylene deltar i prosesser som signaltransduksjon langs tarm-hjerne-aksen, regulering av genekspresjon, utvikling og vedlikehold av tarm- og blod-hjerne-barrieren, lipogenese og glukoneogenese, samt en rekke andre funksjoner. Avvik i tarmflorasammensetningen kan knyttes til mange ulike sykdommer og lidelser, inkludert irritabel tarm (IBS), innflammatorisk tarmsykdom (IBD), type -2 diabetes, Parkinsons og Alzheimers sykdom. Slike avvik, kjent som dysbiose, kjennetegnes av at visse mikrobielle grupper fremmes på bekostning av andre. Disse artene har potensiale som biomarkører, og kan slik være verdifulle for sykdomsdiagnose og behandling. Spesielt lovende er biomarkører i tarm som kan knyttes opp mot phenotypiske trekk, slik som kortkjedede fettsyrer (SCFA). Det antas at enda flere slike arter vil identifiseres i fremtiden, da mikrobiota-komposisjonen i sunne tarmer ikke er fullt karakterisert globalt. Mikrobiota-feltet er nå på et stadium hvor fokuset endres fra eksplorative studier til identifisering av klinisk relevante arter. Det vil da bli spesielt viktig med metoder som muliggjør rask deteksjon, da dette vil innebære enkle diagnostiske løsninger tilgjengelig for praktisk klinisk bruk, noe som ofte ikke er gjennomførbart med dagens artsidentifikasjonsplattformer. Hovedmålet med denne oppgaven var å utvikle en målrettet tilnærming for rask tarmmikrobiotatesting basert på det nye Liquid Array Diagnostics (LAD)-prinsippet. LAD er utviklet for å identifisere sekvenser i 16S rRNA-genet som er unike for spesifikke mikrobielle markører. Metoden krever kun et vanlig qPCR-instrument og kan oppdage inntil 30 forskjellige mikrobielle markører i étt enkelt test-rør i løpet av en arbeidsdag. LADs nytteverdi i mikrobiomstudier ble validert ved å teste forekomsten av 15 mikrobielle markører i 541 prøver samlet fra mødre og deres barn, som rapportert i Artikel I. Artikel II beskriver genereringen av en omfattende prokaryot genomsamling av menneskets tarm. Den ble bygget ved å screene tusenvis av metagenom fra tarmprøver samlet inn fra friske mennesker over hele verden. Metagenomene ble screenet for tilstedeværelse av alle offentlig tilgjengelige prokaryote genom. Sekvenser av dårlig kvalitet ble fjernet mens alle andre sekvenser ble samlet i én stor referansedatabase, HumGut. Hovedmålet med å lage denne referansedatabasen var å muliggjøre LAD-baserte funksjonelle studier. Det ble vist at HumGut fungerer som et nyttig verktøy for full-genoms sekvenseringsstudier ved å øke antallet artlagte sekvenseringsavlesninger betydelig, da dette gir forbedret taksonomisk klassifisering. HumGut har imidlertid begrenset nytteverdi for 16S rRNA-baserte metoder som LAD. Dette fordi de fleste genom i samlingen enten mangler dette genet fullstendig, eller har for dårlig kvalitet på 16S-sekvensene (behandlet i Artikel III). Til tross for begrensningene knyttet til 16S rRNA-genet i HumGut, ble LAD benyttet til å utvikle en 16S rDNA-basert test for måling av menneskelig tarmmikrobiotafunksjonalitet. Dette ble utført basert på data hentet fra 16S-sekvensering og målinger av kortkjedede fettsyrer (SCFA). LADs evne til å klassifisere prøver med forstyrret SCFA-forhold (nemlig høyt propionat-tilbutyrat-forhold) - en indikasjon på funksjonell dysbiose - er presentert i Artikel IV. Til sammen presenterer denne oppgaven to verktøy, LAD og HumGut, som begge peker i retning av forenklet funksjonell analyse av menneskelig tarm via deteksjon av mikrobiell sammensetning i tarmen.
- Published
- 2022
141. DeepGene : gene finding based on upstream sequence data
- Author
-
Almestrand, Trude Haug, Liland, Kristian, and Snipen, Lars Gustav
- Abstract
Genome annotation is a process of identifying functional elements along a genome. By correctly locating and finding the information stored within a sequence, knowledge about structural features and functional roles can be revealed. With the number of sequences doubling approximately every 18 months, there is a severe need for automatic annotation of genomes. Today there are many different annotation software tools available, however they produce far from perfect results. Here a new project, DeepGene, is presented. Using data from the RefSeq prokaryotic database we have started an effort to improve on the prokaryotic genome annotation process. This thesis presents the initial efforts of said improvement with a focus on discerning between coding and non-coding sequences using upstream sequence data from open reading frames. Using the 15 prokaryotic genomes available in the RefSeq database, upstream data was retrieved and processed into two datasets, and were then trained using several popular classification models. The performance of the models was compared with a standard annotation tool to create a general baseline for our model. The models created from the datasets show many similarities in terms of metrics. With the K-mer data having a mean precision at 0.22 and mean recall of 0.74, and the sequential data having a mean precision at 0.30 and mean recall at 0.77. Both the datasets performed worse than our standard annotation software with a mean recall and precision of, respectively, 0.83 and 0.82. As far as upstream sequences are concerned, the models managed to pull all the information available from both datasets. The initial results gave limited information in terms of classification and motif presence indicating that other attributes surrounding the genome should be looked at for a possible improvement on the annotation problem. An ideal step forward is to expand into a pipeline so that the complex false negative classifications may be explained. Genomannotering er en prosess som skal identifisere funksjonelle elementer langs et genom. Ved å finne informasjonen lagret i en sekvens kan man avsløre kunnskap rundt strukturelle og funksjonelle roller. Ettersom antall sekvenser dobler rundt hver 18. måned er det et sterkt behov for automatisk gjenkjenning av genomer. I dag er det mange tilgjengelige annoteringsverktøy, men de produserer langt fra perfekte resultater. Et nytt prosjekt ved navn DeepGene er her presentert. Ved hjelp av data fra RefSeq prokaryotiske database har vi startet et forsøk på å forbedre den prokaryotiske annoteringsprosessen. I denne oppgaven presenteres begynnelsen på forbedringen. Hovedfokuset var å skille mellom kodende og ikke-kodende sekvenser ved hjelp av sekvensdata oppstrøms for åpne leserammer. Ved å benytte seg av de 15 prokaryotiske genomene tilgjengelig i RefSeq databasen, ble oppstrømsdata hentet og prosessert til to datasett. Disse datasettene ble videre trent ved hjelp av populære klassifiseringsmodeller. Ytelsen til disse modellene ble sammenlignet med et standard annoteringsverktøy for å lage et generelt utgangspunkt til vår modell. Modellene trent av datasettet viser mange likheter når det kommer til ytelse. K-mer datasettet hadde en gjennomsnittlig presisjon på 0.22 og nøyaktighet på 0.74. Videre hadde det sekvensielle datasettet en gjennomsnittlig presisjon på 0.30 og en nøyaktighet på 0.77. Begge datasettene hadde dårligere resultater enn vårt standard annoteringsverktøy som hadde en gjennomsnittlig nøyaktighet og presisjon på henholdsvis 0.83 og 0.82. Når det kommer til oppstrømssekvenser klarer modellene å hente ut all informasjon tilgjengelig fra datasettene. Resultatene ga begrenset med informasjon når det kommer til klassifisering og motif-tilstedeværelse. Denne begrensningen indikerer at andre attributter rundt genomet bør undersøkes for en mulig forbedring rundt annoteringsproblemet. Et ideelt steg videre er å utvide modellene til en «pipeline» slik at komplekse falske negative klassifiseringer kan bli forklart. M-KB
- Published
- 2022
142. Benchmarking of sequence-based machine learning methods for prediction of antibody polyreactivity
- Author
-
Mathisen, Ingvild Frøberg, Robert, Philippe, Greiff, Victor, Snipen, Lars Gustav, and Akbar, Rahmad
- Abstract
Antibodies are of great importance as therapeutics and a prerequisite for their success is high specificity to a desired target. Some antibodies however, termed polyreactive, are able to recognize and react to a diverse range of antigens. Therapeutic antibodies have been successful in the treatment of cancers, autoimmune conditions as well as infectious diseases. The process of developing a new treatment requires a lot of resources and time. Many candidates are not approved following clinical trials, one of the negative indicators of success is polyreactivity. In silico methods that predict polyreactivity can aid in prioritizing good candidates for further development. In this work we benchmarked machine learning approaches for predicting polyreactivity, using data from the simulation framework Absolut!, with various combinations of sequence encodings and machine learning architectures. We found that polyreactivity could be predicted with close to 90% macro f1 using logistic-regression and amino acid composition. Marginal but significant improvements in macro f1 score were obtained by inclusion of positional information using logistic regression and feed-forward neural network with one hidden layer was able to achieve macro f1 of ~93%. The best logistic regression model and neural network were able to generalize to sequences that were more than 50% different to the sequences used for training the model. Further we looked into interpretability of the models and size of possible sequence motifs, suggesting that short motifs can be predictive of polyreactivity. Our results demonstrate how different approaches compare in predicting polyreactivity given large amounts of data and information on the CDRH3 sequence. As of now, experimental (in vitro) datasets containing information on polyreactivity have been limited in size. Larger datasets are being produced and in the future, we anticipate that our findings will be relevant for guiding the development of machine learning models for predicting polyreactivity. Antistoffer er av stor betydning som legemidler og en forutsetning for at de skal lykkes er høy spesifisitet til et ønsket mål. Noen antistoffer, kalt polyreaktive antistoffer, er imidlertid i stand til å gjenkjenne og reagere på flere ulike antigener. Terapeutiske antistoffer har hatt stor suksess i behandling av kreft, autoimmune sykdommer, så vel som infeksjonssykdommer. Prosessen med å utvikle en ny antistoffbehandling krever imidlertid mye ressurser og tid. Mange kandidater blir ikke godkjent etter kliniske studier og polyreaktivitet kan være en av årsakene til dette. In silico metoder som predikerer polyreaktivitet kan hjelpe til med å prioritere kandidater med større sannsynlighet for å lykkes. I dette arbeidet har vi sammenlignet maskinlærings metoder med varierende måter å fremstille sekvensdata og arkitektur for å forutsi polyreaktivitet ved hjelp av data fra en antistoff-antigen bindings simulator kalt Absolut!. Vi fant at polyreaktivitet kunne forutsies med nær 90% makro f1 score ved bruk av logistisk regresjon og aminosyresammensetning alene. Marginal men signifikant økning i f1 score ble registret når informasjon om aminosyrenes posisjon var inkludert ved bruk av logistisk regresjon. Nevrale nettverk var i stand til å oppnå makro f1 på ~93%. Den logistiske regresjonsmodellen og det beste nevrale nettverket var i stand til å generalisere til sekvenser som var mer enn 50 % forskjellige fra sekvensene som ble brukt for trening av modellen. Videre så vi på tolkbarhet av modellene og størrelsen på eventuelle sekvens motiver, og fant at korte motiver kan være prediktive for polyreaktivitet. Resultatene våre viser hvordan forskjellige tilnærminger sammenlignes for å forutsi polyreaktivitet gitt store mengder data og informasjon om CDRH3-sekvensen. Tidligere har de fleste eksperimentelle (in vitro) datasett brukt til å etterforske polyreaktivitet vært av begrenset størrelse. Det produseres nå større datasett og i fremtiden regner vi med at funnene våre vil være relevante for styre utvikling av maskinlæringsmodeller for å forutsi polyreaktivitet. M-BIAS
- Published
- 2022
143. Reduced Metagenomic Sequencing Resolution of the Human Gut Using HumGut as a Reference Database
- Author
-
Olsen, Helene Drennan and Snipen, Lars
- Abstract
The gut microbiome is a source of great genetic diversity in humans and is prevalent in topics concerning human health and diseases. Today, most compositions of human gut microbiomes are estimated on 16S amplicon sequencing and shotgun sequencing data. Although 16S amplicon sequencing is cheap and fast, it does not gain a taxonomic resolution down to species level like the more costly and computationally expensive shotgun sequencing. Reduced Metagenomic Sequencing (RMS) has been suggested as an alternative method, as it has been shown to gain a taxonomic resolution down to species level and is more cost-effective per sample than shotgun sequencing. However, it requires a good reference database. The HumGut genome collection was used as a reference database in this thesis, and it contains >30 000 genomes prevalent in a healthy human gut. This thesis investigated the taxonomic resolution RMS can achieve using HumGut as a reference database. Further, a dataset from the PreventADALL-study was provided, containing human gut samples from mother-child pairs. The data was used to investigate whether a vertical transmission of Bifidobacterium could be detected in the samples. The HumGut dataset was divided into subgroups based on the genomes’ genus, as genome clustering of all genomes prove to be too computationally demanding. The genomes within most genera were able to cluster to a condition value below 10, meaning the lowest possible taxonomic resolution was obtained but with the advantage of more stable abundancy estimates which is important when investigating vertical transmission. Results indicated that the genera containing more RMS fragments per genome returned a higher taxonomic resolution, some even down to strain level, while genera with fewer RMS fragments per genome returned a lower resolution, some displaying a resolution not much lower than genus level. Bifidobacterium were of the genera that obtained a lower taxonomic resolution, resulting in a >98% reduction of genomes after genome clustering. As a result, no correlation of Bifidobacterium distribution was found between mother and child.
- Published
- 2022
144. Cost of resistance : stress tolerance and gene expression in wild type and mutant strains of Staphylococcus haemolyticus LMGT4071
- Author
-
Solberg, Kristin Kleivan, Snipen, Lars-Gustav, and Diep, Dzung Bao
- Abstract
Several studies have shown that a bacteria’s development of resistance to an antimicrobial substance often comes with a cost related to their general fitness and adaptation to environmental changes. Further research is needed to increase the understanding on how bacteria with mutations in specific genes associated with stress tolerance, regulate their gene expression in response to stress. The excessive use of antibiotics for treatment of pathogenic infections in patients have led to an increasing number of bacteria resistant to antibiotics. As it is crucial that we find alternative treatment methods, researchers are looking into the possibility of utilizing bacterial produced bacteriocins in combination with other antimicrobials. In the search for appropriate medicine, we need a broader knowledge of the bacteria’s response mechanisms related to the treatments. Through this master’s thesis a mutant’s loss of fitness due to a resistance mutation will be examined. The study will attempt to give new inside on how a mutant of a bacterium regulates its gene expression when exposed to stress compared to the wild type within the same strain. Specific for this study, mutants within five bacterial strains with mutations connected to the function of the stress-response related gene rseP, are inspected in the laboratory. The bacteria were exposed to the bacteriocin Garvicin KS which do not use RseP as a receptor. RNA from samples of S. haemolyticus LMGT4071 will be isolated and sequenced to analyse the gene expression in wild type and mutant, with and without exposure to GarKS. The experiments with S. haemolyticus in the laboratory show that the mutant tolerates the exposure to GarKS poorer than the WT, illustrating that the development of resistance comes with a fitness cost. The transcriptome analysis show that the mutant responds to the stress by altering its gene expression to a large extent, in contrast to the WT which turns on only a few genes. Flere studier har vist at en bakteries utvikling av resistens mot et antimikrobielt stoff ofte kommer med en kostnad knyttet til deres generelle fitness og tilpasning til miljøendringer. Det er behov for videre forskning for å øke forståelsen av hvordan bakterier med mutasjoner i spesifikke gener knyttet til stresstoleranse regulerer genuttrykket sitt i respons på stress. Overdreven bruk av antibiotika for behandling av patogene infeksjoner hos pasienter har ført til at et økende antall bakterier er resistente mot antibiotika. Siden det er avgjørende at vi finner alternative behandlingsmetoder, ser forskerne på muligheten for å utnytte bakteriell produserte bacteriociner i kombinasjon med andre antimikrobielle midler. I jakten på passende medisin trenger vi bredere kunnskap om bakterienes responsmekanismer knyttet til behandlingene. Gjennom denne masteroppgaven vil en mutants tap av fitness grunnet en resistensmutasjon undersøkes. Studien vil prøve å belyse hvordan en mutant av en bakterie regulerer genuttrykket sitt når det utsettes for stress sammenlignet med villtypen innenfor samme stamme. Spesifikt for denne studien inspiseres mutanter innenfor fem bakteriestammer med mutasjoner knyttet til funksjonen til det stress-responsrelaterte genet rseP i laboratoriet. Bakteriene ble eksponert for bacteriocinet Garvicin KS som ikke bruker RseP som reseptor. RNA fra prøver av S. haemolyticus LMGT4071 vil bli isolert og sekvensert for å analysere genuttrykket i villtype og mutant, med og uten eksponering av GarKS. Forsøkene med S. haemolyticus i laboratoriet viser at mutanten tåler eksponeringen av GarKS dårligere enn villtypen, noe som illustrerer at utvikling av resistens kommer med en fitnesskostnad. Transkriptomanalysen viser at mutanten reagerer på stresset ved å endre genuttrykket sitt i stor grad, i motsetning til villtypen som kun slår på noen få gener. M-BIAS
- Published
- 2022
145. Evaluation of machine learning approaches for prediction of protein coding genes in prokaryotic DNA sequences
- Author
-
Sandvik, Yva Jacob, Liland, Kristian Hovde, and Snipen, Lars
- Abstract
According to the National Human Genome Research Institute the amount of genomic data generated on a yearly basis is constantly increasing. This rapid growth in genomic data has led to a subsequent surge in the demand for efficient analysis and handling of said data. Gene prediction involves identifying the areas of a DNA sequence that code for proteins, also called protein coding genes. This task falls within the scope of bioinformatics, and there has been surprisingly little development in this field of study, over the past years. Despite there being sufficient state-of-the-art gene prediction tools, there is still room for improvement in terms of efficiency and accuracy. Advances made within the field of gene prediction can, among other things, aid the medical and pharmaceutical industry, as well as impact environmental and anthropological research. Machine learning techniques such as the Random Forest classifiers and Artificial Neural Networks (ANN) have proved successful at the task of gene prediction. In this thesis one deep learning model and two other machine learning models were tested. The first model implemented was the established Random Forest classifier. When it comes to the use of ensemble methods, such as the Random Forest classifier, feature engineering is critical for the success of such models. The exploration of different feature selection and extraction techniques underpinned its relevance. It also showed that feature importance varies greatly among genomes, and revealed possibilities that can be further explored in future work. The second model tested was the ensemble method Extreme Gradient Boosting (XGBoost), which served as a good competitor to the Random Forest classifier. Finally, a Recurrent Neural Network (RNN) was implemented. RNNs are known to be good with handling sequential data, therefore it seemed like a good candidate for gene prediction. The 15 prokaryotic genomes used to train the models were extracted from the NCBI genome database. Each model was tasked with classifying sub-sequences of the genomes, called open reading frames (ORFs), as either protein coding ORFs, or non-coding ORFs. One challenge when preparing these datasets was that the number of protein coding ORFs was very small compared to the number of non-coding ORFs. Another problem encountered in the dataset was that protein coding ORFs in general are longer than non-coding ORFs, which can bias the models to simply classify long ORFs as protein coding, and short ORFs as non-coding. For these reasons, two datasets for each genome were created, taking each imbalance into account. The models were trained, tuned and tested on both datasets for all genomes, and a combination of genomes. The models were evaluated with regard to accuracy, precision and recall. The results show that all three methods have potential and attained somewhat similar performance scores. Despite the fact that both time and data were limited during model development, they still yielded promising results. Considering there are several parameters that have not yet been tuned in all models, many possibilities for further research remain. The fact that a relatively simple RNN architecture performed so well, and has no requirement for feature engineering, shows great promise for further applications in gene prediction, and possibly other fields in bioinformatics. M-DV
- Published
- 2022
146. The GDR : a novel approach to detect large-scale genomic sequence patterns
- Author
-
Bull, Nora Borge, Snipen, Lars Gustav, and Rayner, Simon
- Abstract
Utvikling av ny sekvenseringsteknologi de to siste tiårene har tillatt dypere dykk ned i de biomolekylære aspektene ved menneskets oppskrift. Hel-genom data fra flere hundre tusen mennesker er allerede tilgjengelig, men hvordan den økende mengden informasjon kan settes sammen til meningsfull funksjonell tolkning er komplisert og krever nye metoder. MikroRNA - mRNA interaksjoner utgjør et enormt genreguleringsnettverk som er vanskelig å predikere, selv for dagens beste maskinlæringsalgoritmer(1). Disse ikke-kodende elementene er involvert i omtrent alle cellulære prosesser i mennesket, primært via delvis komplementær baseparing mellom mikroRNA og mRNA, men det er mye vi ikke forstår av dette nettverkets betydning i vår biologi (2-4). Nye metoder er nødvendige for å kunne utforske genetisk variasjon i dette nettverket, som kan gi nye innblikk i hvordan genene våre reguleres. Her presenteres «The Group Diversity Ratio» (GDR) som en ny målenhet til å møte denne utfordringen. GDR kan kvantifisere evolusjonær struktur av variasjon i store mengder genomisk sekvensdata, med et resultat som kan statistisk valideres. Metoden baserer seg på å måle gruppe-struktur i et distanse-basert fylogenetisk tre av sekvensdata, for forhåndsdefinerte grupper av «blader» i treet. Gruppene representerer en egenskap som kan relateres til sekvensdataen, og det undersøkes til hvilken grad det finnes en sammenheng mellom de to. Metoden kan primært brukes til å raskt skaffe overblikk over store mengder genomisk sekvensdata, som kan gi verdifulle innblikk til videre etterforskning. For å teste metoden ble GDR brukt til å identifisere variasjon assosiert med etniske populasjoner i 3’UTR data fra «The 1000 Genomes Project» (1KGP). 1KGP var det første store prosjektet som adresserte den etniske skjevheten som nå finnes i genom-databaser, og som utgjør en god grunn til å utforske etnisk genetisk variasjon (5). I tillegg til identifikasjon av mer enn 1000 3’UTR sekvenser som inneholder signifikant etnisitet-spesifikk variasjon, viser dette studiet GDR-metodens høye potensial til å undersøke genetisk variasjon i stor skala. The emergence of new sequencing technologies over the past two decades has enabled us to dive deeper into the biomolecular aspect of the human recipe. Entire genomes from several hundred thousand people are already accessible, but how to interpretate the connections between the blueprints and the phenotypes are complicated, even for the best developed machine learning algorithms. Prediction of the microRNA-mRNA targeting network is a classic example, which is involved with gene regulation of all living cell processes. These non-coding features make up complex networks of interactions, where microRNAs primarily target 3’UTRs through partial complementary base-pairing. Thus, the challenge to investigate patterns in such large-scaled genomic sequence data requires new approaches. The Group Diversity Ratio (GDR) metric is presented here as a novel approach to aid in this challenge. The GDR quantifies genome-wide structure in large-scale sequence data with a statistically testable result. Patterns are measured for a group feature that may be related to variation in sequence samples, based on phylogenetic distance estimations. It opens opportunities to quickly gain insights into genomic regions of interests and used to guide further research. To demonstrate the use of the GDR metric, ethnicity-associated variation patterns in more than 1000 human 3’UTRs was identified with the GDR. The study set was from 1000 Genomes project, which was the first major effort to address the problem of ethnic bias in genetic studies and contained more than 2500 whole-genome sequences from 26 ethnic lineages. In addition to detecting significantly distinct 3’UTR elements for ethnic populations, the key finding of this study was the high potentials of the GDR to facilitate more high-throughput characterization of genomic sequence data. M-BIAS
- Published
- 2021
147. Transcriptomic and Functional Analysis of NaCl-Induced Stress in Enterococcus faecalis.
- Author
-
Solheim, Margrete, La Rosa, Sabina Leanti, Mathisen, Thomas, Snipen, Lars G., Nes, Ingolf F., and Brede, Dag Anders
- Subjects
- *
GENETIC transcription , *ENTEROCOCCUS faecalis , *POLYSACCHARIDES , *ANTIGEN analysis , *ENTEROCOCCAL infections , *CELL envelope (Biology) - Abstract
The robust physiology of Enterococcus faecalis facilitates tolerance to various stresses. We here report the transcriptional response of E. faecalis V583 to growth in the presence of 6.5% NaCl. Among the early responses observed was an immediate down-regulation of mscL, accompanied by an up-regulation of genes predicted to be involved in uptake of extracellular potassium and glycine betaine. The high NaCl concentration also induced expression of chaperons and cell envelope related traits, such as the enterococcal polysaccharide antigen (epa) locus. Functional genetic analysis revealed reduced salt stress resistance in both epaB and epaE mutants. The reduced salt resistance phenotype associated with the epaB mutant was restored by complementation, hence demonstrating a role of Epa in the physiological robustness of E. faecalis. Furthermore, we demonstrate that Epa confers increased resistance towards multiple cell envelope stress-inducing factors. Accordingly, these findings delineate a potential link between the robust nature of E. faecalis and its ability to perform as a human pathogen, and provide a new perspective on the mechanisms by which Epa contributes to virulence. Notably, the high NaCl concentration also resulted in strict repression of the gelE-sprE operon and impaired gelatinase activity. We demonstrate that NaCl antagonize the GBAP-pheromone dependent induction in a concentration dependent manner. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
148. Exploration of multivariate analysis in microbial coding sequence modeling.
- Author
-
Mehmood, Tahir, Bohlin, Jon, Bråthen Kristoffersen, Anja, S�b�, Solve, Warringer, Jonas, and Snipen, Lars
- Subjects
- *
GENES , *MICROBIAL genomes , *MULTIVARIATE analysis , *NUCLEOTIDE sequence , *BIOINFORMATICS , *COMPUTER software - Abstract
Background: Gene finding is a complicated procedure that encapsulates algorithms for coding sequence modeling,identification of promoter regions, issues concerning overlapping genes and more. In the present study we focus on coding sequence modeling algorithms; that is, algorithms for identification and prediction of the actual coding sequences from genomic DNA. In this respect, we promote a novel multivariate method known as Canonical Powered Partial Least Squares (CPPLS) as an alternative to the commonly used Interpolated Markov model (IMM). Comparisons between the methods were performed on DNA, codon and protein sequences with highly conserved genes taken from several species with different genomic properties. Results: The multivariate CPPLS approach classified coding sequence substantially better than the commonly used IMM on the same set of sequences. We also found that the use of CPPLS with codon representation gave significantly better classification results than both IMM with protein (p < 0.001) and with DNA (p < 0.001). Further, although the mean performance was similar, the variation of CPPLS performance on codon representation was significantly smaller than for IMM (p < 0.001). Conclusions: The performance of coding sequence modeling can be substantially improved by using an algorithm based on the multivariate CPPLS method applied to codon or DNA frequencies. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
149. Growth Rate-Dependent Control in Enterococcus faecalis: Effects on the Transcriptome and Proteome, and Strong Regulation of Lactate Dehydrogenase.
- Author
-
Mehmeti, Ibrahim, Faergestad, Ellen M., Bekker, Martijn, Snipen, Lars, Nes, Ingolf F., and Holo, Helge
- Subjects
- *
ENTEROCOCCUS faecalis , *PROTEOMICS , *LACTATE dehydrogenase , *POLYMERASE chain reaction , *MOLECULAR biology - Abstract
Enterococcus faecalis V583 was grown in a glucose-limited chemostat at three different growth rates (0.05, 0.15, and 0.4 h-1). The fermentation pattern changed with growth rate, from a mostly homolactic profile at a high growth rate to a fermentation dominated by formate, acetate, and ethanol production at a low growth rate. A number of amino acids were consumed at the lower growth rates but not by fast-growing cells. The change in metabolic profile was caused mainly by decreased flux through lactate dehydrogenase. The transcription of ldh-1, encoding the principal lactate dehydrogenase, showed very strong growth rate dependence and differed by three orders of magnitude between the highest and the lowest growth rates. Despite the increase in ldh-1 transcript, the content of the Ldh-1 protein was the same under all conditions. Using microarrays and quantitative PCR, the levels of 227 gene transcripts were found to be affected by the growth rate, and 56 differentially expressed proteins were found by proteomic analyses. Few genes or proteins showed a growth rate-dependent increase or decrease in expression across the whole range of conditions, and many showed a maximum or minimum at the middle growth rate (i.e., 0.15 h-1). For many gene products, a discrepancy between transcriptomic and proteomic data were seen, indicating posttranscriptional regulation of expression. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
150. Investigating prevalence and geographical distribution of Mycoplasma sp. in the gut of Atlantic salmon (Salmo salar L.)
- Author
-
Raudstein, Mari, Rudi, Knut, and Snipen, Lars-Gustav
- Subjects
animal diseases ,Gut microbiota - Abstract
The fish gut microbiota has gotten considerable attention in recent years, and the microbes harboring the intestine of fish are thought to grant the host various effects related to size, metabolism, feeding behavior, and immune response. A Mycoplasma species has been discovered as highly abundant in the salmon gut. However, the resident strain has not yet been isolated. Knowledge regarding its colonization and the impact it may have on the host is, therefore, limited. This study aimed to map the prevalence and geographical distribution of Mycoplasma in the salmon gut and discover its potential role as part of the gut microbiota. Salmon gut content was sampled for both cultivation purposes and direct DNA analyses in this project. Samples were collected from two salmon farms in Norway, Skjervøy (n = 23) and Bømlo (n = 19), and one in Chile (n = 20). A selection of Bømlo samples (n = 10) was cultivated in enriched growth medium. The prevalence of Mycoplasma at different geographical sites was investigated by analyzing the bacterial composition in the Bømlo and Chile samples using 16S rRNA gene sequencing. Moreover, selected samples from Bømlo (n = 4), Skjervøy (n = 7), and Chile (n = 1) were further processed for whole-genome shotgun sequencing to obtain genomic information of the salmon-associated Mycoplasma. Mycoplasma was found abundantly in Norwegian salmon but was not detected in Chilean salmon. Thus, in this study, we observed a geographical difference (p = 0.00023) in the mycoplasmas’ prevalence in the gut of farmed Atlantic salmon. The underlying reasons for the absence of Mycoplasma in Chilean salmon must be further investigated to explain our findings. Further, we found that the salmon-associated Mycoplasma’s DNA was most frequently classified as M. penetrans, which may suggest relatedness between these species. Whether the salmon Mycoplasma exhibits pathogenic or protective characteristics is not known. However, given the seemingly large prevalence of mycoplasmas in salmon, it is likely they exist in the gut microbiota as commensals. Further research is necessary to discover potential negative or positive impacts the salmon-associated Mycoplasma might have on the physiology and immunology of the fish. Tarmmikrobiotaen til fisk har fått auka merksemd dei siste åra, og mikroorganismane som utgjer denne har truleg innverknad på verten relatert til storleik, metabolisme, fôringsåtferd og immunrespons. Ein Mykoplasma-art har blitt oppdaga i rikelege mengder i laksetarmen. Arten er enno ikkje isolert, og det er lite kunnskap om denne bakterien si kolonisering, og om verknaden den kan ha på verten. Målet med denne studien var difor å kartlegge utbreiinga, samt den geografiske fordelinga av Mykoplasma i laksetarm, og å undersøke kva rolle denne bakterien potensielt har som del av tarmmikrobiotaen. M-BIOTEK
- Published
- 2020
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.