50 results on '"Gerben Menschaert"'
Search Results
2. Table S5 from Deletion 6q Drives T-cell Leukemia Progression by Ribosome Modulation
- Author
-
Jean Soulier, François Sigaux, Hugues de Thé, Claude Gazin, Jean-Jacques Diaz, Emmanuelle Clappier, Hervé Dombret, André Baruchel, Jules P. Meijerink, Pieter Van Vlierberghe, Tom Taghon, Isabelle André-Schmutz, Gerben Menschaert, Wouter Van Loocke, Jessica G. Buijs-Gladdines, Willem K. Smits, Marika Pla, Lucie Hernandez, Godelieve Meunier, Delphine Briot, Gabriel Thérizols, Marc Delord, Samuel Quentin, Frédéric Catez, Eulalia Genesca, David Avran, Tiama El-Chaar, and Stéphanie Gachet
- Abstract
Differentially expressed genes in Xenograted T-ALL shSYNCRIP-SNHG5 versus ShCtrl
- Published
- 2023
- Full Text
- View/download PDF
3. Table S4 from Deletion 6q Drives T-cell Leukemia Progression by Ribosome Modulation
- Author
-
Jean Soulier, François Sigaux, Hugues de Thé, Claude Gazin, Jean-Jacques Diaz, Emmanuelle Clappier, Hervé Dombret, André Baruchel, Jules P. Meijerink, Pieter Van Vlierberghe, Tom Taghon, Isabelle André-Schmutz, Gerben Menschaert, Wouter Van Loocke, Jessica G. Buijs-Gladdines, Willem K. Smits, Marika Pla, Lucie Hernandez, Godelieve Meunier, Delphine Briot, Gabriel Thérizols, Marc Delord, Samuel Quentin, Frédéric Catez, Eulalia Genesca, David Avran, Tiama El-Chaar, and Stéphanie Gachet
- Abstract
Differentially translated transcripts - translatome
- Published
- 2023
- Full Text
- View/download PDF
4. Supplementary Data from Deletion 6q Drives T-cell Leukemia Progression by Ribosome Modulation
- Author
-
Jean Soulier, François Sigaux, Hugues de Thé, Claude Gazin, Jean-Jacques Diaz, Emmanuelle Clappier, Hervé Dombret, André Baruchel, Jules P. Meijerink, Pieter Van Vlierberghe, Tom Taghon, Isabelle André-Schmutz, Gerben Menschaert, Wouter Van Loocke, Jessica G. Buijs-Gladdines, Willem K. Smits, Marika Pla, Lucie Hernandez, Godelieve Meunier, Delphine Briot, Gabriel Thérizols, Marc Delord, Samuel Quentin, Frédéric Catez, Eulalia Genesca, David Avran, Tiama El-Chaar, and Stéphanie Gachet
- Abstract
Supp Fig. S1-6, Supp Tables S1-S3, Supp Methods and References
- Published
- 2023
- Full Text
- View/download PDF
5. Data from Deletion 6q Drives T-cell Leukemia Progression by Ribosome Modulation
- Author
-
Jean Soulier, François Sigaux, Hugues de Thé, Claude Gazin, Jean-Jacques Diaz, Emmanuelle Clappier, Hervé Dombret, André Baruchel, Jules P. Meijerink, Pieter Van Vlierberghe, Tom Taghon, Isabelle André-Schmutz, Gerben Menschaert, Wouter Van Loocke, Jessica G. Buijs-Gladdines, Willem K. Smits, Marika Pla, Lucie Hernandez, Godelieve Meunier, Delphine Briot, Gabriel Thérizols, Marc Delord, Samuel Quentin, Frédéric Catez, Eulalia Genesca, David Avran, Tiama El-Chaar, and Stéphanie Gachet
- Abstract
Deletion of chromosome 6q is a well-recognized abnormality found in poor-prognosis T-cell acute lymphoblastic leukemia (T-ALL). Using integrated genomic approaches, we identified two candidate haploinsufficient genes contiguous at 6q14, SYNCRIP (encoding hnRNP-Q) and SNHG5 (that hosts snoRNAs), both involved in regulating RNA maturation and translation. Combined silencing of both genes, but not of either gene alone, accelerated leukemogeneis in a Tal1/Lmo1/Notch1-driven mouse model, demonstrating the tumor-suppressive nature of the two-gene region. Proteomic and translational profiling of cells in which we engineered a short 6q deletion by CRISPR/Cas9 genome editing indicated decreased ribosome and mitochondrial activities, suggesting that the resulting metabolic changes may regulate tumor progression. Indeed, xenograft experiments showed an increased leukemia-initiating cell activity of primary human leukemic cells upon coextinction of SYNCRIP and SNHG5. Our findings not only elucidate the nature of 6q deletion but also highlight the role of ribosomes and mitochondria in T-ALL tumor progression.Significance:The oncogenic role of 6q deletion in T-ALL has remained elusive since this chromosomal abnormality was first identified more than 40 years ago. We combined genomic analysis and functional models to show that the codeletion of two contiguous genes at 6q14 enhances malignancy through deregulation of a ribosome–mitochondria axis, suggesting the potential for therapeutic intervention.This article is highlighted in the In This Issue feature, p. 1494
- Published
- 2023
- Full Text
- View/download PDF
6. TIS Transformer: remapping the human proteome using deep learning
- Author
-
Zahra McVey, Jim Clauwaert, Ramneek Gupta, and Gerben Menschaert
- Subjects
Computer science ,business.industry ,Deep learning ,Applied Mathematics ,Reading frame ,Computational biology ,Computer Science Applications ,Transcriptome ,Open reading frame ,Structural Biology ,Proteome ,Human proteome project ,Genetics ,Coding region ,Artificial intelligence ,business ,Molecular Biology ,Transformer (machine learning model) - Abstract
The correct mapping of the proteome is an important step towards advancing our understanding of biological systems and cellular mechanisms. Methods that provide better mappings can fuel important processes such as drug discovery and disease understanding. Currently, true determination of translation initiation sites is primarily achieved byin vivoexperiments. Here we propose TIS Transformer, a deep learning model for the determination of translation start sites solely utilizing the information embedded in the transcript nucleotide sequence. The method is built upon deep learning techniques first designed for natural language processing. We prove this approach to be best suited for learning the semantics of translation, outperforming previous approaches by a large margin. We demonstrate that limitations in the model performance is primarily due to the presence of low quality annotations against which the model is evaluated against. Advantages of the method are its ability to detect key features of the translation process and multiple coding sequences on a transcript. These include micropeptides encoded by short Open Reading Frames, either alongside a canonical coding sequence or within long non-coding RNAs. To demonstrate the use of our methods, we applied TIS Transformer to remap the full human proteome.
- Published
- 2023
- Full Text
- View/download PDF
7. CpG Transformer for imputation of single-cell methylomes
- Author
-
Gaetan De Waele, Willem Waegeman, Gerben Menschaert, and Jim Clauwaert
- Subjects
Statistics and Probability ,Technology and Engineering ,AcademicSubjects/SCI01060 ,Computer science ,computer.software_genre ,Biochemistry ,Task (project management) ,Sliding window protocol ,Imputation (statistics) ,DNA METHYLATION ,Molecular Biology ,Transformer (machine learning model) ,Interpretability ,Biology and Life Sciences ,Genome Analysis ,Original Papers ,Computer Science Applications ,Computational Mathematics ,Task (computing) ,Computational Theory and Mathematics ,CpG site ,DNA methylation ,Data mining ,LANDSCAPES ,Transfer of learning ,computer - Abstract
Motivation The adoption of current single-cell DNA methylation sequencing protocols is hindered by incomplete coverage, outlining the need for effective imputation techniques. The task of imputing single-cell (methylation) data requires models to build an understanding of underlying biological processes. Results We adapt the transformer neural network architecture to operate on methylation matrices through combining axial attention with sliding window self-attention. The obtained CpG Transformer displays state-of-the-art performances on a wide range of scBS-seq and scRRBS-seq datasets. Furthermore, we demonstrate the interpretability of CpG Transformer and illustrate its rapid transfer learning properties, allowing practitioners to train models on new datasets with a limited computational and time budget. Availability and implementation CpG Transformer is freely available at https://github.com/gdewael/cpg-transformer. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2021
- Full Text
- View/download PDF
8. Editorial: sORF Encoded Peptides in Health and Disease
- Author
-
Joseph Rothnagel, Harsha Gowda, and Gerben Menschaert
- Subjects
Genetics ,Molecular Medicine ,Genetics (clinical) - Published
- 2022
9. Multiple solvent elution, a method to counter the effects of coelution and ion suppression in LC-MS analysis in bottom up proteomics
- Author
-
Harshavardhan Budamgunta, Ajay Anand Kumar, Geert Baggerman, Kurt Boonen, Karin Schildermans, Evelyne Maes, Hanny Willems, and Gerben Menschaert
- Subjects
Proteomics ,Proteome ,Clinical Biochemistry ,Ion suppression in liquid chromatography–mass spectrometry ,Peptide ,Mass spectrometry ,030226 pharmacology & pharmacy ,01 natural sciences ,Biochemistry ,Analytical Chemistry ,03 medical and health sciences ,0302 clinical medicine ,Tandem Mass Spectrometry ,Liquid chromatography–mass spectrometry ,Humans ,Shotgun proteomics ,Biology ,chemistry.chemical_classification ,Chromatography ,Chemistry ,Elution ,010401 analytical chemistry ,Cell Biology ,General Medicine ,0104 chemical sciences ,Bottom-up proteomics ,Peptides ,Chromatography, Liquid ,HeLa Cells - Abstract
On average a human cell type expresses around 10,000 different protein coding genes synthesizing all the different molecular forms of the protein product (proteoforms) found in a cell. In a typical shotgun bottom up proteomic approach, the proteins are enzymatically cleaved, producing several 100,000 s of different peptides that are analyzed with liquid chromatography-tandem mass spectrometry (LC-MSMS). One of the major consequences of this high sample complexity is that coelution of peptides cannot be avoided. Moreover, low abundant peptides are difficult to identify as they have a lower chance of being selected for fragmentation due to ion-suppression effects and the semi-stochastic nature of the precursor selection in data-dependent shotgun proteomic analysis where peptides are selected for fragmentation analysis one-by-one as they elute from the column. In the current study we explore a simple novel approach that has the potential to counter some of the effect of coelution of peptides and improves the number of peptide identifications in a bottom-up proteomic analysis. In this method, peptides from a HeLa cell digest were eluted from the reverse phase column using three different elution solvents (acetonitrile, methanol and acetone) in three replicate reversed phase LC-MS/MS shotgun proteomic analysis. Results were compared with three technical replicates using the same solvent, which is common practice in proteomic analysis. In total, we see an increase of up to 10% in unique protein and up to 30% in unique peptide identifications from the combined analysis using different elution solvents when compared to the combined identifications from the three replicates of the same solvent. In addition, the overlap of unique peptide identifications common in all three LC-MS analyses in our approach is only 23% compared to 50% in the replicates using the same solvent. The method presented here thus provides an easy to implement method to significantly reduce the effects of coelution and ion suppression of peptides and improve protein coverage in shotgun proteomics. Data are available via ProteomeXchange with identifier PXD011908 .
- Published
- 2019
- Full Text
- View/download PDF
10. A community-driven roadmap to advance research on translated open reading frames detected by Ribo-seq
- Author
-
Jose Manuel Gonzalez, Pavel V. Baranov, Juan Pablo Couso, Jonathan M. Mudge, Ariel A. Bazzini, Xavier Roucou, Mark Gerstein, Maria Jesus Martin, Uwe Ohler, Jian Chen, Nicholas T. Ingolia, Thomas F. Martinez, Yuchen Yang, Jonathan S. Weissman, Norbert Hubner, John R. Prensner, Michele Magrane, Paul Flicek, Jorge Ruiz-Orera, Alan Saghatelian, Jana Felicitas Schulz, Brunet, Elspeth A. Bruford, Gerben Menschaert, M. Mar Albà, Adam Frankish, S. van Heesch, and Anne-Ruxandra Carvunis
- Subjects
animal structures ,GENCODE ,Computer science ,HUGO Gene Nomenclature Committee ,Biological database ,Ensembl ,natural sciences ,Human genome ,Computational biology ,Ribosome profiling ,ORFS ,UniProt - Abstract
Ribosome profiling (Ribo-seq) has catalyzed a paradigm shift in our understanding of the translational ‘vocabulary’ of the human genome, discovering thousands of translated open reading frames (ORFs) within long non-coding RNAs and presumed untranslated regions of protein-coding genes. However, reference gene annotation projects have been circumspect in their incorporation of these ORFs due to uncertainties about their experimental reproducibility and physiological roles. Yet, it is indisputable that certain Ribo-seq ORFs make stable proteins, others mediate gene regulation, and many have medical implications. Ultimately, the absence of standardized ORF annotation has created a circular problem: while Ribo-seq ORFs remain unannotated by reference biological databases, this lack of characterisation will thwart research efforts examining their roles. Here, we outline the initial stages of a community-led effort supported by GENCODE / Ensembl, HGNC and UniProt to produce a consolidated catalog of human Ribo-seq ORFs.
- Published
- 2021
- Full Text
- View/download PDF
11. Explainable Transformer Models for Functional Genomics in Prokaryotes
- Author
-
Jim Clauwaert, Willem Waegeman, and Gerben Menschaert
- Subjects
Artificial neural network ,Computer science ,business.industry ,Deep learning ,Genome project ,Computational biology ,DNA sequencing ,DNA binding site ,Transcription (biology) ,Consensus sequence ,Artificial intelligence ,business ,Functional genomics ,Transformer (machine learning model) - Abstract
The effectiveness of deep learning methods can be largely attributed to the automated extraction of relevant features from raw data. In the field of functional genomics, this generally comprises the automatic selection of relevant nucleotide motifs from DNA sequences. To benefit from automated learning methods, new strategies are required that unveil the decision-making process of trained models. In this paper, we present several methods that can be used to gather insights on biological processes that drive any genome annotation task. This work builds upon a transformer-based neural network framework designed for prokaryotic genome annotation purposes. We find that the majority of sub-units (attention heads) of the model are specialized towards identifying DNA binding sites. Working with a neural network trained to detect transcription start sites in E. coli, we successfully characterize both locations and consensus sequences of transcription factor binding sites, including both well-known and potentially novel elements involved in the initiation of the transcription process.
- Published
- 2020
- Full Text
- View/download PDF
12. Mass spectrometric evidence for neuropeptide-amidating enzymes in
- Author
-
Jan Watteyne, Niels Ringstad, Gerben Menschaert, Liliane Schoofs, H. Robert Horvitz, Kurt Boonen, Steven J. Husson, Sven Van Bael, Wouter De Haes, and Liesbet Temmerman
- Subjects
0301 basic medicine ,Genomics and Proteomics ,Neuropeptide ,Biochemistry ,Mixed Function Oxygenases ,03 medical and health sciences ,0302 clinical medicine ,Multienzyme Complexes ,Tandem Mass Spectrometry ,Animals ,Humans ,Amino Acid Sequence ,Caenorhabditis elegans ,Caenorhabditis elegans Proteins ,Molecular Biology ,chemistry.chemical_classification ,biology ,Chemistry ,Neuropeptides ,Cell Biology ,Monooxygenase ,biology.organism_classification ,Proprotein convertase ,Carboxypeptidase ,Biosynthetic Pathways ,Neuropeptide processing ,030104 developmental biology ,Enzyme ,Amidine-Lyases ,Mutation ,biology.protein ,Sequence Alignment ,Copper ,Gene Deletion ,030217 neurology & neurosurgery ,Function (biology) - Abstract
Neuropeptides constitute a vast and functionally diverse family of neurochemical signaling molecules and are widely involved in the regulation of various physiological processes. The nematode Caenorhabditis elegans is well-suited for the study of neuropeptide biochemistry and function, as neuropeptide biosynthesis enzymes are not essential for C. elegans viability. This permits the study of neuropeptide biosynthesis in mutants lacking certain neuropeptide-processing enzymes. Mass spectrometry has been used to study the effects of proprotein convertase and carboxypeptidase mutations on proteolytic processing of neuropeptide precursors and on the peptidome in C. elegans. However, the enzymes required for the last step in the production of many bioactive peptides, the carboxyl-terminal amidation reaction, have not been characterized in this manner. Here, we describe three genes that encode homologs of neuropeptide amidation enzymes in C. elegans and used tandem LC-MS to compare neuropeptides in WT animals with those in newly generated mutants for these putative amidation enzymes. We report that mutants lacking both a functional peptidylglycine α-hydroxylating monooxygenase and a peptidylglycine α-amidating monooxygenase had a severely altered neuropeptide profile and also a decreased number of offspring. Interestingly, single mutants of the amidation enzymes still expressed some fully processed amidated neuropeptides, indicating the existence of a redundant amidation mechanism in C. elegans. All MS data are available via ProteomeXchange with the identifier PXD008942. In summary, the key steps in neuropeptide processing in C. elegans seem to be executed by redundant enzymes, and loss of these enzymes severely affects brood size, supporting the need of amidated peptides for C. elegans reproduction.
- Published
- 2018
- Full Text
- View/download PDF
13. A Gene Family Coding for Salivary Proteins (SHOT) of the Polyphagous Spider Mite Tetranychus urticae Exhibits Fast Host-Dependent Transcriptional Plasticity
- Author
-
Wannes Dermauw, Luc Tirry, Wim Reubens, Gerben Menschaert, Mousaalreza Khalighi, Wim Jonckheere, Bartel Vanholme, Thomas Van Leeuwen, Geert Baggerman, Nena Pavlidi, Merijn R. Kant, and Evolutionary and Population Biology (IBED, FNWI)
- Subjects
Proteomics ,0301 basic medicine ,Time Factors ,Transcription, Genetic ,Protein family ,Physiology ,Biology ,Host-Parasite Interactions ,03 medical and health sciences ,Gene Expression Regulation, Plant ,Spider mite ,Botany ,Plant defense against herbivory ,Animals ,Amino Acid Sequence ,RNA, Messenger ,Tetranychus urticae ,Salivary Proteins and Peptides ,Saliva ,Gene ,Peptide sequence ,Phylogeny ,Genetics ,Regulation of gene expression ,General Medicine ,Plants ,biology.organism_classification ,Chemistry ,030104 developmental biology ,Multigene Family ,Peptides ,Tetranychidae ,Engineering sciences. Technology ,Agronomy and Crop Science - Abstract
The salivary protein repertoire released by the herbivorous pest Tetranychus urticae is assumed to hold keys to its success on diverse crops. We report on a spider mite–specific protein family that is expanded in T. urticae. The encoding genes have an expression pattern restricted to the anterior podocephalic glands, while peptide fragments were found in the T. urticae secretome, supporting the salivary nature of these proteins. As peptide fragments were identified in a host-dependent manner, we designated this family as the SHOT (secreted host–responsive protein of Tetranychidae) family. The proteins were divided in three groups based on sequence similarity. Unlike TuSHOT3 genes, TuSHOT1 and TuSHOT2 genes were highly expressed when feeding on a subset of family Fabaceae, while expression was depleted on other hosts. TuSHOT1 and TuSHOT2 expression was induced within 24 h after certain host transfers, pointing toward transcriptional plasticity rather than selection as the cause. Transfer from an ‘inducer’ to a ‘noninducer’ plant was associated with slow yet strong downregulation of TuSHOT1 and TuSHOT2, occurring over generations rather than hours. This asymmetric on and off regulation points toward host-specific effects of SHOT proteins, which is further supported by the diversity of SHOT genes identified in Tetranychidae with a distinct host repertoire.
- Published
- 2018
- Full Text
- View/download PDF
14. Noncoding after All: Biases in Proteomics Data Do Not Explain Observed Absence of lncRNA Translation Products
- Author
-
Pieter-Jan Volders, Kris Gevaert, Jo Vandesompele, Lennart Martens, Pieter Mestdagh, Gerben Menschaert, Kenneth Verheggen, and Petra Van Damme
- Subjects
Proteomics ,Quality Control ,0301 basic medicine ,translation ,PROTEIN ,Computational biology ,Biology ,SEQUENCE ,Biochemistry ,public data ,Mass Spectrometry ,RNAS ,msms ,03 medical and health sciences ,lncRNA ,Eukaryotic genome ,REVEALS ,Humans ,mass spectrometry ,Genetics ,PROVIDES EVIDENCE ,Genome ,IDENTIFICATION ,LANDSCAPE ,030102 biochemistry & molecular biology ,Biology and Life Sciences ,Eukaryota ,High-Throughput Nucleotide Sequencing ,Proteins ,Translation (biology) ,General Chemistry ,micropeptide ,030104 developmental biology ,Gene Expression Regulation ,Protein Biosynthesis ,CELLS ,RNA, Long Noncoding - Abstract
Over the past decade, long noncoding RNAs (lncRNAs) have emerged as novel functional entities of the eukaryotic genome. However, the scientific community remains divided over the amount of true noncoding transcripts among the large number of unannotated transcripts identified by recent large scale and deep RNA-sequencing efforts. Here, we systematically exclude possible technical reasons underlying the absence of lncRNA-encoded proteins in mass spectrometry data sets, strongly suggesting that the large majority of lncRNAs is indeed not translated.
- Published
- 2017
- Full Text
- View/download PDF
15. proBAMconvert: A Conversion Tool for proBAM/proBed
- Author
-
Volodimir Olexiouk and Gerben Menschaert
- Subjects
0301 basic medicine ,Computer science ,Interface (computing) ,Information Storage and Retrieval ,Context (language use) ,computer.software_genre ,Biochemistry ,User-Computer Interface ,03 medical and health sciences ,Documentation ,Animals ,Humans ,Proteogenomics ,Graphical user interface ,Genome ,business.industry ,Command-line interface ,Chromosome Mapping ,Computational Biology ,General Chemistry ,File format ,Identification (information) ,030104 developmental biology ,Data mining ,Peptides ,business ,computer ,Algorithms ,Computer hardware - Abstract
The introduction of new standard formats, proBAM and proBed, improves the integration of genomics and proteomics information, thus aiding proteogenomics applications. These novel formats enable peptide spectrum matches (PSM) to be stored, inspected, and analyzed within the context of the genome. However, an easy-to-use and transparent tool to convert mass spectrometry identification files to these new formats is indispensable. proBAMconvert enables the conversion of common identification file formats (mzIdentML, mzTab, and pepXML) to proBAM/proBed using an intuitive interface. Furthermore, ProBAMconvert enables information to be output both at the PSM and peptide levels and has a command line interface next to the graphical user interface. Detailed documentation and a completely worked-out tutorial is available at http://probam.biobix.be .
- Published
- 2017
- Full Text
- View/download PDF
16. eIF1 modulates the recognition of suboptimal translation initiation sites and steers gene expression via uORFs
- Author
-
Elvis Ndah, Gerben Menschaert, Daria Fijalkowska, Veronique Jonckheere, Steven Verbruggen, and Petra Van Damme
- Subjects
0301 basic medicine ,Five prime untranslated region ,START CODON SELECTION ,OPEN READING FRAMES ,ISOFORM EXPRESSION ,Codon, Initiator ,Gene Expression ,Nerve Tissue Proteins ,Leaky scanning ,Biology ,UPSTREAM ORFS ,Cell Line ,Open Reading Frames ,03 medical and health sciences ,Eukaryotic translation ,Stress, Physiological ,MAMMALIAN-CELLS ,Eukaryotic initiation factor ,Upstream open reading frame ,Genetics ,Humans ,Initiation factor ,RNA, Messenger ,Eukaryotic Initiation Factors ,Peptide Chain Initiation, Translational ,EUKARYOTIC RIBOSOMES ,Biology and Life Sciences ,QUANTIFICATION ,HCT116 Cells ,Eukaryotic translation initiation factor 4 gamma ,Neoplasm Proteins ,Cell biology ,EIF1 ,030104 developmental biology ,MESSENGER-RNA TRANSLATION ,Gene Knockdown Techniques ,DISCOVERY ,Nucleic Acid Conformation ,RNA ,DEEP PROTEOME COVERAGE ,Energy Metabolism ,Ribosomes - Abstract
Alternative translation initiation mechanisms such as leaky scanning and reinitiation potentiate the polycistronic nature of human transcripts. By allowing for reprogrammed translation, these mechanisms can mediate biological responses to stimuli. We combined proteomics with ribosome profiling and mRNA sequencing to identify the biological targets of translation control triggered by the eukaryotic translation initiation factor 1 (eIF1), a protein implicated in the stringency of start codon selection. We quantified expression changes of over 4000 proteins and 10 000 actively translated transcripts, leading to the identification of 245 transcripts undergoing translational control mediated by upstream open reading frames (uORFs) upon eIF1 deprivation. Here, the stringency of start codon selection and preference for an optimal nucleotide context were largely diminished leading to translational upregulation of uORFs with suboptimal start. Interestingly, genes affected by eIF1 deprivation were implicated in energy production and sensing of metabolic stress.
- Published
- 2017
- Full Text
- View/download PDF
17. Mitochondrial peptide BRAWNIN is essential for vertebrate respiratory complex III assembly
- Author
-
Bruno Reversade, Shan Zhang, Loo Chien Wang, Srikanth Nama, Lena Ho, Ruey-Kuang Cheng, Caroline Lei Wee, Volodimir Olexiouk, Lisa Tucker-Kellogg, Gio Fidelito, Paula Duek Roggli, Joel Celio Francisco, Claire Tang, Prabha Sampath, Baptiste Kerouanton, Suresh Jesuthasan, Gerben Menschaert, Chao Liang, David A. Stroud, Lei Sun, Lydie Lane, Jih Hou Peh, Enrico Petretto, Boris Reljic, Radoslaw M. Sobota, Narendra Suhas Jagannathan, and Camille Mary
- Subjects
0301 basic medicine ,Male ,Proteomics ,PROTEOMICS DATA ,Respiratory chain ,General Physics and Astronomy ,PROTEIN ,Mitochondrion ,MICROPEPTIDE ,Oxidative Phosphorylation ,Open Reading Frames/genetics ,Animals, Genetically Modified ,Electron Transport Complex III ,Models ,Electron Transport Complex III/metabolism ,lcsh:Science ,Zebrafish ,Growth Disorders ,ddc:616 ,Multidisciplinary ,biology ,Mitochondrial Proteins/genetics/metabolism ,SMALL ORFS ,Cell biology ,Mitochondria ,MICROPROTEIN ,Mechanisms of disease ,Lactic acidosis ,Zebrafish/genetics/growth & development ,Gene Knockdown Techniques ,Proteome ,Models, Animal ,Acidosis, Lactic ,Female ,Acidosis ,Peptides/genetics/metabolism ,DATABASE ,Science ,Genetically Modified ,Zebrafish Proteins/genetics/metabolism ,Genetics and Molecular Biology ,Oxidative phosphorylation ,Models, Biological ,General Biochemistry, Genetics and Molecular Biology ,Article ,Mitochondrial Proteins ,03 medical and health sciences ,Open Reading Frames ,Lactic/genetics ,Mitochondria/metabolism ,medicine ,Animals ,Humans ,Metabolomics ,030102 biochemistry & molecular biology ,IDENTIFICATION ,Animal ,Biology and Life Sciences ,Growth Disorders/genetics ,General Chemistry ,Energy metabolism ,Zebrafish Proteins ,biology.organism_classification ,medicine.disease ,Biological ,GENE ,SIGNAL ,Disease Models, Animal ,030104 developmental biology ,Disease Models ,General Biochemistry ,UPDATE ,lcsh:Q ,Peptides ,Flux (metabolism) - Abstract
The emergence of small open reading frame (sORF)-encoded peptides (SEPs) is rapidly expanding the known proteome at the lower end of the size distribution. Here, we show that the mitochondrial proteome, particularly the respiratory chain, is enriched for small proteins. Using a prediction and validation pipeline for SEPs, we report the discovery of 16 endogenous nuclear encoded, mitochondrial-localized SEPs (mito-SEPs). Through functional prediction, proteomics, metabolomics and metabolic flux modeling, we demonstrate that BRAWNIN, a 71 a.a. peptide encoded by C12orf73, is essential for respiratory chain complex III (CIII) assembly. In human cells, BRAWNIN is induced by the energy-sensing AMPK pathway, and its depletion impairs mitochondrial ATP production. In zebrafish, Brawnin deletion causes complete CIII loss, resulting in severe growth retardation, lactic acidosis and early death. Our findings demonstrate that BRAWNIN is essential for vertebrate oxidative phosphorylation. We propose that mito-SEPs are an untapped resource for essential regulators of oxidative metabolism., Small open reading frame-encoded peptides (SEPs), shorter than 100 amino acids, are involved in many cell biological processes. Here the authors identify 16 nuclear-encoded mitochondrial SEPs, including BRAWNIN, an essential regulator of respiratory chain complex III assembly and ATP production.
- Published
- 2019
18. Downregulation of the FTO m
- Author
-
Jana, Jeschke, Evelyne, Collignon, Clémence, Al Wardi, Mohammad, Krayem, Martin, Bizet, Yan, Jia, Soizic, Garaud, Zéna, Wimana, Emilie, Calonne, Bouchra, Hassabi, Renato, Morandini, Rachel, Deplus, Pascale, Putmans, Gaurav, Dube, Nitesh Kumar, Singh, Alexander, Koch, Kateryna, Shostak, Lara, Rizzotto, Robert L, Ross, Christine, Desmedt, Yacine, Bareche, Françoise, Rothé, Jacqueline, Lehmann-Che, Martine, Duterque-Coquillaud, Xavier, Leroy, Gerben, Menschaert, Luis, Teixeira, Mingzhou, Guo, Patrick A, Limbach, Pierre, Close, Alain, Chariot, Eleonora, Leucci, Ghanem, Ghanem, Bi-Feng, Yuan, Karen, Willard-Gallo, Christos, Sotiriou, Jean-Christophe, Marine, and François, Fuks
- Subjects
Mice ,Epithelial-Mesenchymal Transition ,Alpha-Ketoglutarate-Dependent Dioxygenase FTO ,Animals ,Down-Regulation ,Humans ,RNA ,Neoplasms, Glandular and Epithelial - Abstract
Post-transcriptional modifications of RNA constitute an emerging regulatory layer of gene expression. The demethylase fat mass- and obesity-associated protein (FTO), an eraser of N
- Published
- 2019
19. A combined strategy of neuropeptide prediction and tandem mass spectrometry identifies evolutionarily conserved ancient neuropeptides in the sea anemone Nematostella vectensis
- Author
-
Eisuke Hayakawa, Liliane Schoofs, Geert Baggerman, Hiroshi Watanabe, Thomas W. Holstein, and Gerben Menschaert
- Subjects
0301 basic medicine ,Protein Extraction ,Animal Evolution ,Gene Expression ,Nematostella ,Peptide ,Tandem mass spectrometry ,Genome ,Biochemistry ,LOCUSTA-MIGRATORIA ,ELECTRON-TRANSFER DISSOCIATION ,Transcriptome ,Database and Informatics Methods ,0302 clinical medicine ,Tandem Mass Spectrometry ,Databases, Genetic ,Amidation ,HYDRA ,Database Searching ,Post-Translational Modification ,Conserved Sequence ,chemistry.chemical_classification ,Extraction Techniques ,Multidisciplinary ,Chemical Synthesis ,ANTHO-RFAMIDE ,PEPTIDOMICS ,Medicine ,Identification (biology) ,Drosophila melanogaster ,Engineering sciences. Technology ,Sequence Analysis ,Signal Peptides ,Research Article ,food.ingredient ,Bioinformatics ,Science ,In silico ,Neuropeptide ,Sequence Databases ,Computational biology ,Biology ,Research and Analysis Methods ,Evolution, Molecular ,03 medical and health sciences ,food ,Sequence Motif Analysis ,ENDOGENOUS PEPTIDES ,Animals ,Amino Acid Sequence ,Sequence Similarity Searching ,PRECURSOR ,Evolutionary Biology ,MOLECULAR-CLONING ,Neuropeptides ,Computational Biology ,Biology and Life Sciences ,Proteins ,biology.organism_classification ,NERVOUS-SYSTEM ,Organismal Evolution ,Electron-transfer dissociation ,030104 developmental biology ,Sea Anemones ,Biological Databases ,chemistry ,DROSOPHILA-MELANOGASTER ,Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization ,030217 neurology & neurosurgery - Abstract
Neuropeptides are a class of bioactive peptides and are responsible for various physiological processes including metabolism, development and reproduction. Although accumulated genome and transcriptome data have reported a number of neuropeptide candidates, it still remains difficult to obtain a comprehensive view of neuropeptide repertoires due to their small and variable nature. Neuropeptide prediction tools usually work only for those peptides for which sequentially related homologs have previously been identified. Recent peptidomics technology has enabled systematic structural identification of neuropeptides by using the combination of liquid chromatography and tandem mass spectrometry. However, obtaining reliable identifications of endogenous peptides is still difficult using a conventional tandem mass spectrometry-based peptide identification approach using protein database because a large search space has to be scanned due to the absence of a cleavage enzyme specification. We developed a pipeline consisting of the prediction ofin silicocleaved endogenous neuropeptides followed by peptide-spectrum matching enabling highly sensitive and reliable neuropeptide identification. This approach effectively reduces the search space of peptide-spectrum matching, and thus increases search sensitivity. To identify neuropeptides inNematostella vectensis,a basal eumetazoan having one of the most primitive nervous systems, we scanned theNematostellaprotein database for sequences displaying structural hallmarks of metazoan neuropeptides, including C/N-terminal structures and modifications. Peptide-spectrum matching was performed against thein silicocleaved peptides and successfully identified dozens of neuropeptides at high confidence. The identification of Nematostella neuropeptides structurally related the tachykinin, GnRH/AKH, neuromedin-U/pyrokinin peptide families indicate that these peptides already originated in the eumetazoan ancestor of all animal species, most likely concomitantly with the development of a nervous system.
- Published
- 2019
- Full Text
- View/download PDF
20. The Salivary Protein Repertoire of the Polyphagous Spider Mite Tetranychus urticae: A Quest for Effectors
- Author
-
Carlos A. Villarroel, Bartel Vanholme, Nicky Wybouw, Luc Tirry, Wannes Dermauw, Richard M. Clark, Thomas Van Leeuwen, Wim Jonckheere, Robert Greenhalgh, Merijn R. Kant, Rob C. Schuurink, Jan Van den Bulcke, Vladimir Zhurov, Gerben Menschaert, Mike Grbić, Geert Baggerman, Evolutionary and Population Biology (IBED, FNWI), IBED Other Research (FNWI), Faculty of Science, Plant Physiology (SILS, FNWI), and Population Biology (IBED, FNWI)
- Subjects
Crops, Agricultural ,Proteomics ,0301 basic medicine ,Saliva ,Biochemistry ,Host Specificity ,Arthropod Proteins ,Host-Parasite Interactions ,Analytical Chemistry ,03 medical and health sciences ,Tandem Mass Spectrometry ,Spider mite ,Botany ,Mite ,Plant defense against herbivory ,Animals ,Tissue Distribution ,Tetranychus urticae ,Salivary Proteins and Peptides ,Biology ,Molecular Biology ,2. Zero hunger ,Genetics ,biology ,Sequence Analysis, RNA ,Host (biology) ,Effector ,Research ,food and beverages ,biology.organism_classification ,Chemistry ,030104 developmental biology ,Gene Expression Regulation ,Body region ,Tetranychidae ,Chromatography, Liquid - Abstract
The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants—bean, maize, soy, and tomato—was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belonging to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily.
- Published
- 2016
- Full Text
- View/download PDF
21. The hunt for sORFs: A multidisciplinary strategy
- Author
-
Marlies K. R. Peeters and Gerben Menschaert
- Subjects
0301 basic medicine ,Proteome ,Computational biology ,Protein Sorting Signals ,Biology ,Proteomics ,Mini review ,Open Reading Frames ,03 medical and health sciences ,0302 clinical medicine ,Multidisciplinary approach ,Animals ,Humans ,Ribosome profiling ,High potential ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Cell Biology ,Plants ,030104 developmental biology ,Protein Biosynthesis ,030220 oncology & carcinogenesis ,Neural Networks, Computer ,Ribosomes ,Software - Abstract
Growing evidence illustrates the shortcomings on the current understanding of the full complexity of the proteome. Previously overlooked small open reading frames (sORFs) and their encoded microproteins have filled important gaps, exerting their function as biologically relevant regulators. The characterization of the full small proteome has potential applications in many fields. Continuous development of techniques and tools led to an improved sORF discovery, where these can originate from bioinformatics analyses, from sequencing routines or proteomics approaches. In this mini review, we discuss the ongoing trends in the three fields and suggest some strategies for further characterization of high potential candidates.
- Published
- 2020
- Full Text
- View/download PDF
22. A methodology for discovering novel brain-relevant peptides: Combination of ribosome profiling and peptidomics
- Author
-
Leslie G. Nucifora, Volodimir Olexiouk, Ravi Tharakan, Ceereena Ubaida-Mohien, Joëlle Lavoie, Nicholas T. Ingolia, Koko Ishizuka, Robert N. Cole, Simion Kreimer, Akira Sawa, and Gerben Menschaert
- Subjects
0301 basic medicine ,Male ,Proteomics ,Cell signaling ,Neuropeptide ,Peptide ,Computational biology ,Biology ,medicine.disease_cause ,Mass Spectrometry ,03 medical and health sciences ,Mice ,0302 clinical medicine ,Sequence Analysis, Protein ,medicine ,Animals ,Ribosome profiling ,Gene ,chemistry.chemical_classification ,Mutation ,General Neuroscience ,Neuropeptides ,Brain ,Translation (biology) ,General Medicine ,Mice, Inbred C57BL ,030104 developmental biology ,chemistry ,Peptides ,Ribosomes ,030217 neurology & neurosurgery ,Function (biology) - Abstract
Brain derived peptides function as signaling molecules in the brain and regulate various physiological and behavioral processes. The low abundance and atypical fragmentation of these brain derived peptides makes detection using traditional proteomic methods challenging. In this study, we introduce and validate a new methodology for the discovery of novel peptides derived from mammalian brain. This methodology combines ribosome profiling and mass spectrometry-based peptidomics. Using this framework, we have identified a novel peptide in mouse whole brain whose expression is highest in the basal ganglia, hypothalamus and amygdala. Although its functional role is unknown, it has been previously detected in peripheral tissue as a component of the mRNA decapping complex. Continued discovery and studies of novel regulating peptides in mammalian brain may also provide insight into brain disorders.
- Published
- 2018
23. Short Open Reading Frames and Their Encoded Peptides
- Author
-
Joseph A. Rothnagel and Gerben Menschaert
- Subjects
0301 basic medicine ,Computer science ,business.industry ,Computational biology ,Biology ,Proteomics ,Biochemistry ,Peptide Fragments ,03 medical and health sciences ,Open reading frame ,Open Reading Frames ,Workflow ,030104 developmental biology ,Text mining ,Ribosomal protein ,Protein Biosynthesis ,Animals ,Humans ,business ,Molecular Biology - Abstract
In this article, the following sentences and reference were erroneously published: “In their technical brief, Khitun and Slavoff describe their workflow to detect and validate non-annotated proteins with non-AUG start sites. This proteomic detection method is also exemplified with the characterization of the non-AUG initiated small protein (alt-RPL36) co-encoded with human ribosomal protein L36.” [11] S. A. K. Slavoff, Proteomics 2018 The Editorial Office apologizes for this error.
- Published
- 2018
24. mQC: A post-mapping data exploration tool for ribosome profiling
- Author
-
Steven Verbruggen and Gerben Menschaert
- Subjects
Quality Control ,IMPACT ,Computer science ,Health Informatics ,Triplet periodicity ,Ribosome profiling ,Quality visualization ,computer.software_genre ,DNA sequencing ,Open Reading Frames ,QUALITY-CONTROL ,REPRODUCIBILITY ,Cell Line, Tumor ,Gene expression ,Code (cryptography) ,Humans ,RNA, Messenger ,Cycloheximide ,IN-VIVO ,Sequence Analysis, RNA ,Gene Expression Profiling ,Biology and Life Sciences ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Reproducibility of Results ,Translation (biology) ,Sequence Analysis, DNA ,HCT116 Cells ,Computer Science Applications ,Data mapping ,HEK293 Cells ,NGS ,Colonic Neoplasms ,Data mining ,TRANSLATION ,computer ,Ribosomes ,Algorithms ,Software ,Genome-Wide Association Study - Abstract
Background and objective Ribosome profiling is a recent next generation sequencing technique enabling the genome-wide study of gene expression in biomedical research at the translation level. Too often, researchers precipitously start trying to test their hypotheses after alignment of their data, without checking the quality and the general features of their mapped data. Despite the fact that these checks are essential to prevent errors and ensure valid conclusions afterwards, easy-to-use tools for visualizing the quality and overall outlook of mapped ribosome profiling data are lacking. Methods We present mQC, a modular tool implemented as a Bioconda package and also available in the Galaxy tool shed. Herewith both bio-informaticians as well as non-experts can easily perform the indispensable visualization of both the quality and the general features of their mapped P-site corrected ribosome profiling reads. The user manual, the raw code and more information can be found on its GitHub repository ( https://github.com/Biobix/mQC ). Results mQC was tested on multiple datasets to assess its general applicability and was compared to other tools that partly perform similar tasks. Conclusions Our results demonstrate that mQC can accomplish an unfilled but essential position in the ribosome profiling data analysis procedure by performing a thorough RIBO-Seq-specific exploration of aligned and P-site corrected ribosome profiling data.
- Published
- 2018
25. Proteogenomics from a bioinformatics angle: A growing field
- Author
-
David Fenyö and Gerben Menschaert
- Subjects
0301 basic medicine ,Chemistry ,Genomics ,Genome project ,Gene Annotation ,Condensed Matter Physics ,Bioinformatics ,Proteomics ,Proteogenomics ,General Biochemistry, Genetics and Molecular Biology ,DNA sequencing ,Analytical Chemistry ,03 medical and health sciences ,030104 developmental biology ,Proteome ,Ribosome profiling ,Spectroscopy - Abstract
Proteogenomics is a research area that combines areas as proteomics and genomics in a multi-omics setup using both mass spectrometry and high-throughput sequencing technologies. Currently, the main goals of the field are to aid genome annotation or to unravel the proteome complexity. Mass spectrometry based identifications of matching or homologues peptides can further refine gene models. Also, the identification of novel proteoforms is also made possible based on detection of novel translation initiation sites (cognate or near-cognate), novel transcript isoforms, sequence variation or novel (small) open reading frames in intergenic or un-translated genic regions by analyzing high-throughput sequencing data from RNAseq or ribosome profiling experiments. Other proteogenomics studies using a combination of proteomics and genomics techniques focus on antibody sequencing, the identification of immunogenic peptides or venom peptides. Over the years, a growing amount of bioinformatics tools and databases became available to help streamlining these cross-omics studies. Some of these solutions only help in specific steps of the proteogenomics studies, e.g. building custom sequence databases (based on next generation sequencing output) for mass spectrometry fragmentation spectrum matching. Over the last few years a handful integrative tools also became available that can execute complete proteogenomics analyses. Some of these are presented as stand-alone solutions, whereas others are implemented in a web-based framework such as Galaxy. In this review we aimed at sketching a comprehensive overview of all the bioinformatics solutions that are available for this growing research area. © 2015 Wiley Periodicals, Inc. Mass Spec Rev 36:584-599, 2017.
- Published
- 2015
- Full Text
- View/download PDF
26. Proteomics Standards Initiative: Fifteen Years of Progress and Future Work
- Author
-
Yasset Perez-Riverol, Stefan Tenzer, Henning Hermjakob, Shin Kawano, Eric W. Deutsch, Gerhard Mayer, David L. Tabb, Sandra Orchard, Henry H N Lam, Wout Bittremieux, Pierre-Alain Binz, Reza M. Salek, Martin Eisenacher, Juan Antonio Vizcaíno, Gerben Menschaert, Andrew R. Jones, and Mathias Walzer
- Subjects
0301 basic medicine ,Proteomics ,protein quantification ,Emerging technologies ,Computer science ,computer.internet_protocol ,Guidelines as Topic ,computer.software_genre ,Biochemistry ,03 medical and health sciences ,protein identification ,Human proteome project ,Humans ,Community standards ,quality control ,Databases, Protein ,Biology ,database ,mass spectrometry ,Computer. Automation ,030102 biochemistry & molecular biology ,Application programming interface ,Proteomics Standards Initiative ,General Chemistry ,Reference Standards ,Data science ,metabolomics ,Chemistry ,030104 developmental biology ,Perspective ,data standard ,Web service ,bioinformatics software ,Working group ,computer ,XML ,Software ,molecular interactions - Abstract
The Proteomics Standards Initiative (PSI) of the Human Proteome Organization (HUPO) has now been developing and promoting open community standards and software tools in the field of proteomics for 15 years. Under the guidance of the chair, co-chairs, and other leadership positions, the PSI working groups are tasked with the development and maintenance of community standards via special workshops and ongoing work. Among the existing, ratified standards, the PSI working groups continue to update PSI-MI XML, MITAB, mzML, mzIdentML, mzQuantML, mzTab, and the MIAPE (Minimum Information About a Proteomics Experiment) guidelines with the advance of new technologies and techniques. Further, new standards are currently either in the final stages of completion (proBed and proBAM for proteogenomics results, as well as PEFF) or in early stages of design (a spectral library standard format, a universal spectrum identifier, the qcML quality control format, and the Protein Expression Interface (PROXI) web services Application Programming Interface). In this work we review the current status of all these aspects of the PSI, describe synergies with other efforts such as the ProteomeXchange Consortium, the Human Proteome Project, and the metabolomics community, and provide a look at future directions of the PSI.
- Published
- 2017
27. Discovery of noncanonical translation initiation sites through mass spectrometric analysis of protein N termini
- Author
-
Gerben Menschaert, Christopher M. Overall, Mustafa A. Barbhuiya, Steven Verbruggen, Chan Hyun Na, Akhilesh Pandey, Olga Pletnikova, Stephen M. Eacker, Marc K. Halushka, Juan C. Troncoso, and Min-Sik Kim
- Subjects
0301 basic medicine ,Untranslated region ,ACETYLATION ,Protein domain ,Computational biology ,Biology ,Proteomics ,Mass Spectrometry ,EVENTS ,03 medical and health sciences ,chemistry.chemical_compound ,Open Reading Frames ,Eukaryotic translation ,Start codon ,Protein Domains ,MAMMALIAN-CELLS ,REVEALS ,Genetics ,Human Umbilical Vein Endothelial Cells ,Humans ,Ribosome profiling ,Peptide Chain Initiation, Translational ,IN-VIVO ,Genetics (clinical) ,Methionine ,LANDSCAPE ,CLEAVAGE ,Research ,Biology and Life Sciences ,NUCLEOTIDE RESOLUTION ,PRODUCTS ,Open reading frame ,030104 developmental biology ,HEK293 Cells ,chemistry ,PROTEOMICS ,5' Untranslated Regions ,Ribosomes - Abstract
Translation initiation generally occurs at AUG codons in eukaryotes, although it has been shown that non-AUG or noncanonical translation initiation can also occur. However, the evidence for noncanonical translation initiation sites (TISs) is largely indirect and based on ribosome profiling (Ribo-seq) studies. Here, using a strategy specifically designed to enrich N termini of proteins, we demonstrate that many human proteins are translated at noncanonical TISs. The large majority of TISs that mapped to 5′ untranslated regions were noncanonical and led to N-terminal extension of annotated proteins or translation of upstream small open reading frames (uORF). It has been controversial whether the amino acid corresponding to the start codon is incorporated at the TIS or methionine is still incorporated. We found that methionine was incorporated at almost all noncanonical TISs identified in this study. Comparison of the TISs determined through mass spectrometry with ribosome profiling data revealed that about two-thirds of the novel annotations were indeed supported by the available ribosome profiling data. Sequence conservation across species and a higher abundance of noncanonical TISs than canonical ones in some cases suggests that the noncanonical TISs can have biological functions. Overall, this study provides evidence of protein translation initiation at noncanonical TISs and argues that further studies are required for elucidation of functional implications of such noncanonical translation initiation.
- Published
- 2017
28. An update on LNCipedia: a database for annotated human lncRNA sequences
- Author
-
Kenneth Verheggen, Klaas Vandepoele, Pieter-Jan Volders, Gerben Menschaert, Jo Vandesompele, Pieter Mestdagh, and Lennart Martens
- Subjects
Lncrna gene ,Biology ,Proteomics ,computer.software_genre ,03 medical and health sciences ,Annotation ,Mice ,0302 clinical medicine ,REVEALS ,Genetics ,Database Issue ,Animals ,Humans ,Zebrafish ,030304 developmental biology ,0303 health sciences ,Internet ,Database ,IDENTIFICATION ,Sequence Analysis, RNA ,LONG NONCODING RNA ,Proteins ,Biology and Life Sciences ,Molecular Sequence Annotation ,MASS-SPECTROMETRY ,GENE ,Long non-coding RNA ,HEK293 Cells ,DIFFERENTIATION ,RESOLUTION ,030220 oncology & carcinogenesis ,CELLS ,PROTEOMICS ,Human genome ,RNA, Long Noncoding ,INACTIVATION ,Corrigendum ,Databases, Nucleic Acid ,computer - Abstract
The human genome is pervasively transcribed, producing thousands of non-coding RNA transcripts. The majority of these transcripts are long non-coding RNAs (lncRNAs) and novel lncRNA genes are being identified at rapid pace. To streamline these efforts, we created LNCipedia, an online repository of lncRNA transcripts and annotation. Here, we present LNCipedia 3.0 (http://www.lncipedia.org), the latest version of the publicly available human lncRNA database. Compared to the previous version of LNCipedia, the database grew over five times in size, gaining over 90 000 new lncRNA transcripts. Assessment of the protein-coding potential of LNCipedia entries is improved with state-of-the art methods that include large-scale reprocessing of publicly available proteomics data. As a result, a high-confidence set of lncRNA transcripts with low coding potential is defined and made available for download. In addition, a tool to assess lncRNA gene conservation between human, mouse and zebrafish has been implemented.
- Published
- 2014
- Full Text
- View/download PDF
29. A proteogenomics approach integrating proteomics and ribosome profiling increases the efficiency of protein identification and enables the discovery of alternative translation start sites
- Author
-
Ellen De Meester, Sarah De Keulenaer, Elvis Ndah, Jeroen Crappé, Alexander Koch, Sandra Steyaert, Daria Gawron, Kris Gevaert, Petra Van Damme, Gerben Menschaert, Ben Shen, Wim Van Criekinge, and Ming Ma
- Subjects
Proteomics ,Genetics ,Computational Biology ,Proteins ,Translation (biology) ,Biology ,HCT116 Cells ,Proteogenomics ,Biochemistry ,Article ,Deep sequencing ,Protein sequencing ,Tandem Mass Spectrometry ,Protein Biosynthesis ,Proteome ,Humans ,Ribosome profiling ,Shotgun proteomics ,Ribosomes ,Molecular Biology - Abstract
Next-generation transcriptome sequencing is increasingly integrated with mass spectrometry to enhance MS-based protein and peptide identification. Recently, a breakthrough in transcriptome analysis was achieved with the development of ribosome profiling (ribo-seq). This technology is based on the deep sequencing of ribosome-protected mRNA fragments, thereby enabling the direct observation of in vivo protein synthesis at the transcript level. In order to explore the impact of a ribo-seq-derived protein sequence search space on MS/MS spectrum identification, we performed a comprehensive proteome study on a human cancer cell line, using both shotgun and N-terminal proteomics, next to ribosome profiling, which was used to delineate (alternative) translational reading-frames. By including protein-level evidence of sample-specific genetic variation and alternative translation, this strategy improved the identification score of 69 proteins and identified 22 new proteins in the shotgun experiment. Furthermore, we discovered 18 new alternative translation start sites in the N-terminal proteomics data and observed a correlation between the quantitative measures of ribo-seq and shotgun proteomics with a Pearson correlation coefficient ranging from 0.483 to 0.664. Overall, this study demonstrated the benefits of ribosome profiling for MS-based protein and peptide identification and we believe this approach could develop into a common practice for next-generation proteomics.
- Published
- 2014
- Full Text
- View/download PDF
30. N-terminal Proteomics and Ribosome Profiling Provide a Comprehensive View of the Alternative Translation Initiation Landscape in Mice and Men
- Author
-
Wim Van Criekinge, Gerben Menschaert, Daria Gawron, and Petra Van Damme
- Subjects
Proteomics ,Ribosomal Proteins ,Codon, Initiator ,Biology ,Biochemistry ,Ribosome ,Cell Line ,Analytical Chemistry ,Jurkat Cells ,Mice ,Eukaryotic translation ,Start codon ,Ribosomal protein ,Animals ,Humans ,Protein Isoforms ,Ribosome profiling ,Databases, Protein ,Peptide Chain Initiation, Translational ,Molecular Biology ,Genetics ,Research ,Alternative splicing ,Translation (biology) ,HCT116 Cells ,Proteome ,Peptides ,Ribosomes ,HeLa Cells - Abstract
Usage of presumed 5'UTR or downstream in-frame AUG codons, next to non-AUG codons as translation start codons contributes to the diversity of a proteome as protein isoforms harboring different N-terminal extensions or truncations can serve different functions. Recent ribosome profiling data revealed a highly underestimated occurrence of database nonannotated, and thus alternative translation initiation sites (aTIS), at the mRNA level. N-terminomics data in addition showed that in higher eukaryotes around 20% of all identified protein N termini point to such aTIS, to incorrect assignments of the translation start codon, translation initiation at near-cognate start codons, or to alternative splicing. We here report on more than 1700 unique alternative protein N termini identified at the proteome level in human and murine cellular proteomes. Customized databases, created using the translation initiation mapping obtained from ribosome profiling data, additionally demonstrate the use of initiator methionine decoded near-cognate start codons besides the existence of N-terminal extended protein variants at the level of the proteome. Various newly identified aTIS were confirmed by mutagenesis, and meta-analyses demonstrated that aTIS reside in strong Kozak-like motifs and are conserved among eukaryotes, hinting to a possible biological impact. Finally, TargetP analysis predicted that the usage of aTIS often results in altered subcellular localization patterns, providing a mechanism for functional diversification.
- Published
- 2014
- Full Text
- View/download PDF
31. Colorado potato beetle (Coleoptera) gut transcriptome analysis: expression of RNA interference-related genes
- Author
-
Hanneke Huvenne, Anna Kourti, Dimitris Kontogiannatos, Richard H. ffrench-Constant, Guy Smagghe, Gerben Menschaert, Luc Swevers, and Yannick Pauchet
- Subjects
Genetics ,Small RNA ,Small interfering RNA ,fungi ,RNA ,Piwi-interacting RNA ,Biology ,Molecular biology ,Transcriptome ,RNA silencing ,RNA interference ,Insect Science ,Gene expression ,Molecular Biology - Abstract
In the search for new methods of pest control, the potential of RNA interference (RNAi) is being explored. Because the gut is the first barrier for the uptake of double-stranded (ds)RNA, pyrosequencing of the gut transcriptome is a powerful tool for obtaining the necessary sequences for specific dsRNA-mediated pest control. In the present study, a dataset representing the gut transcriptome of the Colorado potato beetle (CPB; Leptinotarsa decemlineata) was generated and analysed for the presence of RNAi-related genes. Almost all selected genes that were implicated in silencing efficiency at different levels in the RNAi pathway (core machinery, associated intracellular factors, dsRNA uptake, antiviral RNAi, nucleases), which uses different types of small RNA (small interfering RNA, microRNA and piwi-RNA), were expressed in the CPB gut. Although the database is of lower quality, the majority of the RNAi genes are also found to be present in the gut transcriptome of the tobacco hornworm [TH; Manduca sexta (19 out of 35 genes analysed)]. The high quality of the CPB transcriptome database will lay the foundation for future gene expression and functional studies regarding the gut and RNAi.
- Published
- 2013
- Full Text
- View/download PDF
32. Deep Proteome Coverage Based on Ribosome Profiling Aids Mass Spectrometry-based Protein and Peptide Discovery and Provides Evidence of Alternative Translation Products and Near-cognate Translation Initiation Events*
- Author
-
Wim Van Criekinge, Alexander Koch, Tineke Notelaers, Kris Gevaert, Jeroen Crappé, Gerben Menschaert, and Petra Van Damme
- Subjects
Proteomics ,Genetics ,Chromatography ,Proteome ,Research ,High-Throughput Nucleotide Sequencing ,Biology ,Proteogenomics ,Biochemistry ,Deep sequencing ,Cell Line ,Analytical Chemistry ,Mice ,Protein sequencing ,MRNA Sequencing ,Tandem Mass Spectrometry ,Animals ,Ribosome profiling ,Databases, Protein ,Peptides ,Shotgun proteomics ,Ribosomes ,Molecular Biology - Abstract
An increasing number of studies involve integrative analysis of gene and protein expression data, taking advantage of new technologies such as next-generation transcriptome sequencing and highly sensitive mass spectrometry (MS) instrumentation. Recently, a strategy, termed ribosome profiling (or RIBO-seq), based on deep sequencing of ribosome-protected mRNA fragments, indirectly monitoring protein synthesis, has been described. We devised a proteogenomic approach constructing a custom protein sequence search space, built from both Swiss-Prot- and RIBO-seq-derived translation products, applicable for MS/MS spectrum identification. To record the impact of using the constructed deep proteome database, we performed two alternative MS-based proteomic strategies as follows: (i) a regular shotgun proteomic and (ii) an N-terminal combined fractional diagonal chromatography (COFRADIC) approach. Although the former technique gives an overall assessment on the protein and peptide level, the latter technique, specifically enabling the isolation of N-terminal peptides, is very appropriate in validating the RIBO-seq-derived (alternative) translation initiation site profile. We demonstrate that this proteogenomic approach increases the overall protein identification rate 2.5% (e.g. new protein products, new protein splice variants, single nucleotide polymorphism variant proteins, and N-terminally extended forms of known proteins) as compared with only searching UniProtKB-SwissProt. Furthermore, using this custom database, identification of N-terminal COFRADIC data resulted in detection of 16 alternative start sites giving rise to N-terminally extended protein variants besides the identification of four translated upstream ORFs. Notably, the characterization of these new translation products revealed the use of multiple near-cognate (non-AUG) start codons. As deep sequencing techniques are becoming more standard, less expensive, and widespread, we anticipate that mRNA sequencing and especially custom-tailored RIBO-seq will become indispensable in the MS-based protein or peptide identification process. The underlying mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the dataset identifier PXD000124.
- Published
- 2013
- Full Text
- View/download PDF
33. Identification of Small Novel Coding Sequences, a Proteogenomics Endeavor
- Author
-
Volodimir, Olexiouk and Gerben, Menschaert
- Subjects
Mice ,Open Reading Frames ,Genome ,Sequence Analysis, RNA ,Protein Biosynthesis ,Animals ,High-Throughput Nucleotide Sequencing ,Humans ,Antigens ,Oligopeptides ,Ribosomes ,Proteogenomics - Abstract
The identification of small proteins and peptides has consistently proven to be challenging. However, technological advances as well as multi-omics endeavors facilitate the identification of novel small coding sequences, leading to new insights. Specifically, the application of next generation sequencing technologies (NGS), providing accurate and sample specific transcriptome / translatome information, into the proteomics field led to more comprehensive results and new discoveries. This book chapter focuses on the inclusion of RNA-Seq and RIBO-Seq also known as ribosome profiling, an RNA-Seq based technique sequencing the +/- 30 bp long fragments captured by translating ribosomes. We emphasize the identification of micropeptides and neo-antigens, two distinct classes of small translation products, triggering our current understanding of biology. RNA-Seq is capable of capturing sample specific genomic variations, enabling focused neo-antigen identification. RIBO-Seq can identify translation events in small open reading frames which are considered to be non-coding, leading to the discovery of micropeptides. The identification of small translation products requires the integration of multi-omics data, stressing the importance of proteogenomics in this novel research area.
- Published
- 2016
34. A Role for Human N-alpha Acetyltransferase 30 (Naa30) in Maintaining Mitochondrial Integrity
- Author
-
Thomas Vikestad Kalvik, Veronique Jonckheere, Kris Gevaert, Line M. Myklebust, Petra Van Damme, Kristian K. Starheim, Thomas Arnesen, Gerben Menschaert, and Jan Erik Varhaug
- Subjects
0301 basic medicine ,Proteomics ,MEMBRANE-PROTEIN SYS1P ,Protein subunit ,Plasma protein binding ,Mitochondrion ,Biology ,Biochemistry ,NatC complex ,Analytical Chemistry ,Substrate Specificity ,SACCHAROMYCES-CEREVISIAE ,Mitochondrial Proteins ,03 medical and health sciences ,0302 clinical medicine ,Cell Line, Tumor ,Mitophagy ,Humans ,YEAST ,Protein Interaction Maps ,N-Terminal Acetyltransferase C ,Molecular Biology ,ARF-LIKE GTPASE ,COMPLEX ,Research ,Biology and Life Sciences ,Acetylation ,DEGRADATION ,APOPTOSIS ,Cell biology ,TERMINAL ACETYLATION ,030104 developmental biology ,030220 oncology & carcinogenesis ,Acetyltransferase ,Gene Knockdown Techniques ,PROTEOMICS ,MITOPHAGY ,HeLa Cells ,Protein Binding - Abstract
N-terminal acetylation (Nt-acetylation) by N-terminal acetyltransferases (NATs) is one of the most common protein modifications in eukaryotes. The NatC complex represents one of three major NATs of which the substrate profile remains largely unexplored. Here, we defined the in vivo human NatC Nt-acetylome on a proteome-wide scale by combining knockdown of its catalytic subunit Naa30 with positional proteomics. We identified 46 human NatC substrates, expanding our current knowledge on the substrate repertoire of NatC which now includes proteins harboring Met-Leu, Met-Ile, Met-Phe, Met-Trp, Met-Val, Met-Met, Met-His and Met-Lys N termini. Upon Naa30 depletion the expression levels of several organellar proteins were found reduced, in particular mitochondrial proteins, some of which were found to be NatC substrates. Interestingly, knockdown of Naa30 induced the loss of mitochondrial membrane potential and fragmentation of mitochondria. In conclusion, NatC N-tacetylates a large variety of proteins and is essential for mitochondrial integrity and function.
- Published
- 2016
35. Linking Mass Spectrometric Imaging and Traditional Peptidomics: A Validation in the Obese Mouse Model
- Author
-
Laurens Minerva, Gerben Menschaert, Bart Landuyt, Kurt Boonen, Geert Baggerman, and Lut Arckens
- Subjects
Proteomics ,Difference gel electrophoresis ,Molecular Sequence Data ,Analytical chemistry ,Mice, Obese ,Peptide ,Tandem mass spectrometry ,Mass spectrometry imaging ,Fourier transform ion cyclotron resonance ,Analytical Chemistry ,Mice ,Peptide mass fingerprinting ,Animals ,Cluster Analysis ,Amino Acid Sequence ,Pancreas ,Chromatography, High Pressure Liquid ,chemistry.chemical_classification ,Chromatography ,Fourier Analysis ,Chemistry ,Mass spectrometric ,Disease Models, Animal ,Tissue sections ,Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization ,Peptides - Abstract
MALDI mass spectrometry imaging (MSI) is a promising technique in the field of molecular (immuno)histology but is confronted with the problematic large-scale identification of peptides from thin tissue sections. In this study we present a workflow that significantly increased the number of identified peptides in a given MALDI-MSI data set and we evaluated its power concerning relative peptide quantifications. Fourier transform mass spectrometry (FTMS) profiling on matrix-coated thin tissue sections allowed us to align spectra of different MS sources, matching identical peaks in the process, thus linking MSI data to tandem mass spectrometry (MS/MS) on one hand and semiquantitative liquid chromatography (LC)/MS data on the other. Bonanza clustering was applied in order to group MS/MS spectra of structurally related peptides, making it possible to infer the identity of MSI-detected compounds based on identified members within the same cluster, effectively increasing the number of identifications in a single MSI data set. Out of 136 detected peptides with MALDI-MSI, we were able to identify 46 peptides. For 31 of these, a LC/quadrupole time-of-flight (QTOF) counterpart was detected, and we observed similar obese (ob/ob) to wild-type (wt) peak intensity ratios for 18 peptides. This workflow significantly increased the number of identifications of peptide masses detected with MALDI-MSI and evaluated the power of this imaging method for relative quantification of peptide levels between experimental conditions.
- Published
- 2011
- Full Text
- View/download PDF
36. Using the sORFs.Org Database
- Author
-
Volodimir Olexiouk and Gerben Menschaert
- Subjects
0303 health sciences ,Database ,Computer science ,030305 genetics & heredity ,Translation (biology) ,General Medicine ,computer.software_genre ,Pipeline (software) ,Public repository ,Visualization ,Search Engine ,Open Reading Frames ,User-Computer Interface ,03 medical and health sciences ,Identification (information) ,Databases, Genetic ,Ribosome profiling ,computer ,030304 developmental biology - Abstract
Ribosome profiling involves sequencing of approximately 30-base-long stretches of ribosome-protected mRNA. The technique enables genome-wide mapping of RNA undergoing active translation. Numerous small open reading frames have been identified by using ribosome profiling, leading researchers to question the assumed non-functional character of sORFs and to the identification of various important sORF translation products. sORFs.org (https://www.sorfs.org) is a public repository of small open reading frames identified by ribosome profiling in a database of over 3 million sORFs across 78 datasets from six species. sORFs.org is a multi-omics endeavor providing tools and metrics to assess the coding potential of the delineated sORFs. A pipeline is also in place to systematically rescan public mass spectrometry datasets to acquire new experimental evidence for sORF-encoded polypeptides. sORFs.org provides two distinct query interfaces, export functionality, and various visualization tools to enable inspection of the available information. © 2018 by John Wiley & Sons, Inc.
- Published
- 2018
- Full Text
- View/download PDF
37. Comprehensive Peptide Analysis of Mouse Brain Striatum Identifies Novel sORF-Encoded Polypeptides
- Author
-
Kurt Boonen, Harshavardhan Budamgunta, Walter Luyten, Karin Schildermans, Geert Baggerman, Gerben Menschaert, Evelyne Maes, and Volodimir Olexiouk
- Subjects
0301 basic medicine ,chemistry.chemical_classification ,DNA repair ,Translation (biology) ,Peptide ,Computational biology ,Biology ,Biochemistry ,Chemistry ,03 medical and health sciences ,Open reading frame ,030104 developmental biology ,chemistry ,Cytoplasm ,Transcriptional regulation ,Secretion ,Ribosome profiling ,Molecular Biology - Abstract
Bio-active peptides are involved in the regulation of most physiological processes in the body. Classical bio-active peptides (CBAPs) are cleaved from a larger precursor protein and stored in secretion vesicles from which they are released in the extracellular space. Recently, another non-classical type of bio-active peptides (NCBAPs) has gained interest. These typically are not secreted but instead appear to be translated from short open reading frames (sORF) and released directly into the cytoplasm. In contrast to CBAPs, these peptides are involved in the regulation of intra-cellular processes such as transcriptional control, calcium handling and DNA repair. However, bio-chemical evidence for the translation of sORFs remains elusive. Comprehensive analysis of sORF-encoded polypeptides (SEPs) is hampered by a number of methodological and biological challenges: the low molecular mass (many 4-10kDa), the low abundance, transient expression and complications in data analysis. We developed a strategy to address a number of these issues. Our strategy is to exclude false positive identifications. In total sample, we identified 926 peptides originated from 37 known (neuro)peptide precursors in mouse striatum. In addition, four SEPs were identified including NoBody, a SEP that was previously discovered in humans and three novel SEPS from 5 untranslated transcript regions (UTRs).
- Published
- 2018
- Full Text
- View/download PDF
38. Glycosylation Signatures in Drosophila: Fishing with Lectins
- Author
-
Els J.M. Van Damme, Guy Smagghe, Rameshwaram Nagender Rao, Kris Gevaert, Gianni Vandenborre, Mohamad Hamshou, Gerben Menschaert, and Bart Ghesquière
- Subjects
chemistry.chemical_classification ,Glycan ,Glycosylation ,Computational Biology ,Lectin ,General Chemistry ,Biology ,biology.organism_classification ,Biochemistry ,chemistry.chemical_compound ,Drosophila melanogaster ,Immobilized Proteins ,Agglutinin ,chemistry ,Melanogaster ,biology.protein ,Animals ,Drosophila Proteins ,Plant Lectins ,Glycoprotein ,Glycoproteins ,Galanthus nivalis - Abstract
Glycosylation is a co- and/or post-translational protein modification that generates enormous structural diversity among glycoproteins. In this study, immobilized lectins were used to capture glycoproteins with different glycan profiles from Drosophila melanogaster extracts. On the basis of previous results from glycan array analyses, the snowdrop (Galanthus nivalis) agglutinin (GNA), the tobacco (Nicotiana tabacum) lectin (Nictaba) and the Rhizoctoni solani agglutinin (RSA) were used to select for a broad range of N- and O-glycan structures. After different lectin affinity chromatographies, the glycoproteome of Drosophila was analyzed using LC-MS/MS and glycoprotein abundances were calculated by different label-free methods. Bioinformatics tools were used to annotate the identified glycoproteins and the glycoproteins were classified according to their molecular function or their involvement in a biological process. Subsequent enrichment analysis (using the DAVID database) was employed to find biological processes or molecular functions in Drosophila in which a particular glycan signature is overrepresented. The results presented here clearly demonstrate that next to the presence of high-mannose and pauci-mannose N-glycans, Drosophila is capable of synthesizing glycoproteins carrying extended hybrid and complex N-linked glycans. Furthermore, it was demonstrated that a specific glycosylation signature can be associated with a functionally related group of glycoproteins in Drosophila, both in terms of biological process and molecular function.
- Published
- 2010
- Full Text
- View/download PDF
39. Spectral clustering in peptidomics studies helps to unravel modification profile of biologically active peptides and enhances peptide identification rate
- Author
-
Eisuke Hayakawa, Wim Van Criekinge, Bart Landuyt, Walter Luyten, Gerben Menschaert, Liliane Schoofs, and Tom T. M. Vandekerckhove
- Subjects
Male ,Proteomics ,Peptide ,Computational biology ,Biology ,Tandem mass spectrometry ,Biochemistry ,Islets of Langerhans ,Mice ,Tandem Mass Spectrometry ,Animals ,Cluster Analysis ,Database search engine ,Databases, Protein ,Cluster analysis ,Molecular Biology ,chemistry.chemical_classification ,Mice, Inbred BALB C ,Reproducibility of Results ,Combinatorial chemistry ,Spectral clustering ,chemistry ,Pituitary Gland ,Female ,Identification (biology) ,Time-of-flight mass spectrometry ,Peptides ,Protein Processing, Post-Translational ,Algorithms - Abstract
When studying the set of biologically active peptides (the so-called peptidome) of a cell type, organ, or entire organism, the identification of peptides is mostly attempted by MS. However, identification rates are often dismally unsatisfactory. A great deal of failed or missed identifications may be attributable to the wealth of modifications on peptides, some of which may originate from in vivo post-translational processes to activate the molecule, whereas others could be introduced during the tissue preparation procedures. Preliminary knowledge of the modification profile of specific peptidome samples would greatly improve identification rates. To this end we developed an approach that performs clustering of mass spectra in a way that allows us to group spectra having similar peak patterns over significant segments. Comparing members of one spectral group enables us to assess the modifications (expressed as mass shifts in Dalton) present in a peptidome sample. The clustering algorithm in this study is called Bonanza, and it was applied to MALDI-TOF/TOF MS spectra from the mouse. Peptide identification rates went up from 17 to 36% for 278 spectra obtained from the pancreatic islets and from 21 to 43% for 163 pituitary spectra. Spectral clustering with subsequent advanced database search may result in the discovery of new biologically active peptides and modifications thereof, as shown by this report indeed.
- Published
- 2009
- Full Text
- View/download PDF
40. Fast and Reliable Quantitative Peptidomics with labelpepmatch
- Author
-
Gerben Menschaert, Tom Wenseleers, Liliane Schoofs, Rik Verdonck, Thomas Huhn, Wouter De Haes, Geert Baggerman, Dries Cardoen, Bart Landuyt, and Kurt Boonen
- Subjects
0301 basic medicine ,Proteomics ,Neuropeptides ,General Chemistry ,Grasshoppers ,Biology ,computer.software_genre ,Bioinformatics ,Biochemistry ,Mass Spectrometry ,Chemistry ,03 medical and health sciences ,Identification (information) ,030104 developmental biology ,0302 clinical medicine ,Statistical inference ,Animals ,Insect Proteins ,Data mining ,Amino Acid Sequence ,computer ,030217 neurology & neurosurgery ,Software ,Chromatography, Liquid - Abstract
The use of stable isotope tags in quantitative peptidomics offers many advantages, but the laborious identification of matching sets of labeled peptide peaks is still a major bottleneck. Here we present labelpepmatch, an R package for fast and straightforward analysis of LC MS spectra of labeled peptides. This open-source tool offers fast and accurate identification of peak pairs alongside an appropriate framework for statistical inference on quantitative peptidomics data, based on techniques from other-omics disciplines. A relevant case study on the desert locust Schistocerca gregaria proves our pipeline to be a reliable tool for quick but thorough explorative analyses.
- Published
- 2016
41. Identification of Small Novel Coding Sequences, a Proteogenomics Endeavor
- Author
-
Volodimir Olexiouk and Gerben Menschaert
- Subjects
0301 basic medicine ,Genetics ,Translation (biology) ,Computational biology ,Biology ,Proteomics ,Proteogenomics ,Genome ,DNA sequencing ,Transcriptome ,03 medical and health sciences ,030104 developmental biology ,Identification (biology) ,Ribosome profiling - Abstract
The identification of small proteins and peptides has consistently proven to be challenging. However, technological advances as well as multi-omics endeavors facilitate the identification of novel small coding sequences, leading to new insights. Specifically, the application of next generation sequencing technologies (NGS), providing accurate and sample specific transcriptome / translatome information, into the proteomics field led to more comprehensive results and new discoveries. This book chapter focuses on the inclusion of RNA-Seq and RIBO-Seq also known as ribosome profiling, an RNA-Seq based technique sequencing the +/- 30 bp long fragments captured by translating ribosomes. We emphasize the identification of micropeptides and neo-antigens, two distinct classes of small translation products, triggering our current understanding of biology. RNA-Seq is capable of capturing sample specific genomic variations, enabling focused neo-antigen identification. RIBO-Seq can identify translation events in small open reading frames which are considered to be non-coding, leading to the discovery of micropeptides. The identification of small translation products requires the integration of multi-omics data, stressing the importance of proteogenomics in this novel research area.
- Published
- 2016
- Full Text
- View/download PDF
42. PubMeth: a cancer methylation database combining text-mining and expert annotation
- Author
-
Gerben Menschaert, Maté Ongenaert, Tim De Meyer, Wim Van Criekinge, Leander Van Neste, and Sofie Bekaert
- Subjects
Agriculture and Food Sciences ,PubMed ,MEDLINE ,Biology ,computer.software_genre ,Bioinformatics ,User-Computer Interface ,Annotation ,Text mining ,Neoplasms ,Databases, Genetic ,Genetics ,medicine ,Epigenetics ,Gene ,Internet ,Database ,business.industry ,Cancer ,Articles ,Methylation ,DNA Methylation ,medicine.disease ,DNA methylation ,business ,computer ,Genes, Neoplasm - Abstract
Epigenetics, and more specifically DNA methylation is a fast evolving research area. In almost every cancer type, each month new publications confirm the differentiated regulation of specific genes due to methylation and mention the discovery of novel methylation markers. Therefore, it would be extremely useful to have an annotated, reviewed, sorted and summarized overview of all available data. PubMeth is a cancer methylation database that includes genes that are reported to be methylated in various cancer types. A query can be based either on genes (to check in which cancer types the genes are reported as being methylated) or on cancer types (which genes are reported to be methylated in the cancer (sub) types of interest). The database is freely accessible at http://www.pubmeth.org. PubMeth is based on text-mining of Medline/PubMed abstracts, combined with manual reading and annotation of preselected abstracts. The text-mining approach results in increased speed and selectivity (as for instance many different aliases of a gene are searched at once), while the manual screening significantly raises the specificity and quality of the database. The summarized overview of the results is very useful in case more genes or cancer types are searched at the same time.
- Published
- 2007
- Full Text
- View/download PDF
43. Proteogenomics from a bioinformatics angle: A growing field
- Author
-
Gerben, Menschaert and David, Fenyö
- Subjects
Proteomics ,Databases, Factual ,Sequence Analysis, Protein ,Venoms ,Animals ,Computational Biology ,Genomics ,Peptides ,Peptide Mapping ,Antibodies ,Mass Spectrometry ,Software ,Article - Abstract
Proteogenomics is a research area that combines areas as proteomics and genomics in a multi-omics setup using both mass spectrometry and high-throughput sequencing technologies. Currently, the main goals of the field are to aid genome annotation or to unravel the proteome complexity. Mass spectrometry based identifications of matching or homologues peptides can further refine gene models. Also, the identification of novel proteoforms is also made possible based on detection of novel translation initiation sites (cognate or near-cognate), novel transcript isoforms, sequence variation or novel (small) open reading frames in intergenic or un-translated genic regions by analyzing high-throughput sequencing data from RNAseq or ribosome profiling experiments. Other proteogenomics studies using a combination of proteomics and genomics techniques focus on antibody sequencing, the identification of immunogenic peptides or venom peptides. Over the years, a growing amount of bioinformatics tools and databases became available to help streamlining these cross-omics studies. Some of these solutions only help in specific steps of the proteogenomics studies, e.g. building custom sequence databases (based on next generation sequencing output) for mass spectrometry fragmentation spectrum matching. Over the last few years a handful integrative tools also became available that can execute complete proteogenomics analyses. Some of these are presented as stand-alone solutions, whereas others are implemented in a web-based framework such as Galaxy. In this review we aimed at sketching a comprehensive overview of all the bioinformatics solutions that are available for this growing research area. © 2015 Wiley Periodicals, Inc. Mass Spec Rev 36:584-599, 2017.
- Published
- 2015
44. REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes
- Author
-
Gerben Menschaert, Elvis Ndah, Veronique Jonckheere, Eivind Valen, Adam Giess, and Petra Van Damme
- Subjects
Salmonella typhimurium ,0301 basic medicine ,PROTEINS ,PSEUDOGENES ,Pseudogene ,Computational biology ,Bacterial genome size ,Biology ,BINDING-SITES ,Genome ,Machine Learning ,Open Reading Frames ,03 medical and health sciences ,Annotation ,REVEALS ,Genetics ,Ribosome profiling ,ORFS ,MICROBIAL GENE IDENTIFICATION ,IN-VIVO ,COMPLEXITY ,Escherichia coli K12 ,Chromosome Mapping ,Computational Biology ,Biology and Life Sciences ,Molecular Sequence Annotation ,Genome project ,NUCLEOTIDE RESOLUTION ,SMALL ORFS ,Open reading frame ,030104 developmental biology ,Proteome ,Methods Online ,TRANSLATION ,Ribosomes ,Algorithms ,Genome, Bacterial ,Bacillus subtilis - Abstract
Prokaryotic genome annotation is highly dependent on automated methods, as manual curation cannot keep up with the exponential growth of sequenced genomes. Current automated methods depend heavily on sequence composition and often underestimate the complexity of the proteome. We developed RibosomeE Profiling Assisted (re-)AnnotaTION (REPARATION), a de novo machine learning algorithm that takes advantage of experimental protein synthesis evidence from ribosome profiling (Ribo-seq) to delineate translated open reading frames (ORFs) in bacteria, independent of genome annotation (https://github.com/Biobix/REPARATION). REPARATION evaluates all possible ORFs in the genome and estimates minimum thresholds based on a growth curve model to screen for spurious ORFs. We applied REPARATION to three annotated bacterial species to obtain a more comprehensive mapping of their translation landscape in support of experimental data. In all cases, we identified hundreds of novel (small) ORFs including variants of previously annotated ORFs and >70% of all (variants of) annotated protein coding ORFs were predicted by REPARATION to be translated. Our predictions are supported by matching mass spectrometry proteomics data, sequence composition and conservation analysis. REPARATION is unique in that it makes use of experimental translation evidence to intrinsically perform a de novo ORF delineation in bacterial genomes irrespective of the sequence features linked to open reading frames.
- Published
- 2017
- Full Text
- View/download PDF
45. [Untitled]
- Author
-
Patrick Sorgeloos, Ronaldo O. Cavalli, Gerben Menschaert, and Patrick Lavens
- Subjects
biology ,Macrobrachium rosenbergii ,Offspring ,Ecology ,business.industry ,media_common.quotation_subject ,Aquatic animal ,Broodstock ,Aquatic Science ,biology.organism_classification ,Fecundity ,Animal science ,Aquaculture ,Prawn ,Reproduction ,business ,Agronomy and Crop Science ,media_common - Abstract
The effects of increasing levels of dietaryphospholipids (PL) on the reproductive performance,egg and larval quality, and lipid composition offemales of the freshwater prawn Macrobrachiumrosenbergii were investigated. Three isolipidic dietscontaining similar amounts of highly unsaturated fattyacids but varying levels of PL (0.8, 2.4 and 4.6%)were fed during 180 days to three groups of eightfemales originating from Thai ponds. No significantdifferences were observed for fecundity, egg size andhatchability, starved larvae size, and size, survivaland tolerance to stress of 8 day-old larvae.Similarly, no major differences in the lipidcomposition of the midgut gland, ovaries and muscletissue of females could be detected. Results indicatethe lack of need of dietary supplementation of PL forM. rosenbergii broodstock, as previouslyreported for earlier life stages. It is suggested thatthe basal level of 0.8% dietary PL was sufficient tomeet the dietary demands of the prawn broodstock. ThePL requirements of M. rosenbergii broodstock, ifany, may be satisfied in commercial feeds through theinclusion of ingredients containing some phospholipidsendogenously.
- Published
- 2000
- Full Text
- View/download PDF
46. Improving the identification rate of endogenous peptides using electron transfer dissociation and collision-induced dissociation
- Author
-
Kris Gevaert, Gerben Menschaert, Geert Baggerman, Walter Luyten, Liliane Schoofs, Pieter-Jan De Bock, and Eisuke Hayakawa
- Subjects
Proteomics ,Collision-induced dissociation ,Antimicrobial peptides ,Molecular Sequence Data ,Peptide ,Electrons ,Computational biology ,Tandem mass spectrometry ,Biochemistry ,Electron Transport ,Mice ,Peptide mass fingerprinting ,Tandem Mass Spectrometry ,Tumor Cells, Cultured ,Animals ,Database search engine ,Amino Acid Sequence ,Peptide sequence ,Biology ,chemistry.chemical_classification ,Chemistry ,Neuropeptides ,Molecular Sequence Annotation ,General Chemistry ,Combinatorial chemistry ,Peptide Fragments ,Electron-transfer dissociation ,Pituitary Gland ,Proteolysis - Abstract
Tandem mass spectrometry (MS/MS) combined with bioinformatics tools have enabled fast and systematic protein identification based on peptide-to-spectrum matches. However, it remains challenging to obtain accurate identification of endogenous peptides, such as neuropeptides, peptide hormones, peptide pheromones, venom peptides, and antimicrobial peptides. Since these peptides are processed at sites that are difficult to predict reliably, the search of their MS/MS spectra in sequence databases needs to be done without any protease setting. In addition, many endogenous peptides carry various post-translational modifications, making it essential to take these into account in the database search. These characteristics of endogenous peptides result in a huge search space, frequently leading to poor confidence of the peptide characterizations in peptidomics studies. We have developed a new MS/MS spectrum search tool for highly accurate and confident identification of endogenous peptides by combining two different fragmentation methods. Our approach takes advantage of the combination of two independent fragmentation methods (collision-induced dissociation and electron transfer dissociation). Their peptide spectral matching is carried out separately in both methods, and the final score is built as a combination of the two separate scores. We demonstrate that this approach is very effective in discriminating correct peptide identifications from false hits. We applied this approach to a spectral data set of neuropeptides extracted from mouse pituitary tumor cells. Compared to conventional MS-based identification, i.e., using a single fragmentation method, our approach significantly increased the peptide identification rate. It proved also highly effective for scanning spectra against a very large search space, enabling more accurate genome-wide searches and searches including multiple potential post-translational modifications.
- Published
- 2013
47. Peptidomics coming of age: a review of contributions from a bioinformatics angle
- Author
-
Liliane Schoofs, Walter Luyten, Tom T. M. Vandekerckhove, Geert Baggerman, Wim Van Criekinge, and Gerben Menschaert
- Subjects
Proteomics ,Proteomics methods ,Computational Biology ,Reproducibility of Results ,General Chemistry ,Biology ,Bioinformatics ,Biochemistry ,Tandem mass spectrum ,Field (computer science) ,Bioactive peptide ,Tandem Mass Spectrometry ,Animals ,Humans ,Identification (biology) ,Biomarker discovery ,Analysis tools ,Databases, Protein ,Peptides - Abstract
The term peptidomics for a new promising "omics" field was not introduced until the beginning of 2000. The approach has been proven successful in several domains such as neuroendocrine research and biomarker or drug discovery. This review reports on bioinformatics tools and methodologies within the peptidomics field and the application thereof. Obviously, a plethora of proteomics data analysis tools lends themselves to direct use in peptidomics because the latter is a subfield of the former, at least to a certain extent. Nevertheless, peptidomics-specific tool extensions, inventions, and validation procedures have emerged, and certain tools are more suitable for this subfield than others due to small but important differences in peptidomics sample analysis. This paper focuses on these topics. Furthermore, it gives a comprehensive overview of available online tools tailored to the peptidomics field. To conclude, an ideal pipeline for bioactive peptide identification is presented.
- Published
- 2010
48. Reviving nuclear translation
- Author
-
Gerben Menschaert and Petra Van Damme
- Subjects
Genetics ,Messenger RNA ,biology ,RNA ,Translation (biology) ,Cell Biology ,Major histocompatibility complex ,Cell biology ,Histocompatibility Antigens Class I ,biology.protein ,Protein biosynthesis ,Signal transduction ,Molecular Biology ,Antigenic peptide - Abstract
Noncanonical translation of prespliced mRNA provides physiological meaning to nuclear translation in generating antigenic peptide substrates for the endogenous major histocompatibility complex class I pathway.
- Published
- 2013
- Full Text
- View/download PDF
49. PROTEOFORMER: deep proteome coverage through ribosome profiling and MS integration
- Author
-
Gerben Menschaert, Alexander Koch, Jeroen Crappé, Sarah De Keulenaer, Tim De Meyer, Petra Van Damme, Ellen De Meester, Wim Van Criekinge, Daria Gawron, Sandra Steyaert, and Elvis Ndah
- Subjects
Proteomics ,Proteome ,SPECTROMETRY-BASED PROTEIN ,PREDICTION ,LARGE NONCODING RNAS ,Molecular Sequence Data ,Computational biology ,Biology ,Mass Spectrometry ,Mice ,Open Reading Frames ,Genetics ,Protein biosynthesis ,Animals ,Humans ,Amino Acid Sequence ,RNA, Messenger ,Ribosome profiling ,Databases, Protein ,Cells, Cultured ,PROVIDES EVIDENCE ,Genome ,COMPLEXITY ,Sequence Homology, Amino Acid ,IDENTIFICATION ,Sequence database ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Reproducibility of Results ,Biology and Life Sciences ,Translation (biology) ,MASS-SPECTROMETRY ,HCT116 Cells ,Open reading frame ,MRNA Sequencing ,Protein Biosynthesis ,DISCOVERY ,CELLS ,Methods Online ,TRANSLATION ,Ribosomes - Abstract
An increasing amount of studies integrate mRNA sequencing data into MS-based proteomics to complement the translation product search space. However, several factors, including extensive regulation of mRNA translation and the need for three- or six-frame-translation, impede the use of mRNA-seq data for the construction of a protein sequence search database. With that in mind, we developed the PROTEOFORMER tool that automatically processes data of the recently developed ribosome profiling method (sequencing of ribosome-protected mRNA fragments), resulting in genome-wide visualization of ribosome occupancy. Our tool also includes a translation initiation site calling algorithm allowing the delineation of the open reading frames (ORFs) of all translation products. A complete protein synthesis-based sequence database can thus be compiled for mass spectrometry-based identification. This approach increases the overall protein identification rates with 3% and 11% (improved and new identifications) for human and mouse, respectively, and enables proteome-wide detection of 5'-extended proteoforms, upstream ORF translation and near-cognate translation start sites. The PROTEOFORMER tool is available as a stand-alone pipeline and has been implemented in the galaxy framework for ease of use.
- Published
- 2014
- Full Text
- View/download PDF
50. Bioinformatics solutions for MS/MS interpretation tailored to the peptidomics field
- Author
-
Gerben Menschaert, Tt, Vandekerckhove, Baggerman G, Landuyt B, Schoofs L, Luyten W, and Van Criekinge W
- Subjects
Proteomics ,_none ,Tandem Mass Spectrometry ,Computational Biology - Abstract
ispartof: Communications in Agricultural and Applied Biological Sciences vol:74 issue:4 pages:67-73 ispartof: location:Belgium status: published
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.