1. The mzIdentML Data Standard Version 1.2, Supporting Advances in Proteome Informatics
- Author
-
Fawaz Ghali, Mathias Walzer, Eugen Netz, Salvador Martínez-Bartolomé, Tobias Ternent, Eric W. Deutsch, Andrew R. Jones, Oliver Kohlbacher, Robert J. Chalkley, Gerhard Mayer, Juan Antonio Vizcaíno, Juri Rappsilber, Yasset Perez-Riverol, Alexander Leitner, Simon Perkins, Lutz Fischer, Julian Uszkoreit, Harald Barsnes, Marc Vaudel, and Martin Eisenacher
- Subjects
QA75 ,Proteomics ,0301 basic medicine ,Biochemistry & Molecular Biology ,Computer science ,Bioengineering ,Biochemistry ,Analytical Chemistry ,World Wide Web ,QH301 ,Databases ,03 medical and health sciences ,Documentation ,Software ,Controlled vocabulary ,Databases, Protein ,Molecular Biology ,030102 biochemistry & molecular biology ,Proteomics Standards Initiative ,business.industry ,Protein ,Technological Innovation and Resources ,Computational Biology ,Proteogenomics ,Data Standard ,Identification (information) ,030104 developmental biology ,XML Schema (W3C) ,Networking and Information Technology R&D (NITRD) ,Generic health relevance ,Software engineering ,business ,Biotechnology - Abstract
The first stable version of the Proteomics Standards Initiative mzIdentML open data standard (version 1.1) was published in 2012—capturing the outputs of peptide and protein identification software. In the intervening years, the standard has become well-supported in both commercial and open software, as well as a submission and download format for public repositories. Here we report a new release of mzIdentML (version 1.2) that is required to keep pace with emerging practice in proteome informatics. New features have been added to support: (1) scores associated with localization of modifications on peptides; (2) statistics performed at the level of peptides; (3) identification of cross-linked peptides; and (4) support for proteogenomics approaches. In addition, there is now improved support for the encoding of de novo sequencing of peptides, spectral library searches, and protein inference. As a key point, the underlying XML schema has only undergone very minor modifications to simplify as much as possible the transition from version 1.1 to version 1.2 for implementers, but there have been several notable updates to the format specification, implementation guidelines, controlled vocabularies and validation software. mzIdentML 1.2 can be described as backwards compatible, in that reading software designed for mzIdentML 1.1 should function in most cases without adaptation. We anticipate that these developments will provide a continued stable base for software teams working to implement the standard., Molecular & Cellular Proteomics, 16 (7), ISSN:1535-9476, ISSN:1535-9484
- Published
- 2017
- Full Text
- View/download PDF