Back to Search
Start Over
OpenPepXL: An Open-Source Tool for Sensitive Identification of Cross-Linked Peptides in XL-MS
- Source :
- Mol Cell Proteomics
- Publication Year :
- 2020
- Publisher :
- Elsevier BV, 2020.
-
Abstract
- Cross-linking MS (XL-MS) has been recognized as an effective source of information about protein structures and interactions. In contrast to regular peptide identification, XL-MS has to deal with a quadratic search space, where peptides from every protein could potentially be cross-linked to any other protein. To cope with this search space, most tools apply different heuristics for search space reduction. We introduce a new open-source XL-MS database search algorithm, OpenPepXL, which offers increased sensitivity compared with other tools. OpenPepXL searches the full search space of an XL-MS experiment without using heuristics to reduce it. Because of efficient data structures and built-in parallelization OpenPepXL achieves excellent runtimes and can also be deployed on large compute clusters and cloud services while maintaining a slim memory footprint. We compared OpenPepXL to several other commonly used tools for identification of noncleavable labeled and label-free cross-linkers on a diverse set of XL-MS experiments. In our first comparison, we used a data set from a fraction of a cell lysate with a protein database of 128 targets and 128 decoys. At 5% FDR, OpenPepXL finds from 7% to over 50% more unique residue pairs (URPs) than other tools. On data sets with available high-resolution structures for cross-link validation OpenPepXL reports from 7% to over 40% more structurally validated URPs than other tools. Additionally, we used a synthetic peptide data set that allows objective validation of cross-links without relying on structural information and found that OpenPepXL reports at least 12% more validated URPs than other tools. It has been built as part of the OpenMS suite of tools and supports Windows, macOS, and Linux operating systems. OpenPepXL also supports the MzIdentML 1.2 format for XL-MS identification results. It is freely available under a three-clause BSD license at https://openms.org/openpepxl.
- Subjects :
- Models, Molecular
Computer science
computer.software_genre
Biochemistry
Mass Spectrometry
Analytical Chemistry
Set (abstract data type)
Reduction (complexity)
03 medical and health sciences
Humans
Database search engine
Amino Acid Sequence
Databases, Protein
Molecular Biology
030304 developmental biology
0303 health sciences
030302 biochemistry & molecular biology
Technological Innovation and Resources
Data structure
Data set
Identification (information)
Cross-Linking Reagents
HEK293 Cells
Memory footprint
Data mining
Peptides
Heuristics
Ribosomes
computer
Algorithms
Software
Subjects
Details
- ISSN :
- 15359476
- Volume :
- 19
- Database :
- OpenAIRE
- Journal :
- Molecular & Cellular Proteomics
- Accession number :
- edsair.doi.dedup.....358b32b3ec68c168a8f1302aa823c6f9
- Full Text :
- https://doi.org/10.1074/mcp.tir120.002186