Back to Search
Start Over
Mspire-Simulator: LC-MS Shotgun Proteomic Simulator for Creating Realistic Gold Standard Data
- Source :
- Journal of Proteome Research. 12:5742-5749
- Publication Year :
- 2013
- Publisher :
- American Chemical Society (ACS), 2013.
-
Abstract
- The most important step in any quantitative proteomic pipeline is feature detection (aka peak picking). However, generating quality hand-annotated data sets to validate the algorithms, especially for lower abundance peaks, is nearly impossible. An alternative for creating gold standard data is to simulate it with features closely mimicking real data. We present Mspire-Simulator, a free, open-source shotgun proteomic simulator that goes beyond previous simulation attempts by generating LC-MS features with realistic m/z and intensity variance along with other noise components. It also includes machine-learned models for retention time and peak intensity prediction and a genetic algorithm to custom fit model parameters for experimental data sets. We show that these methods are applicable to data from three different mass spectrometers, including two fundamentally different types, and show visually and analytically that simulated peaks are nearly indistinguishable from actual data. Researchers can use simulated data to rigorously test quantitation software, and proteomic researchers may benefit from overlaying simulated data on actual data sets.
- Subjects :
- Proteomics
Computer science
Pipeline (computing)
Molecular Sequence Data
Overlay
Biochemistry
Mass Spectrometry
Software
Genetic algorithm
Animals
Humans
Computer Simulation
Amino Acid Sequence
Simulation
Feature detection (computer vision)
Models, Statistical
business.industry
Proteins
Experimental data
General Chemistry
Gold standard (test)
Reference Standards
Cattle
Noise (video)
business
Algorithms
Chromatography, Liquid
Subjects
Details
- ISSN :
- 15353907 and 15353893
- Volume :
- 12
- Database :
- OpenAIRE
- Journal :
- Journal of Proteome Research
- Accession number :
- edsair.doi.dedup.....a938ee5adfa3e2ce10111317b3f033b9