Back to Search
Start Over
DISMS2: A flexible algorithm for direct proteome- wide distance calculation of LC-MS/MS runs
- Source :
- BMC Bioinformatics, BMC bioinformatics, 18:148
- Publication Year :
- 2016
-
Abstract
- Background The classification of samples on a molecular level has manifold applications, from patient classification regarding cancer treatment to phylogenetics for identifying evolutionary relationships between species. Modern methods employ the alignment of DNA or amino acid sequences, mostly not genome-wide but only on selected parts of the genome. Recently proteomics-based approaches have become popular. An established method for the identification of peptides and proteins is liquid chromatography-tandem mass spectrometry (LC-MS/MS). First, protein sequences from MS/MS spectra are identified by means of database searches, given samples with known genome-wide sequence information, then sequence based methods are applied. Alternatively, de novo peptide sequencing algorithms annotate MS/MS spectra and deduce peptide/protein information without a database. A newer approach independent of additional information is to directly compare unidentified tandem mass spectra. The challenge then is to compute the distance between pairwise MS/MS runs consisting of thousands of spectra. Methods We present DISMS2, a new algorithm to calculate proteome-wide distances directly from MS/MS data, extending the algorithm compareMS2, an approach that also uses a spectral comparison pipeline. Results Our new more flexible algorithm, DISMS2, allows for the choice of the spectrum distance measure and includes different spectra preprocessing and filtering steps that can be tailored to specific situations by parameter optimization. Conclusions DISMS2 performs well for samples from species with and without database annotation and thus has clear advantages over methods that are purely based on database search. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1514-2) contains supplementary material, which is available to authorized users.
- Subjects :
- 0301 basic medicine
Proteomics
Comparison of MS/MS spectra
Proteome
Computer science
Peptide
Mass spectrometry
Genome
Tandem mass spectrum
Biochemistry
03 medical and health sciences
chemistry.chemical_compound
Structural Biology
Tandem Mass Spectrometry
Distance of LC-MS/MS runs
Humans
Amino Acid Sequence
LC-MS/MS
Peptide identification
Databases, Protein
Molecular Biology
chemistry.chemical_classification
Applied Mathematics
De novo peptide sequencing
Amino acid
Computer Science Applications
Identification (information)
030104 developmental biology
chemistry
Peptides
Algorithm
DNA
Algorithms
Chromatography, Liquid
Research Article
Subjects
Details
- ISSN :
- 14712105
- Volume :
- 18
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC bioinformatics
- Accession number :
- edsair.doi.dedup.....1400fcad25d41a7f6991ae9ce9b56cff