ABSTRACT
Background In proteomics, mass spectra representing peptides carrying multiple unknown modifications are particularly difficult to interpret, which results in a large number of unidentified spectra.
Methods We developed SpecGlob, a dynamic programming algorithm that aligns pairs of spectra, each such pair being a Peptide-Spectrum Match (PSM) provided by any Open Modification Search (OMS) method. For each PSM, SpecGlob computes the best alignment according to a given score system, while interpreting the mass delta within the PSM as one or several unspecified modification(s). All alignments are provided in a file, written in a specific syntax.
Results Using several sets of simulated spectra generated from the human proteome, we demonstrate that running SpecGlob as a post-analysis of an OMS method can significantly increase the number of correctly interpreted spectra, as SpecGlob is able to correctly and rapidly align spectra that differ by one or more modification(s) without any a priori.
Conclusion Since SpecGlob explores all possible alignments that may explain the mass delta within a PSM, it reduces interpretation errors generated by incorrect assumptions about the modifications present in the sample or the number and the specificities of modifications carried by peptides. Our results demonstrate that SpecGlob should be relevant to align experimental spectra, although this consists in a more challenging task.
Competing Interest Statement
The authors have declared no competing interest.
Abbreviations
- OMS
- Open Modification Search
- PSM
- Peptide Spectrum Match
- SPC
- Shared Peaks Count