RT Journal Article SR Electronic T1 Understanding the limit of open search in the identification of peptides with post-translational modifications — A simulation-based study JF bioRxiv FD Cold Spring Harbor Laboratory SP 289710 DO 10.1101/289710 A1 Jiaan Dai A1 Fengchao Yu A1 Ning Li A1 Weichuan Yu YR 2018 UL http://biorxiv.org/content/early/2018/03/28/289710.abstract AB Motivation Analyzing tandem mass spectrometry data to recognize peptides in a sample is the fundamental task in computational proteomics. Traditional peptide identification algorithms perform well when identifying unmodified peptides. However, when peptides have post-translational modifications (PTMs), these methods cannot provide satisfactory results. Recently, Chick et al., 2015 and Yu et al., 2016 proposed the spectrum-based and tag-based open search methods, respectively, to identify peptides with PTMs. While the performance of these two methods is promising, the identification results vary greatly with respect to the quality of tandem mass spectra and the number of PTMs in peptides. This motivates us to systematically study the relationship between the performance of open search methods and quality parameters of tandem mass spectrum data, as well as the number of PTMs in peptides.Results Through large-scale simulations, we obtain the performance trend when simulated tandem mass spectra are of different quality. We propose an analytical model to describe the relationship between the probability of obtaining correct identifications and the spectrum quality as well as the number of PTMs. Based on the analytical model, we can quantitatively describe the necessary condition to effectively apply open search methods.Availability Source codes of the simulation are available at http://bioinformatics.ust.hk/PST.html.Contact boningli{at}ust.hk or eeyu{at}ust.hkSupplementary information Supplementary data are available at Bioinformatics online.