Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

DO-MS: Data-Driven Optimization of Mass Spectrometry Methods

Gray Huffman, View ORCID ProfileHarrison Specht, View ORCID ProfileAlbert Chen, View ORCID ProfileNikolai Slavov
doi: https://doi.org/10.1101/512152
Gray Huffman
†Department of Bioengineering, Northeastern University, Boston, MA 02115, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Harrison Specht
†Department of Bioengineering, Northeastern University, Boston, MA 02115, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Harrison Specht
Albert Chen
†Department of Bioengineering, Northeastern University, Boston, MA 02115, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Albert Chen
Nikolai Slavov
†Department of Bioengineering, Northeastern University, Boston, MA 02115, USA
‡Department of Biology, Northeastern University, Boston, MA 02115, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nikolai Slavov
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

The performance of ultrasensitive LC-MS/MS methods, such as Single-Cell Proteomics by Mass Spectrometry (SCoPE-MS), depends on multiple interdependent parameters. This interdependence makes it challenging to specifically pinpoint bottlenecks in the LC-MS/MS methods and approaches for resolving them. For example, low signal at MS2 level can be due to poor LC separation, ionization, apex targeting, ion transfer, or ion detection. We sought to specifically diagnose such bottlenecks by interactively visualizing data from all levels of bottom-up LC-MS/MS analysis. Many search engines, such as MaxQuant, already provide such data, and we developed an open source platform for their interactive visualization and analysis: Data-driven Optimization of MS (DO-MS). We found that in many cases DO-MS not only specifically diagnosed bottlenecks but also enabled us to rationally optimize them. For example, we used DO-MS to diagnose poor sampling of the elution peak apex and to optimize it, which increased the efficiency of delivering ions for MS2 analysis by 370%. DO-MS is easy to install and use, and its GUI allows for interactive data subsetting and high-quality figure generation. The modular design of DO-MS facilitates customization and expansion. DO-MS is available for download from GitHub: github.com/SlavovLab/DO-MS

Introduction

Analytical methods combining liquid chromatography and tandem mass-spectrometry (LC-MS/MS) allow for unparalleled identification and relative quantitation of the protein components of biological systems.1–4 Advances in LC-MS/MS have enabeled analysis of protein complexes and their functions,5–9 regulation of protein synthesis and alternative RNA translation,10,11 rare cells in blood,12,13 and protein conformations.14–16 The increasing sensitivity,17–23 throughput, and robustness24 of LC-MS/MS set the stage for quantifying thousands of proteins across many thousands of single cells, providing data with transformative potential for biomedical research.25–29

While LC-MS/MS proteomics methods are very powerful, they require extensive optimization of interdependent instrument parameters. Optimization is particularly critical for quantifying low-input samples, e.g., single-cell proteomes and exosomes. LC-MS/MS optimization and quality control (QC) can be performed by manually inspecting the features of peptide standards30–32 within MS instrument software, e.g., Thermo Scientific Xcalibur, or by specialized software packages. Following the National Institute of Standards and Technology’s QC data analysis pipeline,33 numerous QC programs have been developed. These programs capitalize on advances in search engines,34 video analysis,35 direct analysis of raw data,36 and database management tools37 to track instrument performance over time and assist in duty cycle optimization. Other platforms, such as MSStatsQC 2.0, employ specialized statistical methods to differentiate normal variation in instrument performance from novel variation as a means to detect instrument problems early.38 A comprehensive review of LC-MS/MS QC and optimization tools has been published by Bittremieux et al.39

We found these tools useful in developing Single-Cell Proteomics by Mass Spectrometry (SCoPE-MS), which combines TMT-labeled peptides from single cells with a TMT-labeled carrier channel to enable quantifying proteins across many single cells.25–27,40 However, none of these tools provided all of the metrics needed for optimizating our SCoPE-MS analysis.40 This motivated us to develop DO-MS, a highly modular and interactive environment for optimizing ultra-sensitive LC-MS/MS methods.

DO-MS aims to diagnose problems and suggest solutions as specifically as possible. To illustrate this point, here we describe concrete examples, including optimizing apex targeting, assessing contamination, and evaluating SCoPE-MS results. In order to enable specific diagnosis, the DOMS dashboard juxtaposes distribution plots of data from multiple levels of LC-MS/MS analysis, including retention lengths at base and mid height, intensity of all ions and of precursors selected for MS/MS, elution peak apex offset, number of MS/MS events, MS2-level coisolation (i.e., parent ion fraction), number of identified peptides at all confidence levels and quantification benchmarks. These features are organized thematically in the dashboard for ease of reference. DO-MS has already enabled us to quickly identify bottlenecks in our methods, their exact origin in the workflow, and potential solutions. Below, we share DO-MS along with a selection of examples from our work in the hope that it will facilitate the wider adoption and advancement of single-cell proteomics. We also hope that the modular nature of our platform will enable the community to add new modules for optimizing LC-MS/MS for an expanding array of applications.

Materials and Methods

Implementation

DO-MS is implemented as a Shiny app, built using R.41 All plots are generated using the ggplot2 package.42,43 Shiny was chosen for its interactivity, allowing data to be dynamically subset based on experiment or confidence of peptide spectral match. Additionally, this package can be run from the command line or the RStudio IDE. DO-MS is available from the Slavov Lab GitHub page: github.com/SlavovLab/DO-MS

Data Preprocessing

The experimental data used here were generated as part of developing and optimizing Minimal ProteOmic sample Preparation (mPOP)27 and SCoPE-MS,40 and a full description of the experiments can be found in Specht et al.27 Briefly, all samples were separated on a 25cm length × 75μm Waters nanoEase column (1.7 μm resin, Waters PN:186008795) run by a Proxeon Easy nLC1200 UHPLC (Thermo Scientific). All samples were analyzed by a Thermo Scientific Q-Exactive mass spectrometer. Prior to running DO-MS, RAW files were searched using MaxQuant 1.6.0.16.44–46 The human SwissProt FASTA database (39748 entries, downloaded 5/1/2018) was used for searching data from U-937 and Jurkat cells. MaxQuant searches were conducted as previously described27,46. Trypsin was specified as the digest enzyme, and a maximum of two missed cleavages were allowed for peptides between 5 and 26 amino acids long. Methionine oxidation (+ 15.99491 Da) and protein n-terminus acetylation (+42.01056 Da) were specified as variable modifications. The allPeptides.txt, evidence.txt, msmsScans.txt, and msms.txt files output by MaxQuant were imported by DO-MS for analysis and figure generation. In order to make the greatest use of the DO-MS platform, users must enable the Calculate Peak Properties option on the Advanced submenu of MaxQuant’s Global Parameters tab44,46. This option can be automatically enabled by using the mqpar.xml files from the supporting files of this manuscript. Searches conducted without enabling this option will not generate plots for the Elution Peak Apex Offset and Peak Width at Full Width Half Max panels of the DO-MS dashboard. DO-MS is currently optimized for MaxQuant search results but users can customize it to work with alternative search-engine outputs by specifying the column headers corresponding to the data selected for visualization.

Visualization

DO-MS’s diagnostic plots have been organized into the following five categories: Chromatography, Instrument Performance, Contamination, Peptide Identifications, SCoPE-MS Diagnostics, and DART-ID47. Data are visualized as full distributions using vertically oriented histograms to avoid kernel-smoothing issues. This approach is advantageous as distinct data sets may have similar summary statistics but markedly different distributions of data points.48,49 The full distributions allow subpopulations of ions to be identified, which can be key to optimizing LC-MS/MS performance. Additionally, these distributions can be conditioned on common ions, allowing for a more principled comparison, as discussed in the Results section.

Data imported into DO-MS can be subset based on confidence of peptide spectral match assignment and experiment name using a slider and dynamically populated list, respectively. DO-MS relies on the posterior error probability (PEP), as estimated by MaxQuant, to indicate the confidence of a given peptide spectral match (PSM). The PEP value can be thought of as the probability that the identified peptide was not in the mass spectrometer at the time the spectra was acquired.50 By default, DO-MS labels experiments by their corresponding raw file names. If desired, experiments can be labeled via a text-input field. Such labeling can enhance the clarity of figures and thus facilitate their analysis and broader interpretability if the figures are intended for publication.

Report Generation and Figure Output

To facilitate the sharing of experimental results, users can output the DO-MS dashboard plots as an HTML report which reflects all data subsetting performed in the app, as well as user-supplied experimental labels. Report generation is achieved via a button in the “Report Generation” dashboard tab. Dashboard plots can also be saved as individual.png or .pdf files for use in presentations and publications directly from the dashboard tabs on which they appear.

User Customization

We built DO-MS as a modular application to make dashboard customization and expansion as easy as possible. Each plot is generated from a separate R file, and a template file has been provided as a guide for users interested in including additional plots in their dashboard. Adding plots to the DO-MS dashboard can be accomplished by adding a customized template.r file to a new folder in the “modules” directory. After reloading the app, the new plot will automatically appear in the DO-MS dashboard in the user-specified tab. More details for this process can be found on the Github project page: github.com/SlavovLab/DO-MS

Results and Discussion

Sampling the Elution Peak Apex

LC-MS/MS methods aim to sample the elution profile of each peptide at its apex because such sampling maximizes the number and purity of sampled ions.51 However, no exisiting method can target the apex of every ion. Rather, instrument parameters can be optimized to maximize the fraction of ions sent for MS/MS at or close to their apexes. DO-MS facilitates this optimization by visualizing the distribution of apex offsets (i.e., the time offset between the apex of an ion and the time when it is MS/MSed) as estimated by MaxQuant.

The visualization of apex offsets, as shown in Fig. 1a, enabled us to rationally optimize the instrument methods for analyzing SCoPE-MS sets. To resolve the reporter ions of a TMT 11plex, we perform the MS2 scans at 70,000 resolving power, which on our Q-exactive takes 256 ms.52 Since Q-exactive instruments can perform the ion accumulation and MS scan in parallel, we started by setting our max fill (ion accumulation) times for MS2 scans to be 250ms.52 This setting resulted in sampling most ions for MS2 scans too early, significantly before the apexes of their elution peaks, Fig. 1a. We reasoned that such premature sampling could be alleviated by elongating the duty cycle either by increasing the number of MS/MSed ions per cycle or by increasing the max fill times. Indeed, increasing the fill times to 500ms and 1000ms, while keeping all other parameters constant, increased the fraction of ions whose elution peaks are sampled at or near the apex, Fig. 1a.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1: By rationally optimizing apex targeting, DO-MS increased ion delivery and peptide identification rates

(a) Distributions of apex offsets for three runs on 90-minute gradients. Each injection was of 1 μl from the same vial and corresponds to 1 × M dilution of a SCoPE-MS master sample as described by Specht et al.27 All LC parameters and instrument parameters were set to be the same except for the max fill time. (b) The relative efficiency of delivering ions for MS2 analysis was estimated by the intensities of reporter ions (RI). For each peptide identified across all 3 experiments, the RI intensity was divided by the corresponding RI intensity for 250ms fill time and the results for all peptides are shown as distributions on a log2 scale. (c) The number of peptides identified at each PEP threshold is shown as a rank sorted list for the three fill times. This display shows the number of peptides for all levels of confidence of identification, as quantified by the PEP.

The increased fill times and improved apex targeting increased ion delivery for MS2 analysis (Fig. 1b) and the number of identified peptides at all levels of confidence, as shown in Fig. 1c. Rather than displaying the number of identified peptides at an arbitrary confidence cut-off, DOMS plots all peptides rank-sorted by the posterior error probability (PEP) of their identification, Fig. 1c. These curves show the number of identifications for all levels of confidence. The dashed grey line on the plot denotes peptides with a 99% probability of having been correctly identified. Such curves offer insight into low-confidence peptide identifications which might be boosted by incorporating additional features, such as retention time.46,47

This example is consistent with previous observations that longer accumulation times can increase the number of confident peptide identifications for lowly abundant samples17,40. Furthermore, this example underscores that fill times can strongly influence apex targeting. Such optimization of duty-cycle and apex-targeting is sample and system dependent, and thus it requires methods that allow for rational optimization of all levels of MS analysis, such as DO-MS.

Characterizing Contamination

Contaminants of non-protein origin are very common in proteomics experiments.34,53 At best, they are lowly abundant and elute separately from peptides; at worst, they are highly abundant and elute with peptides, undermining ionization, charge determination, and identification. Low-input samples are especially sensitive to contaminants, as the ratio between target and contaminant ions is more likely to be lower. DO-MS displays contaminants across the LC gradient by plotting the intensity of ions with a +1 charge state (z=1) for each minute of the gradient. This type of data presentation allows users to distinguish between hydrophilic and hydrophobic contaminants.

Due to the mosaic structure of the DO-MS dashboard, potential relationships between factors such as contaminant ion intensity and peptide identifications can be easily seen, as exemplified in Fig. 2. The juxtaposition of summed precursor intensities for contaminant ions (Fig. 2a) and for peptides (Fig. 2b) shows a clear correlation: Sample 2 has more hydrophobic contaminants and their elution coincides with reduced peptide identifications. This correlation immediately suggested that the lower identification rates for sample 2 were due to its contamination, which we subsequently identified to be polyethylene glycol (PEG).

Figure 2:
  • Download figure
  • Open in new tab
Figure 2: Diagnosing reduced peptide identification due to co-eluting contaminants

Plotting the cumulative intensities for all +1 ions detected during the survey scans (a) alongside the number of peptides identified across the gradient (b) can reveal correlations between co-eluting contaminants and reduced peptide identification. (c) Number of all detected ions by charge states. (d) Peptides were rank sorted by their PEPs to display the number of identified peptides across all levels of confidence.

As an additional diagnostic plot for contamination, DO-MS displays the number of ions detected by the instrument by charge state as shown in Fig. 2c. This compact display is particularly useful for comparing ions likely to be contaminants (with charge +1) and those likely to be peptides (with charge ≥ 2) across many runs. The negative impact of these numerous contaminantions on peptide identifications can be seen in Fig. 2d. Once the presence of contaminants has been diagnosed with DO-MS, several existing software tools can be applied to more fully characterize the type of contamination present in each sample.34,53

Controlled Sample Comparisons

While distributions are much more informative than their summary statistics, comparing distributions for different populations can still be misleading. Thus, when comparing the distributions plotted by DO-MS, it is important to control for (condition on) the composition of the distributions, e.g., the peptides comprising each distribution. The importance of such controlled comparison is exemplified in Fig. 3 with an experiment testing the effect of calcium addition on trypsin digestion. For this experiment, a sample was split into two equal parts, sample 1 and sample 2. They were processed in identical ways, except that 50mM CaCl2 was added to sample 2 during its digestion.

Figure 3:
  • Download figure
  • Open in new tab
Figure 3: Controlled comparison of peptide abundances across experiments

Without controlling for the composition of the two populations being compared, trends in the data can be misread. In this case, when comparing the distribution of precursor intensities for all peptides identified in each sample (a), Sample 1 appears to have more highly abundant peptides. However, when ensuring that the comparison is only based on those peptides identified in each sample (b), the opposite trend becomes apparent: namely, that the peptide species in Sample 2 were more highly abundant.

Both samples were digested with 20ng/μl Promega Trypsin/ LysC mix. The distributions of all peptide abundances and the corresponding median abundances (Fig. 3a) may be interpreted to suggest that sample 1 resulted in more efficient delivery of peptides to MS analysis and the addition of calcium was detrimental. However, conditioning the comparison on the common peptides indicates the opposite conclusion: The addition of calcium chloride to sample 2 during its digestion resulted in more efficient delivery of peptides to the instrument.

Low-Input Sample Diagnostics

In the process of developing Minimal ProteOmic sample Preparation (mPOP)27 and SCoPE-MS,40 we discovered a number of helpful metrics for diagnosing SCoPE-MS sample preparation and optimizing instrument parameters for low-input samples. Two such metrics are relative reporter ion (RI) intensities and the correlations among them. These metrics indicate the efficiency of single-cell preparations and quantitative data quality.

To optimize SCoPE-MS, we used bulk SCoPE-MS sets from which a single injection (1%) corresponds to a single-cell SCoPE-MS set. Such samples allowed us to assess and optimize instrument performance independent of sample variability (since we could inject multiple aliquots of the same sample) and to assess quantification, as we had a strong expectation that the pseudosingle-cell channels should correlate positively to their corresponding carrier channel. Below we demonstrate diagnosis of such a sample by DO-MS; the sample preparation is described in detail by Specht et al27. Briefly, the sample was composed of 10 TMT channels containing serial dilutions of digested cell lysate from two cell types: U-937 (monocytes, denoted by U) and Jurkat (T-cells, denoted by J). This reference standard is diluted so that 1 μl corresponds to a SCoPE-MS set and contains peptide input equivalent to about 106 single cells (20 — 50ng of total protein). The reference samples have two carrier channels, each of which contains peptide input comparable to 50 cells of each type, and six channels contain peptide inputs comparable to individual single cells, three of each type.27

The distribution of reporter ion (RI) intensities from a SCoPE-MS set can be an informative indicator. Low RI intensities may be due to failed cell isolation, digestion, or labeling. High RI intensities may be due to background contamination or cross-labeling. Such deviations may be diagnosed simply from the distributions of RI intensities. However these distributions are quite broad since the RI intensities for quantified peptides often span several orders of magnitude. To decrease this dynamic range, DO-MS plots the distribution of relative RI intensities: for each peptide, the RI intensities are scaled by dividing them by the RI intensity in the most abundant channel (the one with the highest median RI intensity), Fig. 4a. This visualization clearly indicates whether the single-cell channels have about 50-fold lower median intensity than a 50-cell carrier channel, as well as the background signal (including isotopic contamination) in the empty channels. Such a diagnostic is helpful for determining label-quenching efficiency, as well as the efficiency of cell sorting by FACS. By examining the relative RI intensities present in blank channels, one can also assess the amount of background signal present and the degree of isotopic carryover.

Figure 4:
  • Download figure
  • Open in new tab
Figure 4: Evaluating low-input samples, such as SCoPE-MS sets

(a) the distributions of relative reporter ion (RI) intensities can indicate the relative amount of peptides and the efficiency of sample preparation for each channel. (b) the matrix of pairwise correlations among all all channels of a SCoPE-MS set can be used to benchmark relative quantification within that set. In (a) we expect single-cell channels to have relative RI intensities that are 50-fold lower than the 50-cell carrier channels (about 1.5 on log10 scale). In (b), we expect singlecell channels to correlate positively with single-cell channels and carrier channels that contain their respective cell type, while cross-cell-type correlations for single-cell channels are expected to be negative.

The quantitative accuracy of SCoPE-MS sets can be benchmarked by comparing the relative quantification from the carrier channels and single-cell channels as quantified by the correlations among them. DO-MS computes all possible pairwise correlations, i.e. the correlation matrix for the column and row-normalized RI intensities. This matrix can serve as a complimentary diagnostic for relative quantification in SCoPE-MS sets based on the expectation that cells of the same type should correlate positively with each other but not with different cell types. Furthermore, blank channels should not correlate positively with either the single-cell or carrier channels. This expectation is consistent with the correlations shown in Fig. 4b. This correlation matrix can also be useful for identifying cross-contamination, cross-labeling, and on-column carryover.

These SCoPE-MS plots should be analyzed in the context of distribution plots reporting on all levels of the LC-MS/MS analysis so that the origin of problems can be identified and interdependent parameters optimized. We hope that these metrics will assist the wide adoption of low-input sample preparation and analysis methods.

Conclusion

Optimal instrument parameters are context dependent and thus should be determined systematically by data-driven approaches for each set of samples and LC-MS/MS configurations. DO-MS enables such optimization. Sometimes, this data-driven approach results in counter-intuitive results, as demonstrated in Fig. 1 with the increased number of identified peptides at longer max fill times. By increasing our MS2 injection time, and consequently our duty cycle length, we managed to increase the number of confidently identified peptides in our sample. This result contrasts with the strategy commonly employed by bulk proteomics methods, namely seeking to increase peptide identification by increasing MS2 sampling frequency and thus decreasing the fill time for each MS/MS.

The DO-MS dashboard is an open-source, GUI-based tool for quickly assessing LC-MS/MS parameter optimization strategies and sample quality in single-cell proteomics experiments. This diagnostic platform can assist other labs in adopting ultrasensitive, low-input LC-MS/MS methods, such as SCoPE-MS, and can serve as a highly adaptable data visualization tool for proteomics researchers.

Availability and Documentation

The DO-MS Shiny dashboard, basic documentation, and a sample data set are currently available through a git repository at github.com/SlavovLab/DO-MS, under the MIT license. A free, web-based version of the DO-MS is also accessible at singlecellproteomics.shinyapps.io/QCtest/, although it is recommended that interested users install the dashboard locally to avoid data upload restrictions and monthly activity caps.

Funding Sources

This project was supported by a New Innovator Award from the NIGMS from the National Institutes of Health to N.S. under Award Number DP2GM123497

Acknowledgments

The catalyst for creating this package was a break-out discussion group led by Jürgen Cox at the 2018 Single-Cell Proteomics conference. Special thanks goes to Jürgen Cox for structuring this discussion, as well as to Ezra Levy, Guillaume Harmange, and Hung-Yi Wu for their part in outlining the characteristics that an instrument-parameter optimization and sample-quality analysis software package should possess. We would also like to thank Ed Emmott, David Perlman, and Toni Koller for their valuable feedback and troubleshooting efforts during the package development process.

Footnotes

  • ↵* E-mail: nslavov{at}northeastern.edu

References

  1. (1).↵
    Yates, J. R. Mass spectrometry and the age of the proteome. Journal of mass spectrometry: JMS 1998, 33, 1–19.
    OpenUrl
  2. (2).
    Yates, J. R.; Ruse, C. I.; Nakorchevsky, A. Proteomics by Mass Spectrometry: Approaches, Advances, and Applications. Annual Review of Biomedical Engineering 2009, 11, 49–79.
    OpenUrlCrossRefPubMedWeb of Science
  3. (3).
    Nesvizhskii, A. I. Proteogenomics: concepts, applications and computational strategies. Nature methods 2014, 11, 1114.
    OpenUrl
  4. (4).↵
    Aebersold, R.; Mann, M. Mass-spectrometric exploration of proteome structure and function. Nature 2016, 537, 347–355.
    OpenUrlCrossRefPubMed
  5. (5).↵
    Cravatt, B. F.; Simon, G. M.; Yates Iii, J. R. The biological impact of mass-spectrometrybased proteomics. Nature 2007, 450, 991.
    OpenUrlCrossRefPubMedWeb of Science
  6. (6).
    Leitner, A.; Faini, M.; Stengel, F.; Aebersold, R. Crosslinking and mass spectrometry: an integrated technology to understand the structure and function of molecular machines. Trends in biochemical sciences 2016, 41, 20–32.
    OpenUrlCrossRef
  7. (7).
    Slavov, N.; Semrau, S.; Airoldi, E.; Budnik, B.; van Oudenaarden, A. Differential stoichiometry among core ribosomal proteins. Cell Reports 2015, 13, 865–873.
    OpenUrl
  8. (8).
    Malioutov, D.; Chen, T.; Airoldi, E.; Jaffe, J.; Budnik, B.; Slavov, N. Quantify ing homologous proteins and proteoforms. Molecular & Cellular Proteomics 2018, doi:10.1074/mcp.TIR118.000947.
    OpenUrlAbstract/FREE Full Text
  9. (9).↵
    Emmott, E. P.; Jovanovic, M.; Slavov, N. Ribosome stoichiometry: from form to function. Trends in Biochemical Sciences 2018, doi: 10.1016/j.tibs.2018.10.009
    OpenUrlCrossRef
  10. (10).↵
    Trendel, J.; Schwarzl, T.; Horos, R.; Prakash, A.; Bateman, A.; Hentze, M. W.; Krijgsveld, J. The Human RNA-Binding Proteome and Its Dynamics during Translational Arrest. Cell 2018,
  11. (11).↵
    Slavoff, S. A.; Mitchell, A. J.; Schwaid, A. G.; Cabili, M. N.; Ma, J.; Levin, J. Z.; Karger, A. D.; Budnik, B. A.; Rinn, J. L.; Saghatelian, A. Peptidomic discovery of short open reading frame–encoded peptides in human cells. Nature chemical biology 2013, 9, 59.
    OpenUrlCrossRefPubMed
  12. (12).↵
    Li, S.; Plouffe, B. D.; Belov, A. M.; Ray, S.; Wang, X.; Murthy, S. K.; Karger, B. L.; Ivanov, A. R. An Integrated Platform for Isolation, Processing, and Mass Spectrometry-based Proteomic Profiling of Rare Cells in Whole Blood. Molecular & Cellular Proteomics 2015, 14, 1672–1683.
    OpenUrl
  13. (13).↵
    Shraibman, B. et al. Identification of tumor antigens among the HLA peptidomes of Glioblastoma tumors and plasma. Molecular & Cellular Proteomics 2018, mcp.RA118.000792.
  14. (14).↵
    Savitski, M. M.; Reinhard, F. B. M.; Franken, H.; Werner, T.; Savitski, M. F.; Eberhard, D.; Molina, D. M.; Jafari, R.; Dovega, R. B.; Klaeger, S.; Kuster, B.; Nordlund, P.; Bantscheff, M.; Drewes, G. Tracking cancer drugs in living cells by thermal profiling of the proteome. Science 2014, 346.
  15. (15).
    Yates, J. R. Innovation: Structural Proteomics Goes Global. Journal of Proteome Research 2018, 17, 3613–3613.
    OpenUrl
  16. (16).↵
    Kaur, U.; Meng, H.; Lui, F.; Ma, R.; Ogburn, R. N.; Johnson, J. H. R.; Fitzgerald, M. C.; Jones, L. M. Proteome-Wide Structural Biology: An Emerging Field for the Structural Analysis of Proteins on the Proteomic Scale. Journal of Proteome Research 2018, 17, 3614–3627.
    OpenUrl
  17. (17).↵
    Kelstrup, C. D.; Young, C.; Lavallee, R.; Nielsen, M. L.; Olsen, J. V. Optimized fast and sensitive acquisition methods for shotgun proteomics on a quadrupole orbitrap mass spectrometer. Journal of proteome research 2012, 11, 3487–3497.
    OpenUrlCrossRefPubMed
  18. (18).
    Virant-Klun, I.; Leicht, S.; Hughes, C.; Krijgsveld, J. Identification of maturation-specific proteins by single-cell proteomics of human oocytes. Molecular & Cellular Proteomics 2016, 15, 2616–2627.
    OpenUrl
  19. (19).
    Lombard-Banek, C.; Moody, S. A.; Nemes, P. Single-Cell Mass Spectrometry for Discovery Proteomics: Quantifying Translational Cell Heterogeneity in the 16-Cell Frog (Xenopus) Embryo. Angewandte Chemie International Edition 2016, 55, 2454–2458.
    OpenUrl
  20. (20).
    Cifani, P.; Kentsis, A. High sensitivity quantitative proteomics using automated multidimensional nano-flow chromatography and accumulated ion monitoring on quadrupole-Orbitraplinear ion trap mass spectrometer. Molecular & Cellular Proteomics 2017,
  21. (21).
    Shao, X.; Wang, X.; Guan, S.; Lin, H.; Yan, G.; Gao, M.; Deng, C.; Zhang, X. Integrated Proteome Analysis Device for Fast Single-Cell Protein Profiling. Analytical Chemistry 2018, 90, 14003–14010.
    OpenUrl
  22. (22).
    Shishkova, E.; Hebert, A. S.; Westphall, M. S.; Coon, J. J. Ultra-High Pressure (> 30, 000 psi) Packing of Capillary Columns Enhancing Depth of Shotgun Proteomic Analyses. Analytical chemistry 2018, 90, 11503–11508.
    OpenUrl
  23. (23).↵
    Macron, C.; Lane, L.; Nez Galindo, A.; Dayon, L. Deep Dive on the Proteome of Human Cerebrospinal Fluid: A Valuable Data Resource for Biomarker Discovery and Missing Protein Identification. Journal of Proteome Research 2018,
  24. (24).↵
    Bache, N.; Geyer, P. E.; Bekker-Jensen, D. B.; Hoerning, O.; Falkenby, L.; Treit, P. V.; Doll, S.; Paron, I.; Müller, J. B.; Meier, F.; Olsen, J. V.; Vorm, O.; Mann, M. A Novel LC System Embeds Analytes in Pre-formed Gradients for Rapid, Ultra-robust Proteomics. Molecular & Cellular Proteomics 2018, 17, 2284–2296.
    OpenUrl
  25. (25).↵
    Levy, E.; Slavov, N. Single cell protein analysis for systems biology. Essays in Biochemistry 2018, 62, 595–605.
    OpenUrlAbstract/FREE Full Text
  26. (26).
    Specht, H.; Slavov, N. Transformative opportunities for single-cell proteomics, Journal of Proteome Research 2018; Vol. 17; pp 2563–2916.
    OpenUrlPubMed
  27. (27).↵
    Specht, H.; Harmange, G.; Perlman, D. H.; Emmott, E.; Niziolek, Z.; Budnik, B.; Slavov, N. Automated sample preparation for high-throughput single-cell proteomics. bioRxiv 2018, doi:10.1101/399774.
    OpenUrlCrossRef
  28. (28).
    Yates, J. R. Innovations in Proteomics: The Drive to Single Cells. Journal of Proteome Research 2018, 17, 2563–2564, PMID: 30024162.
    OpenUrlPubMed
  29. (29).↵
    Packer, J.; Trapnell, C. Single-Cell Multi-omics: An Engine for New Quantitative Models of Gene Regulation. Trends in Genetics 2018, 34, 653–665.
    OpenUrlCrossRef
  30. (30).↵
    Burkhart, J. M.; Premsler, T.; Sickmann, A. Quality control of nano-LC-MS systems using stable isotope-coded peptides. PROTEOMICS 2011, 11, 1049–1057.
    OpenUrlCrossRefPubMed
  31. (31).
    Abbatiello, S. E. et al. Design, Implementation and Multisite Evaluation of a System Suitability Protocol for the Quantitative Assessment of Instrument Performance in Liquid Chromatography-Multiple Reaction Monitoring-MS (LC-MRM-MS). Molecular & Cellular Proteomics 2013, 12, 2623–2639.
    OpenUrl
  32. (32).↵
    Gallien, S.; Bourmaud, A.; Domon, B. A Simple Protocol To Routinely Assess the Uniformity of Proteomics Analyses. Journal of Proteome Research 2014, 13, 2688–2695.
    OpenUrlCrossRefPubMed
  33. (33).↵
    Rudnick, P. A. et al. Performance Metrics for Liquid Chromatography-Tandem Mass Spectrometry Systems in Proteomics Analyses. Molecular & Cellular Proteomics: MCP 2010, 9, 225–241.
    OpenUrl
  34. (34).↵
    Bielow, C.; Mastrobuoni, G.; Kempa, S. Proteomics Quality Control: Quality Control Software for MaxQuant Results. Journal of Proteome Research 2016, 15, 777–787.
    OpenUrl
  35. (35).↵
    Scheltema, R. A.; Mann, M. SprayQc: A Real-Time LCMS/MS Quality Monitoring System To Maximize Uptime Using Off the Shelf Components. Journal of Proteome Research 2012, 11, 3458–3466.
    OpenUrlCrossRefPubMedWeb of Science
  36. (36).↵
    Trachsel, C.; Panse, C.; Kockmann, T.; Wolski, W. E.; Grossmann, J.; Schlapbach, R. rawDiag: An R Package Supporting Rational LCMS Method Optimization for Bottom-up Proteomics. Journal of Proteome Research 2018,
  37. (37).↵
    Bittremieux, W.; Willems, H.; Kelchtermans, P.; Martens, L.; Laukens, K.; Valkenborg, D. iMonDB: Mass Spectrometry Quality Control through Instrument Monitoring. Journal of Proteome Research 2015, 14, 2360–2366.
    OpenUrl
  38. (38).↵
    Dogu, E.; Mohammad Taheri, S.; Olivella, R.; Marty, F.; Lienert, I.; Reiter, L.; Sabid, E.; Vitek, O. MSstatsQC 2.0: R/Bioconductor package for statistical quality control of mass spectrometry-based proteomic experiments. Journal of Proteome Research 2018,
  39. (39).↵
    Bittremieux, W.; Valkenborg, D.; Martens, L.; Laukens, K. Computational quality control tools for mass spectrometry proteomics. PROTEOMICS 2017, 17, 1600159.
    OpenUrl
  40. (40).↵
    Budnik, B.; Levy, E.; Harmange, G.; Slavov, N. SCoPE-MS: mass-spectrometry of single mammalian cells quantifies proteome heterogeneity during cell differentiation. Genome Biology 2018, 19, 161.
    OpenUrl
  41. (41).↵
    Team, R. C. R: A language and environment for statistical computing. 2013; http://www.R-project.org/.
  42. (42).↵
    Wickham, H. ggplot2: Elegant Graphics for Data Analysis, 1st ed.; Use R!; Springer-Verlag: New York, 2009.
  43. (43).↵
    Wilkinson, L. The Grammar of Graphics, 2nd ed.; Statistics and Computing; SpringerVerlag: New York, 2005.
  44. (44).↵
    Cox, J.; Mann, M. MaxQuant enables high peptide identification rates, individualized ppbrange mass accuracies and proteome-wide protein quantification. Nature biotechnology 2008, 26, 1367–1372.
    OpenUrlCrossRefPubMedWeb of Science
  45. (45).
    Cox, J.; Neuhauser, N.; Michalski, A.; Scheltema, R. A.; Olsen, J. V.; Mann, M. Andromeda: a peptide search engine integrated into the MaxQuant environment. Journal of proteome research 2011, 10, 1794–1805.
    OpenUrlCrossRefPubMedWeb of Science
  46. (46).↵
    Tyanova, S.; Temu, T.; Cox, J. The MaxQuant computational platform for mass spectrometrybased shotgun proteomics. Nature Protocols 2016, 11, 2301–2319.
    OpenUrl
  47. (47).↵
    Chen, A.; Franks, A.; Slavov, N. DART-ID increases single-cell proteome coverage. bioRxiv 2018, doi:10.1101/399121.
    OpenUrlCrossRef
  48. (48).↵
    Anscombe, F. J. Graphs in Statistical Analysis. The American Statistician 1973, 27, 17–21.
    OpenUrlCrossRefWeb of Science
  49. (49).↵
    Gatto, L.; Breckels, L. M.; Naake, T.; Gibb, S. Visualization of proteomics data using R and bioconductor. Proteomics 2015, 15, 1375–1389.
    OpenUrl
  50. (50).↵
    Kll, L.; Storey, J. D.; MacCoss, M. J.; Noble, W. S. Posterior Error Probabilities and False Discovery Rates: Two Sides of the Same Coin. Journal of Proteome Research 2008, 7, 40–44.
    OpenUrlCrossRefPubMedWeb of Science
  51. (51).↵
    Savitski, M. M.; Sweetman, G.; Askenazi, M.; Marto, J. A.; Lang, M.; Zinn, N.; Bantscheff, M. Delayed Fragmentation and Optimized Isolation Width Settings for Improvement of Protein Identification and Accuracy of Isobaric Mass Tag Quantification on Orbitrap-Type Mass Spectrometers. Analytical Chemistry 2011, 83, 8959–8967.
    OpenUrlCrossRefPubMed
  52. (52).↵
    Michalski, A.; Damoc, E.; Hauschild, J.-P.; Lange, O.; Wieghaus, A.; Makarov, A.; Nagaraj, N.; Cox, J.; Mann, M.; Horning, S. Mass spectrometry-based proteomics using Q Exactive, a high-performance benchtop quadrupole Orbitrap mass spectrometer. Molecular & Cellular Proteomics 2011, mcp–M111.
  53. (53).↵
    Rardin, M. J. Rapid Assessment of Contaminants and Interferences in Mass Spectrometry Data Using Skyline. Journal of The American Society for Mass Spectrometry 2018, 29, 1327–1330.
    OpenUrl
Back to top
PreviousNext
Posted January 06, 2019.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
DO-MS: Data-Driven Optimization of Mass Spectrometry Methods
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
DO-MS: Data-Driven Optimization of Mass Spectrometry Methods
Gray Huffman, Harrison Specht, Albert Chen, Nikolai Slavov
bioRxiv 512152; doi: https://doi.org/10.1101/512152
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
DO-MS: Data-Driven Optimization of Mass Spectrometry Methods
Gray Huffman, Harrison Specht, Albert Chen, Nikolai Slavov
bioRxiv 512152; doi: https://doi.org/10.1101/512152

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4688)
  • Biochemistry (10379)
  • Bioengineering (7695)
  • Bioinformatics (26372)
  • Biophysics (13547)
  • Cancer Biology (10720)
  • Cell Biology (15460)
  • Clinical Trials (138)
  • Developmental Biology (8509)
  • Ecology (12842)
  • Epidemiology (2067)
  • Evolutionary Biology (16885)
  • Genetics (11416)
  • Genomics (15493)
  • Immunology (10638)
  • Microbiology (25254)
  • Molecular Biology (10239)
  • Neuroscience (54587)
  • Paleontology (402)
  • Pathology (1671)
  • Pharmacology and Toxicology (2899)
  • Physiology (4355)
  • Plant Biology (9263)
  • Scientific Communication and Education (1588)
  • Synthetic Biology (2561)
  • Systems Biology (6789)
  • Zoology (1470)