Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Computational pipeline for the PGV-001 neoantigen vaccine trial

View ORCID ProfileAlex Rubinsteyn, Julia Kodysh, Isaac Hodes, Sebastien Mondet, Bulent Arman Aksoy, John P Finnigan, Nina Bhardwaj, Jeffrey Hammerbacher
doi: https://doi.org/10.1101/174516
Alex Rubinsteyn
1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alex Rubinsteyn
Julia Kodysh
1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Isaac Hodes
1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sebastien Mondet
1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bulent Arman Aksoy
1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
3Department of Microbiology and Immunology, Medical University of South Carolina, Charleston, SC 29425
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
John P Finnigan
1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
2Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nina Bhardwaj
1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
2Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeffrey Hammerbacher
1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
3Department of Microbiology and Immunology, Medical University of South Carolina, Charleston, SC 29425
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

This paper describes the sequencing protocol and computational pipeline for the PGV-001 personalized vaccine trial. PGV-001 consists of a fully personalized therapeutic vaccine targeting neoantigens identified from patient tumor samples. The PGV pipeline is modular and consists of many independently usable tools and software libraries. We draw attention to three particular tools which may be useful to other groups working on neoantigen vaccination. (1) Epidisco is a workflow which orchestrates the parallel execution of the PGV pipeline, including both common steps such as alignment as well well as tools which have been developed specifically for the PGV-001 trial. (2) Vaxrank uses somatic variants and tumor RNA reads to select peptides for inclusion in a patient’s vaccine. (3) Isovar is a library used by Vaxrank to determine the mutated protein sequence associated with a genomic variant from supporting tumor RNA reads. We hope that the functionality of these tools may extend beyond the specifics of the PGV-001 trial and enable other research groups in their own neoantigen investigations.

Introduction

Cancer neoantigens are antigens presented on tumor cells which, due to either mutation or modification, are not presented on normal cells. Neoantigens generated by tumor DNA mutations have been shown to play a significant role in mediating the destruction of tumor cells by the adaptive immune system 1–3. Several groups have used therapeutic vaccines targeting neoantigens to clear tumors in murine models 4–6. Consequently, many human neoantigen vaccine trials are now under way and several have even published promising early results7,8. Since very few cancer mutations are recurrent between patients, the identification of neoantigens requires a personalized genomic approach 9. We describe the sequencing protocol and immunogenomic pipeline of PGV-001, a neoantigen vaccine trial at the Mount Sinai Hospital10.

The PGV (Personalized Genomic Vaccine) pipeline takes tumor/normal sequencing data as an input and generates a ranked list of mutated peptide sequences. The various steps along the way of determining a personalized vaccine’s contents are implemented as configurable independent tools. We highlight several of these tools (Isovar, Vaxrank, and Epidisco) since they may be potentially useful to other groups working on similar neoantigen vaccine trials.

Overview of the PGV-001 Personalized Vaccine Trial

PGV-001 is a phase I clinical trial at Mount Sinai Hospital, studying the safety and immunogenicity of a multi-peptide personalized genomic vaccine for treatment of cancers. A PGV dose consists of 10 synthetic long peptides 11, each containing a somatic mutation from the patient’s tumor, as well as an immunostimulatory adjuvant: Poly-ICLC 12. In the PGV-001 trial, the personalized vaccine is administered in the adjuvant setting, for patients who undergo a complete resection and have no evidence of residual disease. The personalized vaccine is administered as an intramuscular injection and is given to the patient 10 times over a span of 6 months.

The inputs to the PGV pipeline (see Figure 2) are unmapped sequencing data from tumor DNA, tumor RNA, and “normal” patient DNA, typically from peripheral blood. The two DNA samples are compared to identify somatic mutations which are unique to the tumor. The RNA sample is used to quantify the degree to which a mutation is expressed and to determine the mutated protein sequence.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1: Overview of PGV-001 Trial
Figure 2:
  • Download figure
  • Open in new tab
Figure 2: Overview of PGV Computational Pipeline

To make immunological predictions from these mutated protein sequences, it is necessary to know the patient’s HLA type. This can be determined computationally from exome or bulk RNA sequencing or validated externally using HLA-specific amplicon sequencing 13. The PGV pipeline currently uses seq2hla14 for HLA typing from tumor RNA while also using amplicon sequencing of normal DNA for validation. Thus far, across 10 patients, the two methods have only disagreed on a single allele. This high degree of concordance matches our previous experience with HLA typing of fresh frozen tissue samples; formalin fixed tissue is more likely to give discordant results between different sequencing methods.

After the patient’s HLA type is determined, we use NetMHCpan to predict the binding affinity of mutated peptides to all patient class I alleles. It should be noted that the pipeline is written to freely allow the use of other predictors, such as NetMHC15 and MHCflurry16. We then combine affinity predictions with an estimate of each mutated protein’s abundance to generate a ranked list of vaccine peptides. The manufacturer then attempts to synthesize many different peptides from this list and delivers the top 10 sequences which they are able to purify to sufficient quality and quantity. The lyophilized peptides are dissolved in DMSO and mixed with poly-ICLC immediately before injection.

Sequencing protocol for DNA and RNA

The sequencing protocols used for both DNA and RNA can dramatically affect the sensitivity of variant detection, and thus ultimately change the results of the vaccine pipeline. The largest determinants of sensitivity are the sample quality, method of sequencing library preparation, and quantity of sequenced reads. Whenever possible, PGV uses fresh frozen tumor tissue samples, which results in significantly improved variant detection accuracy as compared with sequencing of formalin fixed (FFPE) samples17. An additional benefit of using fresh frozen samples is that, whereas the fragmented RNA of FFPE samples can only be prepared with less efficient methods such as ribosomal depletion 18. For patients with solid tumors, normal DNA is extracted from peripheral blood rather than potentially contaminated adjacent tissue 19.

Fragmentation by sonication was preferred to transposase-based methods20 due to significant sequence bias, leading to lost coverage after marking duplicate reads21. Among the exome enrichment techniques which use sonication, we chose Agilent’s SureSelect XT kit due to its efficient rate of capturing on-target reads22.

We chose to target 150X mean coverage for the normal DNA (exome) sequencing since this was found to be the point of diminishing sensitivity across different variant calling pipelines 23. Several of the cancer types allowed in the PGV-001 trial (particularly lung and head/neck cancers) have been shown to result in systematically low purity samples24. To accommodate a significant degree of non-cancerous DNA, we assume 50% tumor purity and consequently target 300X exome coverage for the tumor DNA sample.

A final consideration is the choice of read length, which does not significantly impact variant discovery from DNA but does impact variant phasing in RNA. Since a 25mer vaccine peptide is translated from 75bp of coding sequence, PGV could theoretically use any read length above that minimum. To allow for many distinct aligned positions overlapping the same region of coding sequence, the PGV protocol uses 125bp reads. These provide a good compromise between length and base quality on the HiSeq 2500 instrument. Newer sequencing platforms from Illumina allow longer reads (150bp) without significant increases in base error.

Somatic variant calling

The tumor and normal DNA samples are aligned against the human GRCh37 reference genome using BWA-MEM25. The tumor RNA is aligned using STAR26, which has been found to have particularly high sensitivity for detecting indel variants27. The alignment files were processed according to GATK Best Practices28. To improve the sensitivity of indel calling, we also perform assembly-based realignment on RNA reads containing potential indel variants. Somatic variant calling is then performed using Mutect29 and Strelka30, whose results are combined by taking a union of called variants. In cases where Mutect and Strelka yield an insufficient number of variants, we also use Mutect2 to increase sensitivity. Once all these steps have completed, we have combined VCF of candidate somatic variants and aligned RNA reads which will be used to gauge expression of these variants and determine mutated coding sequences.

Determining mutant protein sequence

There are several different software packages which predict the protein-level effect of a coding mutation31–33. However, for the purposes of selecting a vaccine peptide’s sequence, it is not sufficient to predict a DNA mutation’s protein effect in without considering the transcripts in which it occurs. A somatic mutation can be associated with selective splicing of particular RNA isoforms34 and can also co-occur with other genomic variants. Thus, the tumor RNA sequencing data is also used to determine a mutant coding sequence.

For each mutation, it is possible to infer multiple coding sequences from supporting RNA reads due to sequencing error, splicing diversity, and tumor heterogeneity. To account for these potentially complicating factors, we developed a tool called Isovar35, which can be downloaded from https://github.com/hammerlab/isovar. Isovar uses RNA to assemble the most abundant coding sequence for each mutation. An overview of the algorithm is given in Figure 3.

Figure 3:
  • Download figure
  • Open in new tab
Figure 3: Overview of Isovar Algorithm for Determining Mutant Protein Sequence
Figure 4:
  • Download figure
  • Open in new tab
Figure 4: Phasing Adjacent Variants

One advantage of using RNA to determine the coding sequence is that it naturally phases adjacent (germline or somatic) variants. A further advantage is that Isovar, by using mutation-supporting RNA reads to determine each mutant protein sequence, naturally ends up estimating allele-specific expression. If the PGV pipeline, on the other hand, used bulk expression for each gene it would potentially overestimate how much of a mutant protein is being made. In an extreme case, all of the RNA reads aligning to a particular gene could be wildtype, with none supporting the somatic variant of interest.

Vaccine peptide selection

Once we have determined the amino acid sequences containing somatic mutations and estimated their abundance in the tumor, the final step is to rank them according to desirability of inclusion in personalized vaccine.

There are many potential correlates of immunogenicity which can be used to prioritize neoantigens, such as expression, MHC binding affinity, peptide-MHC complex stability, proteasomal cleavage, and other antigen processing steps. Of those, the PGV pipeline optimizes for high expression and predicted strong Class I MHC binding. There are several published computational predictors of Class I MHC binding affinity which have demonstrated high accuracy15,36,37. PGV uses NetMHCpan36 due to its extensive coverage of patient alleles.

The final ranking of candidate vaccine peptides according to predicted MHC binding and expression is performed by a tool called Vaxrank38. Vaxrank identifies high-affinity mutant MHC ligands within each peptide and combines these predictions into a single MHC binding score. This score is then scaled according to the expression of that mutation in the tumor. The formula for computing these MHC and expression scores is given in Figure 5. The scale and offset for MHC affinity normalization was determined by a logistic fit of affinity versus. immunogenicity from the dataset used to determine the classical 500nM affinity threshold39. There no rigorous justification for the multiplicative scoring function, other than the intuition that epitope abundance and MHC affinity are independent prerequisites for immunogenicity.

Figure 5:
  • Download figure
  • Open in new tab
Figure 5: Scoring Criterion for Selecting Vaccine Peptides

Since some peptides cannot be manufactured using solid-phase synthesis, our vaccine peptide ranking algorithm includes manufacturability heuristics, such as minimization of cysteines. Vaxrank can be downloaded from https://github.com/hammerlab/vaxrank.

Pipeline workflow

The actual execution of the PGV computational pipeline requires careful orchestration and combination of various computational steps. Furthermore, the time and the computational power needed to complete each of these intermediate steps within the PGV pipeline varies: for example, somatic mutational calling on the whole genome can easily become a major bottleneck for a pipeline if the details of the execution is not well planned.

We use a suite of computational tools that enable PGV pipeline follow the best practices defined by the community, run faster, and tolerate failures without sacrificing any efficiency. In particular, we parallelize a majority of the somatic mutation calling pipeline by splitting the data by chromosome and running computationally intensive parts of the pipeline on each chromosome instead of the whole chromosome. This allows efficient and simultaneous use of tens of computation units towards a single goal and therefore reduces the amount of time and resources required for such demanding computational tasks. Furthermore, QC checks for each data type, the processing of the RNA sequencing data, and the joint analyses of the normal-tumor DNA sequencing data are all run independently, again enabling us to divide the computational tasks needed for each to multiple computational units and run them simultaneously. The specific tools are enumerated in Supplementary Table 3. The only one of these tools which is specific to immunogenics is Epidisco, which orchestrates the execution of all the other underlying tools used by the PGV pipeline.

Epidisco aims to simplify parallel and efficient execution of the PGV pipeline, therefore it supports local computation, traditional HPC scheduling, and cloud-based resources from well-known providers. On a typical machine, running the complete PGV pipeline for a single patient can take up to 4 days; but making use of 5 or more computers for parallelization reduces the overall running time down to a single day. The parallelization of the PGV pipeline becomes more prominent once more than one patients are being analyzed simultaneously, which allows efficiently saturating the computational resources and allow the pipeline to scale according to the actual workload. This also make it feasible to use PGV pipeline on a cohort level where fast and efficient pipelines are required to be able to analyze data for tens of patients in a fast, efficient, and accurate way.

Epidisco also makes the PGV pipeline tolerant to failures of intermediate steps and allows resuming the pipeline from the point of failure with a simple restart request. By handling such failures in an automated way, carrying out cleaning procedures, and restarting only the tasks that need to be re-run, the workflow makes it easier for researchers to operate such complex computational tasks possible. Epidisco provides command line and web-based utilities to facilitate starting a new workflow, collecting the results, and troubleshoot specific parts of a pipeline.

Discussion

The PGV pipeline is a modular, highly configurable, freely available method for selecting the contents of a therapeutic neoantigen vaccine. The PGV pipeline has been used to predict vaccine peptides for several mouse models (LLC, B16 F1/F10), 5 “dry run” patients whose samples were processed according to the PGV protocol but did not participate in the trial, and 5 patients which were being considered for enrollment in the trial. A summary of the results from the “dry run” patients is presented in Supplementary Table 1. Of the patients eligible for enrollment, 1 has been treated so far and another enrolled. The remainder did not enroll due to progression of disease or low quality tumor samples.

Several other groups have released pipelines for neoantigen vaccine prediction, most notably pVAC-seq40, MuPeXI41. A deep comparison between neoantigen pipelines likely requires evaluating T-cell response and anti-tumor activity after vaccination, which is well beyond the scope of this paper. There are, however, a few obvious differences between the PGV pipeline and others which have been published.

  • Modularity: The PGV pipeline has been developed as a collection of flexible standalone tools, rather than a single monolithic script (see Supplementary Tables 2 & 3). These components are brought together in the Epidisco library, which allows users to run highly configurable genomics and immunoinformatics pipelines. For example, the variant calling and epitope prediction components of the PGV pipeline have also been used through Epidisco for retrospective analyses of checkpoint blockade clinical trials42.

  • Inputs are FASTQ files: MuPeXI and pVAC-seq both require the implementation of separate genomics pipeline to infer patient HLA type, call somatic variants, and quantify expression. The PGV pipeline, by contrast, is self-contained in the sense that its inputs are raw FASTQ files and its outputs are vaccine peptide predictions.

  • Dependence on tumor RNA: The PGV pipeline relies on tumor RNA reads to determine the mutant protein coding sequence. MuPeXI and pVAC-seq, by contrast, only consider expression data after predicting a mutant protein sequence from a variant in isolation. PGV’s approach has potential benefits in capturing altered patterns of splicing and phasing somatic variants with other nearby variants. These potential benefits, however, have yet to be evaluated systematically.

  • Liberal software license: All of the software components which comprise the PGV pipeline are freely available under the Apache software license. MuPeXI and pVAC-seq, by contrast, use more restrictive licenses, which might prohibit their integration with other bioinformatics software.

  • Optimization of peptide sequence for solid phase synthesis: PGV appears to be unique among freely available neoantigen pipelines in attempting to choose peptides whose sequence content is more likely to be successfully manufactured. We have found this to be an important step, especially when using longer peptides, due to the significant delays introduced by failed synthesis or purification attempts.

The PGV-001 trial is the first in a series of planned neoantigen vaccine investigations. Several improvements to the PGV pipeline are planned, including the use of genomic fusions and other structural variants as neoantigen sources, clonality as a consideration for variant prioritization, and additional immunological predictions such as proteasomal cleavage and Class II MHC binding. As immune response data from ongoing preclinical work and PGV-001 becomes available, our method for combining correlates of immunogenicity into a single ranking will likely undergo significant tuning.

References

  1. 1.↵
    Fritsch, E. F. et al. HLA-binding properties of tumor neoepitopes in humans. Cancer Immunol Res 2, 522–529 (2014).
    OpenUrlAbstract/FREE Full Text
  2. 2.
    Finnigan, J. P., Jr., Rubinsteyn, A., Hammerbacher, J. & Bhardwaj, N. Mutation-Derived Tumor Antigens: Novel Targets in Cancer Immunotherapy. Oncology 29, (2015).
  3. 3.↵
    Ward, J. P., Gubin, M. M. & Schreiber, R. D. The Role of Neoantigens in Naturally Occurring and Therapeutically Induced Immune Responses to Cancer. Adv. Immunol. 130, 25–74 (2016).
    OpenUrlCrossRefPubMed
  4. 4.↵
    Gubin, M. M. et al. Checkpoint blockade cancer immunotherapy targets tumour-specific mutant antigens. Nature 515, 577–581 (2014).
    OpenUrlCrossRefPubMedWeb of Science
  5. 5.
    Castle, J. C. et al. Exploiting the mutanome for tumor vaccination. Cancer Res. 72, 1081–1091 (2012).
    OpenUrlAbstract/FREE Full Text
  6. 6.↵
    Yadav, M. et al. Predicting immunogenic tumour mutations by combining mass spectrometry and exome sequencing. Nature 515, 572–576 (2014).
    OpenUrlCrossRefPubMedWeb of Science
  7. 7.↵
    Sahin, U. et al. Personalized RNA mutanome vaccines mobilize poly-specific therapeutic immunity against cancer. Nature 547, 222–226 (2017).
    OpenUrlCrossRefPubMed
  8. 8.↵
    Ott, P. A. et al. An immunogenic personal neoantigen vaccine for patients with melanoma. Nature 547, 217–221 (2017).
    OpenUrlCrossRefPubMed
  9. 9.↵
    Hartmaier, R. J. et al. Genomic analysis of 63,220 tumors reveals insights into tumor uniqueness and targeted cancer immunotherapy strategies. Genome Med. 9, 16 (2017).
    OpenUrl
  10. 10.↵
    Kyi, C. et al. A phase I study of the safety and immunogenicity of a multipeptide personalized genomic vaccine in the adjuvant treatment of solid cancers. J. Clin. Orthod. 35, TPS3114–TPS3114 (2017).
    OpenUrl
  11. 11.↵
    Rosalia, R. A. et al. Dendritic cells process synthetic long peptides better than whole protein, improving antigen presentation and T-cell activation. Eur. J. Immunol. 43, 2554–2565 (2013).
    OpenUrl
  12. 12.↵
    Ammi, R. et al. Poly(I:C) as cancer vaccine adjuvant: knocking on the door of medical breakthroughs. Pharmacol. Ther. 146, 120–131 (2015).
    OpenUrlCrossRefPubMed
  13. 13.↵
    Cereb, N., Kim, H. R., Ryu, J. & Yang, S. Y. Advances in DNA sequencing technologies for high resolution HLA typing. Hum. Immunol. 76, 923–927 (2015).
    OpenUrlCrossRefPubMed
  14. 14.↵
    Boegel, S. et al. HLA typing from RNA-Seq sequence reads. Genome Med. 4, 102 (2012).
    OpenUrlCrossRefPubMed
  15. 15.↵
    Lundegaard, C. et al. NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8-11. Nucleic Acids Res. 36, W509–12 (2008).
    OpenUrlCrossRefPubMedWeb of Science
  16. 16.↵
    Rubinsteyn, A., O’Donnell, T., Damaraju, N. & Hammerbacher, J. Predicting Peptide-MHC Binding Affinities With Imputed Training Data. bioRxiv 054775 (2016). doi:10.1101/054775.
    OpenUrlCrossRef
  17. 17.↵
    Oh, E. et al. Comparison of Accuracy of Whole-Exome Sequencing with Formalin-Fixed Paraffin-Embedded and Fresh Frozen Tissue Samples. PLoS One 10, e0144162 (2015).
    OpenUrl
  18. 18.↵
    Zhao, W. et al. Comparison of RNA-Seq by poly (A) capture, ribosomal RNA depletion, and DNA microarray for expression profiling. BMC Genomics 15, 419 (2014).
    OpenUrlCrossRefPubMed
  19. 19.↵
    Wei, L. et al. Pitfalls of improperly procured adjacent non-neoplastic tissue for somatic mutation analysis using next-generation sequencing. BMC Med. Genomics 9, 64 (2016).
    OpenUrlCrossRef
  20. 20.↵
    Adey, A. et al. Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition. Genome Biol. 11, R119 (2010).
    OpenUrlCrossRefPubMed
  21. 21.↵
    García-García, G. et al. Assessment of the latest NGS enrichment capture methods in clinical context. Sci. Rep. 6, 20948 (2016).
    OpenUrl
  22. 22.↵
    Shigemizu, D. et al. Performance comparison of four commercial human whole-exome capture platforms. Sci. Rep. 5, 12742 (2015).
    OpenUrlCrossRefPubMed
  23. 23.↵
    Cornish, A. & Guda, C. A Comparison of Variant Calling Pipelines Using Genome in a Bottle as a Reference. Biomed Res. Int. 2015, 456479 (2015).
    OpenUrl
  24. 24.↵
    Aran, D., Sirota, M. & Butte, A. J. Systematic pan-cancer analysis of tumour purity. Nat. Commun. 6, 8971 (2015).
    OpenUrlCrossRefPubMed
  25. 25.↵
    Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv [q-bio.GN] (2013).
  26. 26.↵
    Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
    OpenUrlCrossRefPubMedWeb of Science
  27. 27.↵
    Sun, Z., Bhagwate, A., Prodduturi, N., Yang, P. & Kocher, J.-P. A. Indel detection from RNA-seq data: tool evaluation and strategies for accurate detection of actionable mutations. Brief. Bioinform. (2016). doi:10.1093/bib/bbw06.
    OpenUrlCrossRef
  28. 28.↵
    Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinformatics 43, 11.10.1–33 (2013).
    OpenUrlPubMed
  29. 29.↵
    Cibulskis, K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 31, 213–219 (2013).
    OpenUrlCrossRefPubMed
  30. 30.↵
    Saunders, C. T. et al. Strelka: accurate somatic small-variant calling from sequenced tumor–normal sample pairs. Bioinformatics 28, 1811–1817 (2012).
    OpenUrlCrossRefPubMedWeb of Science
  31. 31.↵
    Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012).
    OpenUrlCrossRefPubMedWeb of Science
  32. 32.
    Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
    OpenUrlCrossRefPubMed
  33. 33.↵
    Rubinsteyn, A. et al. hammerlab/varcode: Version 0.5.15. (2017). doi:10.5281/zenodo.82250.
    OpenUrlCrossRef
  34. 34.↵
    Sveen, A., Kilpinen, S., Ruusulehto, A., Lothe, R. A. & Skotheim, R. I. Aberrant RNA splicing in cancer; expression changes and driver mutations of splicing factor genes. Oncogene 35, 2413–2427 (2016).
    OpenUrl
  35. 35.↵
    Rubinsteyn, A., Kodysh, J. & Aksoy, B. A. hammerlab/isovar: Version 0.7.0. (2017). doi:10.5281/zenodo.82122.
    OpenUrlCrossRef
  36. 36.↵
    Hoof, I. et al. NetMHCpan, a method for MHC class I binding prediction beyond humans. Immunogenetics 61, 1–13 (2009).
    OpenUrlCrossRefPubMedWeb of Science
  37. 37.↵
    Karosiene, E., Lundegaard, C., Lund, O. & Nielsen, M. NetMHCcons: a consensus method for the major histocompatibility complex class I predictions. Immunogenetics 64, 177–186 (2012).
    OpenUrlCrossRefPubMedWeb of Science
  38. 38.↵
    Rubinsteyn, A., Hodes, I., Kodysh, J. & Hammerbacher, J. Vaxrank: A Computational Tool For Designing Personalized Cancer Vaccines. bioRxiv 142919 (2017).
  39. 39.↵
    Sette, A. et al. The relationship between class I binding affinity and immunogenicity of potential cytotoxic T cell epitopes. J. Immunol. 153, 5586–5592 (1994).
    OpenUrlAbstract
  40. 40.↵
    Hundal, J. et al. pVAC-Seq: A genome-guided in silico approach to identifying tumor neoantigens. Genome Med. 8, 11 (2016).
    OpenUrlPubMed
  41. 41.↵
    Bjerregaard, A.-M., Nielsen, M., Hadrup, S. R., Szallasi, Z. & Eklund, A. C. MuPeXI: prediction of neo-epitopes from tumor sequencing data. Cancer Immunol. Immunother. (2017). doi:10.1007/s00262-017-2001-.
    OpenUrlCrossRef
  42. 42.↵
    Snyder, A. et al. Contribution of systemic and somatic factors to clinical response and resistance to PD-L1 blockade in urothelial cancer: An exploratory multi-omic analysis. PLoS Med. 14, e1002309 (2017).
    OpenUrlCrossRef
Back to top
PreviousNext
Posted August 10, 2017.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Computational pipeline for the PGV-001 neoantigen vaccine trial
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Computational pipeline for the PGV-001 neoantigen vaccine trial
Alex Rubinsteyn, Julia Kodysh, Isaac Hodes, Sebastien Mondet, Bulent Arman Aksoy, John P Finnigan, Nina Bhardwaj, Jeffrey Hammerbacher
bioRxiv 174516; doi: https://doi.org/10.1101/174516
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Computational pipeline for the PGV-001 neoantigen vaccine trial
Alex Rubinsteyn, Julia Kodysh, Isaac Hodes, Sebastien Mondet, Bulent Arman Aksoy, John P Finnigan, Nina Bhardwaj, Jeffrey Hammerbacher
bioRxiv 174516; doi: https://doi.org/10.1101/174516

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4113)
  • Biochemistry (8816)
  • Bioengineering (6519)
  • Bioinformatics (23463)
  • Biophysics (11791)
  • Cancer Biology (9209)
  • Cell Biology (13324)
  • Clinical Trials (138)
  • Developmental Biology (7439)
  • Ecology (11410)
  • Epidemiology (2066)
  • Evolutionary Biology (15152)
  • Genetics (10439)
  • Genomics (14044)
  • Immunology (9171)
  • Microbiology (22155)
  • Molecular Biology (8812)
  • Neuroscience (47571)
  • Paleontology (350)
  • Pathology (1428)
  • Pharmacology and Toxicology (2491)
  • Physiology (3730)
  • Plant Biology (8081)
  • Scientific Communication and Education (1437)
  • Synthetic Biology (2221)
  • Systems Biology (6038)
  • Zoology (1253)