Spacer2PAM: A computational framework to guide experimental determination of functional CRISPR-Cas system PAM sequences

Nucleic Acids Res. 2022 Apr 8;50(6):3523-3534. doi: 10.1093/nar/gkac142.

Abstract

RNA-guided nucleases from CRISPR-Cas systems expand opportunities for precise, targeted genome modification. Endogenous CRISPR-Cas systems in many prokaryotes are attractive for circumventing the expression, functionality, and unintended-activity hurdles posed by heterologous CRISPR-Cas effectors. However, each CRISPR-Cas system recognizes a unique set of protospacer adjacent motifs (PAMs), which must be identified by extensive screening of randomized DNA libraries. This challenge hinders development of endogenous CRISPR-Cas systems, especially those based on multi-protein effectors and in organisms that are slow-growing or have transformation idiosyncrasies. To address this challenge, we present Spacer2PAM, an easy-to-use, easy-to-interpret R package built to predict and guide experimental determination of functional PAM sequences for any CRISPR-Cas system given its corresponding CRISPR array as input. Spacer2PAM can be used in a 'Quick' method to generate a single PAM prediction or in a 'Comprehensive' method to inform targeted PAM libraries small enough to screen in difficult-to-transform organisms. We demonstrate Spacer2PAM by predicting PAM sequences for industrially relevant organisms and experimentally identifying seven PAM sequences that mediate interference from the Spacer2PAM-informed PAM library for the type I-B CRISPR-Cas system from Clostridium autoethanogenum. We anticipate that Spacer2PAM will facilitate the use of endogenous CRISPR-Cas systems for industrial biotechnology and synthetic biology.
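
To make the idea of predicting a PAM from spacer data concrete, the sketch below shows one generic way a consensus motif could be derived from sequences flanking protospacer matches. It is not Spacer2PAM's implementation or API (Spacer2PAM is an R package, and the abstract does not describe its internals). The function names, the 1-bit conservation threshold, and the example flanking sequences are all assumptions made up for illustration.

```python
from collections import Counter
from math import log2


def position_counts(flanks):
    """Count nucleotides at each position across equal-length flanking sequences."""
    length = len(flanks[0])
    if any(len(f) != length for f in flanks):
        raise ValueError("all flanking sequences must have the same length")
    return [Counter(f[i].upper() for f in flanks) for i in range(length)]


def information_bits(counts):
    """Information content of one alignment column in bits (2 minus Shannon entropy)."""
    total = sum(counts.values())
    entropy = -sum((n / total) * log2(n / total) for n in counts.values() if n)
    return 2.0 - entropy


def consensus_pam(flanks, min_bits=1.0):
    """Report the dominant base at conserved positions and 'N' elsewhere.

    min_bits is a hypothetical cutoff chosen for this sketch, not a value
    taken from the paper.
    """
    motif = []
    for counts in position_counts(flanks):
        base, _ = counts.most_common(1)[0]
        motif.append(base if information_bits(counts) >= min_bits else "N")
    return "".join(motif)


# Made-up flanking sequences, for illustration only (not data from the paper).
example_flanks = ["ATCA", "GTCA", "CTCA", "TTCA", "ATCG"]
print(consensus_pam(example_flanks))  # prints NTCA
```

With so few sequences the per-position estimates are noisy, and this sketch applies no small-sample correction. A prediction of this kind is best read as a starting point for a targeted library screen rather than as a confirmed functional PAM.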

MeSH terms

  • CRISPR-Cas Systems* / genetics
  • Clostridium / genetics
  • Computational Biology / methods*
  • Gene Library
  • Nucleotide Motifs