ABSTRACT
Background The development of CRISPR genome editing has transformed biomedical research. Most applications reported thus far rely upon the Cas9 protein from Streptococcus pyogenes SF370 (SpyCas9). With many RNA guides, wild-type SpyCas9 can induce significant levels of unintended mutations at near-cognate sites, necessitating substantial efforts toward the development of strategies to minimize off-target activity. Although the genome-editing potential of thousands of other Cas9 orthologs remains largely untapped, it is not known how many will require similarly extensive engineering to achieve single-site accuracy within large (e.g. mammalian) genomes. In addition to its off-targeting propensity, SpyCas9 is encoded by a relatively large (~4.2 kb) open reading frame, limiting its utility in applications that require size-restricted delivery strategies such as adeno-associated virus vectors. In contrast, some genome-editing-validated Cas9 orthologs (e.g. from Staphylococcus aureus, Campylobacter jejuni, Geobacillus stearothermophilus and Neisseria meningitidis) are considerably smaller and therefore better suited for viral delivery.
Results Here we show that wild-type NmeCas9, when programmed with guide sequences of natural length (24 nucleotides), exhibits a nearly complete absence of unintended editing in human cells, even when targeting sites that are prone to off-target activity with wildtype SpyCas9. We also validate at least six variant protospacer adjacent motifs (PAMs), in addition to the preferred consensus PAM (5’-N4GATT-3’), for NmeCas9 genome editing in human cells.
Conclusions Our results show that NmeCas9 is a naturally high-fidelity genome editing enzyme and suggest that additional Cas9 orthologs may prove to exhibit similarly high accuracy, even without extensive engineering.
List of Abbreviations
- AAV
- adeno-associated virus
- BLESS
- breaks labelling, enrichment on streptavidin and next-generation sequencing
- BLISS
- breaks labeling in situ and sequencing
- bp
- base pair
- Cas
- CRISPR-associated
- Circle-seq
- circularization for in vitro reporting of cleavage effects by sequencing
- CjeCas9
- Campylobacter jejuni Cas9
- CMV
- cytomegalovirus
- CRISPR
- clustered, regularly interspaced, short palindromic repeats
- crRNAs
- CRISPR RNAs
- dCas9
- “dead” Cas9
- Digenome-seq
- digested genome sequencing
- DSB
- double-strand breaks
- dsODN
- double-stranded oligodeoxynucleotide
- DTS
- dual target site
- EF1α
- elongation factor-1α
- GeoCas9
- Geobacillus stearothemophilus
- GUIDE-seq
- genome-wide unbiased identification of double strand breaks enabled by sequencing
- HDR
- homology-directed repair
- HTGTS
- high-throughput genome-wide translocation sequencing
- IDLV
- integrase-defective lentiviral vector
- mESC
- mouse embryonic stem cell
- NHEJ
- non-homologous end joining
- NLS
- nuclear localization signal
- NmeCas9
- Neisseria meningitidis (strain 8013) Cas9
- NTS
- NmeCas9 target site
- PAM
- protospacer adjacent motif
- RNP
- ribonucleoprotein
- SauCas9
- Staphylococcus aureus Cas9
- sgRNA
- single-guide RNA
- SITE-Seq
- selective enrichment and identification of tagged genomic DNA ends by sequencing
- SpyCas9
- Streptococcus pyogenes Cas9
- T7E1
- T7 Endonuclease 1
- tracrRNA
- trans-acting CRISPR RNA
- tru-sgRNAs
- truncated sgRNAs.








