
Integrase-mediated spacer acquisition during crispr–cas adaptive immunity
- Select a language for the TTS:
- UK English Female
- UK English Male
- US English Female
- US English Male
- Australian Female
- Australian Male
- Language selected: (auto detect) - EN
Play all audios:

ABSTRACT Bacteria and archaea insert spacer sequences acquired from foreign DNAs into CRISPR loci to generate immunological memory. The _Escherichia coli_ Cas1–Cas2 complex mediates spacer
acquisition _in vivo_, but the molecular mechanism of this process is unknown. Here we show that the purified Cas1–Cas2 complex integrates oligonucleotide DNA substrates into acceptor DNA to
yield products similar to those generated by retroviral integrases and transposases. Cas1 is the catalytic subunit and Cas2 substantially increases integration activity. Protospacer DNA
with free 3′-OH ends and supercoiled target DNA are required, and integration occurs preferentially at the ends of CRISPR repeats and at sequences adjacent to cruciform structures abutting
AT-rich regions, similar to the CRISPR leader sequence. Our results demonstrate the Cas1–Cas2 complex to be the minimal machinery that catalyses spacer DNA acquisition and explain the
significance of CRISPR repeats in providing sequence and structural specificity for Cas1–Cas2-mediated adaptive immunity. Access through your institution Buy or subscribe This is a preview
of subscription content, access via your institution ACCESS OPTIONS Access through your institution Subscribe to this journal Receive 51 print issues and online access $199.00 per year only
$3.90 per issue Learn more Buy this article * Purchase on SpringerLink * Instant access to full article PDF Buy now Prices may be subject to local taxes which are calculated during checkout
ADDITIONAL ACCESS OPTIONS: * Log in * Learn about institutional subscriptions * Read our FAQs * Contact customer support SIMILAR CONTENT BEING VIEWED BY OTHERS GENOME EXPANSION BY A CRISPR
TRIMMER-INTEGRASE Article Open access 14 June 2023 HISTONES DIRECT SITE-SPECIFIC CRISPR SPACER ACQUISITION IN MODEL ARCHAEON Article 07 August 2023 ALTERNATIVE FUNCTIONS OF CRISPR–CAS
SYSTEMS IN THE EVOLUTIONARY ARMS RACE Article 06 January 2022 ACCESSION CODES PRIMARY ACCESSIONS GENE EXPRESSION OMNIBUS * GSE64552 DATA DEPOSITS Sequencing data are deposited in Gene
Expression Omnibus under accession number GSE64552. REFERENCES * Barrangou, R. et al. CRISPR provides acquired resistance against viruses in prokaryotes. _Science_ 315, 1709–1712 (2007)
Article ADS CAS PubMed Google Scholar * van der Oost, J., Westra, E. R., Jackson, R. N. & Wiedenheft, B. Unravelling the structural and mechanistic basis of CRISPR–Cas systems.
_Nature Rev. Microbiol._ 12, 479–492 (2014) Article CAS Google Scholar * Mojica, F. J., Diez-Villasenor, C., Garcia-Martinez, J. & Soria, E. Intervening sequences of regularly spaced
prokaryotic repeats derive from foreign genetic elements. _J. Mol. Evol._ 60, 174–182 (2005) Article ADS CAS PubMed Google Scholar * Bolotin, A., Quinquis, B., Sorokin, A. &
Ehrlich, S. D. Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. _Microbiology_ 151, 2551–2561 (2005) Article CAS PubMed Google
Scholar * Pourcel, C., Salvignol, G. & Vergnaud, G. CRISPR elements in _Yersinia pestis_ acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools
for evolutionary studies. _Microbiology_ 151, 653–663 (2005) Article CAS PubMed Google Scholar * Stern, A., Keren, L., Wurtzel, O., Amitai, G. & Sorek, R. Self-targeting by CRISPR:
gene regulation or autoimmunity? _Trends in Genet._ 26, 335–340 (2010) Article CAS Google Scholar * Carte, J., Wang, R., Li, H., Terns, R. M. & Terns, M. P. Cas6 is an
endoribonuclease that generates guide RNAs for invader defense in prokaryotes. _Genes Dev._ 22, 3489–3496 (2008) Article CAS PubMed PubMed Central Google Scholar * Haurwitz, R. E.,
Jinek, M., Wiedenheft, B., Zhou, K. & Doudna, J. A. Sequence- and structure-specific RNA processing by a CRISPR endonuclease. _Science_ 329, 1355–1358 (2010) Article ADS CAS PubMed
PubMed Central Google Scholar * Deltcheva, E. et al. CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III. _Nature_ 471, 602–607 (2011) Article ADS CAS PubMed
PubMed Central Google Scholar * Brouns, S. J. et al. Small CRISPR RNAs guide antiviral defense in prokaryotes. _Science_ 321, 960–964 (2008) Article ADS CAS PubMed PubMed Central
Google Scholar * Garneau, J. E. et al. The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA. _Nature_ 468, 67–71 (2010) Article ADS CAS PubMed Google Scholar *
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. _Science_ 337, 816–821 (2012) Article ADS CAS PubMed PubMed Central Google Scholar *
Yosef, I., Goren, M. G. & Qimron, U. Proteins and DNA elements essential for the CRISPR adaptation process in _Escherichia coli_. _Nucleic Acids Res._ 40, 5569–5576 (2012) Article CAS
PubMed PubMed Central Google Scholar * Datsenko, K. A. et al. Molecular memory of prior infections activates the CRISPR/Cas adaptive bacterial immunity system. _Nature Commun._ 3, 945
(2012) Article ADS Google Scholar * Swarts, D. C., Mosterd, C., van Passel, M. W. & Brouns, S. J. CRISPR interference directs strand specific spacer acquisition. _PLoS ONE_ 7, e35888
(2012) Article ADS CAS PubMed PubMed Central Google Scholar * Nuñez, J. K. et al. Cas1–Cas2 complex formation mediates spacer acquisition during CRISPR–Cas adaptive immunity. _Nature
Struct. Mol. Biol._ 21, 528–534 (2014) Article Google Scholar * Wiedenheft, B. et al. Structural basis for DNase activity of a conserved protein implicated in CRISPR-mediated genome
defense. _Structure_ 17, 904–912 (2009) Article CAS PubMed Google Scholar * Babu, M. et al. A dual function of the CRISPR–Cas system in bacterial antivirus immunity and DNA repair. _Mol.
Microbiol._ 79, 484–502 (2011) Article CAS PubMed Google Scholar * Kim, T. Y., Shin, M., Huynh Thi Yen, L. & Kim, J. S. Crystal structure of Cas1 from Archaeoglobus fulgidus and
characterization of its nucleolytic activity. _Biochem. Biophys. Res. Commun._ 441, 720–725 (2013) Article CAS PubMed Google Scholar * Beloglazova, N. et al. A novel family of
sequence-specific endoribonucleases associated with the clustered regularly interspaced short palindromic repeats. _J. Biol. Chem._ 283, 20361–20371 (2008) Article CAS PubMed PubMed
Central Google Scholar * Samai, P., Smith, P. & Shuman, S. Structure of a CRISPR-associated protein Cas2 from _Desulfovibrio vulgaris_. _Acta Crystallogr. Sect. F Struct. Biol. Cryst.
Commun._ 66, 1552–1556 (2010) Article CAS PubMed PubMed Central Google Scholar * Nam, K. H. et al. Double-stranded endonuclease activity in _Bacillus halodurans_ clustered regularly
interspaced short palindromic repeats (CRISPR)-associated Cas2 protein. _J. Biol. Chem._ 287, 35943–35952 (2012) Article CAS PubMed PubMed Central Google Scholar * Li, M. & Craigie,
R. Processing of viral DNA ends channels the HIV-1 integration reaction to concerted integration. _J. Biol. Chem._ 280, 29334–29339 (2005) Article CAS PubMed Google Scholar *
Cherepanov, P. LEDGF/p75 interacts with divergent lentiviral integrases and modulates their enzymatic activity _in vitro_. _Nucleic Acids Res._ 35, 113–124 (2007) Article CAS PubMed
Google Scholar * Hare, S. et al. A novel co-crystal structure affords the design of gain-of-function lentiviral integrase mutants in the presence of modified PSIP1/LEDGF/p75. _PLoS Pathog._
5, e1000259 (2009) Article PubMed PubMed Central Google Scholar * Yang, J. Y., Jayaram, M. & Harshey, R. M. Positional information within the Mu transposase tetramer: catalytic
contributions of individual monomers. _Cell_ 85, 447–455 (1996) Article CAS PubMed Google Scholar * Dinardo, S., Voelkel, K. A., Sternglanz, R., Reynolds, A. E. & Wright, A.
_Escherichia coli_ DNA topoisomerase I mutants have compensatory mutations in DNA gyrase genes. _Cell_ 31, 43–51 (1982) Article CAS PubMed Google Scholar * Pruss, G. J., Manes, S. H.
& Drlica, K. _Escherichia coli_ DNA topoisomerase I mutants: increased supercoiling is corrected by mutations near gyrase genes. _Cell_ 31, 35–42 (1982) Article CAS PubMed Google
Scholar * Chow, S. A., Vincent, K. A., Ellison, V. & Brown, P. O. Reversal of integration and DNA splicing mediated by integrase of human immunodeficiency virus. _Science_ 255, 723–726
(1992) Article ADS CAS PubMed Google Scholar * Au, T. K., Pathania, S. & Harshey, R. M. True reversal of Mu integration. _EMBO J._ 23, 3408–3420 (2004) Article CAS PubMed PubMed
Central Google Scholar * Engelman, A., Mizuuchi, K. & Craigie, R. HIV-1 DNA integration: mechanism of viral DNA cleavage and DNA strand transfer. _Cell_ 67, 1211–1221 (1991) Article
CAS PubMed Google Scholar * Mizuuchi, K. & Adzuma, K. Inversion of the phosphate chirality at the target site of Mu DNA strand transfer: evidence for a one-step transesterification
mechanism. _Cell_ 66, 129–140 (1991) Article CAS PubMed Google Scholar * Curcio, M. J. & Derbyshire, K. M. The outs and ins of transposition: from mu to kangaroo. _Nature Rev. Mol.
Cell Biol._ 4, 865–877 (2003) Article CAS Google Scholar * Arslan, Z., Hermanns, V., Wurm, R., Wagner, R. & Pul, U. Detection and characterization of spacer integration intermediates
in type I-E CRISPR-Cas system. _Nucleic Acids Res._ 42, 7884–7893 (2014) Article CAS PubMed PubMed Central Google Scholar * Tyson, G. W. & Banfield, J. F. Rapidly evolving CRISPRs
implicated in acquired resistance of microorganisms to viruses. _Environ. Microbiol._ 10, 200–207 (2008) CAS PubMed Google Scholar * Sheflin, L. G. & Kowalski, D. Altered DNA
conformations detected by mung bean nuclease occur in promoter and terminator regions of supercoiled pBR322 DNA. _Nucleic Acids Res._ 13, 6137–6154 (1985) Article CAS PubMed PubMed
Central Google Scholar * Goren, M. G., Yosef, I., Auster, O. & Qimron, U. Experimental definition of a clustered regularly interspaced short palindromic duplicon in _Escherichia coli_.
_J. Mol. Biol._ 423, 14–16 (2012) Article CAS PubMed Google Scholar * Savitskaya, E., Semenova, E., Dedkov, V., Metlitskaya, A. & Severinov, K. High-throughput analysis of type I-E
CRISPR/Cas spacer acquisition in _E. _ _coli_. _RNA Biol._ 10, 716–725 (2013) Article CAS PubMed PubMed Central Google Scholar * Shmakov, S. et al. Pervasive generation of oppositely
oriented spacers during CRISPR adaptation. _Nucleic Acids Res._ 42, 5907–5916 (2014) Article CAS PubMed PubMed Central Google Scholar * Deveau, H. et al. Phage response to
CRISPR-encoded resistance in Streptococcus thermophilus. _J. Bacteriol._ 190, 1390–1400 (2008) Article CAS PubMed Google Scholar * Semenova, E. et al. Interference by clustered regularly
interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. _Proc. Natl Acad. Sci. USA_ 108, 10098–10103 (2011) Article ADS CAS PubMed PubMed Central Google
Scholar * Westra, E. R. et al. Type I-E CRISPR-cas systems discriminate target from non-target DNA through base pairing-independent PAM recognition. _PLoS Genet._ 9, e1003742 (2013) Article
CAS PubMed PubMed Central Google Scholar * Craigie, R. & Bushman, F. D. HIV DNA integration. _Cold Spring Harbor Perspect. Med._ 2, a006890 (2012) Article Google Scholar *
Nowotny, M. Retroviral integrase superfamily: the structural perspective. _EMBO Rep._ 10, 144–151 (2009) Article CAS PubMed PubMed Central Google Scholar * Hochstrasser, M. L. &
Doudna, J. A. Cutting it close: CRISPR-associated endoribonuclease structure and function. _Trends Biochem. Sci._ 40, 58–66 (2015) Article CAS PubMed Google Scholar * Paleček, E. Local
supercoil-stabilized DNA structures. _Crit. Rev. Biochem. Mol. Biol._ 26, 151–226 (1991) Article PubMed Google Scholar * Engelman, A. & Craigie, R. Efficient magnesium-dependent human
immunodeficiency virus type 1 integrase activity. _J. Virol._ 69, 5908–5911 (1995) CAS PubMed PubMed Central Google Scholar * Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L.
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. _Genome Biol._ 10, R25 (2009) Article PubMed PubMed Central Google Scholar * Crooks, G. E., Hon, G.,
Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. _Genome Res._ 14, 1188–1190 (2004) Article CAS PubMed PubMed Central Google Scholar Download references
ACKNOWLEDGEMENTS We are grateful to M. Chung, P. J. Kranzusch and A.V. Wright for technical assistance and members of the Doudna laboratory and J. Cate for discussions. This project was
funded by US National Science Foundation grant no. 1244557 to J.A.D. and by NIH grant AI070042 to A.E. This work used the Vincent J. Coates Genomics Sequencing Laboratory at UC Berkeley,
supported by NIH S10 Instrumentation Grants S10RR029668 and S10RR027303. J.K.N. is supported by a US National Science Foundation Graduate Research Fellowship and a UC Berkeley Chancellor’s
Graduate Fellowship. A.S.Y.L. is supported as an American Cancer Society Postdoctoral Fellow (PF-14-108-01-RMC). J.A.D. is an Investigator of the Howard Hughes Medical Institute and a member
of the Center for RNA Systems Biology. AUTHOR INFORMATION AUTHORS AND AFFILIATIONS * Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California
94720, USA, James K. Nuñez, Amy S. Y. Lee & Jennifer A. Doudna * Center for RNA Systems Biology, University of California, Berkeley, Berkeley, California 94720, USA, Amy S. Y. Lee &
Jennifer A. Doudna * Department of Cancer Immunology and AIDS, Dana-Farber Cancer Institute and Department of Medicine, Harvard Medical School, Boston, 02115, Massachusetts, USA Alan
Engelman * Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, California 94720, USA, Jennifer A. Doudna * Department of Chemistry, University of California,
Berkeley, Berkeley, California 94720, USA, Jennifer A. Doudna * Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, 94720, California, USA Jennifer A. Doudna
Authors * James K. Nuñez View author publications You can also search for this author inPubMed Google Scholar * Amy S. Y. Lee View author publications You can also search for this author
inPubMed Google Scholar * Alan Engelman View author publications You can also search for this author inPubMed Google Scholar * Jennifer A. Doudna View author publications You can also search
for this author inPubMed Google Scholar CONTRIBUTIONS J.K.N. performed the biochemical experiments. A.S.Y.L. processed and analysed the high-throughput sequencing data. J.K.N., A.S.Y.L.,
A.E. and J.A.D. designed the study, analysed the data and wrote the manuscript. CORRESPONDING AUTHOR Correspondence to Jennifer A. Doudna. ETHICS DECLARATIONS COMPETING INTERESTS J.A.D. and
J.K.N. have filed a related patent application. EXTENDED DATA FIGURES AND TABLES EXTENDED DATA FIGURE 1 THE INTEGRATION REACTION IS DEPENDENT ON THE PRESENCE OF PROTOSPACERS, LOW SALT AND
DIVALENT METAL IONS. A, _In vitro_ integration assay alongside EcoRI- and Nb.BbvCI nickase-treated pCRISPR. B, Salt-dependence assay using Cas1 or Cas2 only and Cas1+Cas2. The titration
corresponds to 0, 25, 50, 100 and 200 nM KCl, in addition to the salt carried in from the reaction reagents. C, Integration assays in the presence of 10 mM EDTA, Mg2+, Mn2+ or no additive.
D, Integration assays with increasing protospacer concentrations. E, A comparison of post-reaction treatments as indicated. The data presented in A–E are representative of at least three
replicates. EXTENDED DATA FIGURE 2 CAS1 REQUIRES CAS2 FOR ROBUST PROTOSPACER INTEGRATION. A, Schematic of the integration assays using 32P-labelled protospacers (PDB code 4P6I for
Cas1–Cas2). B, Integration assays in the presence of increasing protein and 10 mM MnCl2. The titration corresponds to 0, 50, 100 and 200 nM protein. C, Same as B except in the presence of 10
mM MgCl2. The data presented in B and C are representative of at least three replicates. EXTENDED DATA FIGURE 3 THE CATALYTIC ACTIVITY OF CAS1 IS REQUIRED FOR INTEGRATION. A, Close-up view
of the Cas1 active site with the conserved residues shown in stick configurations (PDB 4P6I). B, Integration assays of purified Cas1 active site mutants complexed with wild-type Cas2. C, The
same as B except using radiolabelled protospacers. The data presented in B and C are representative of at least three replicates. EXTENDED DATA FIGURE 4 BAND X CORRESPONDS TO TOPOISOMERS OF
PCRISPR. A, Agarose gel of purified relaxed and band X integration products. B, Analysis of the total reaction products, after phenol chloroform extraction and ethanol precipitation, on a
pre-stained agarose gel. C, Same as B except ethidium bromide staining was performed after electrophoresis. D, PCR amplification products of various segments of pCRISPR using the relaxed,
band X or pCRISPR template shown in A. The laddering effect of minor products using CRISPR locus primers likely reflects the propensity of CRISPR repeats to form DNA hairpins. The data
presented in A–D are representative of at least three replicates. EXTENDED DATA FIGURE 5 CAS1 CATALYSES THE DISINTEGRATION OF HALF-SITE INTEGRATED PROTOSPACERS. A, Schematic of the four
strands constituting the Y DNA substrate used in the disintegration assays. B, Native polyacrylamide gel analysis of the annealing products with either strand A or strand C radiolabelled. C,
Native polyacrylamide gel analysis of disintegration assay products using Y DNA substrates with strand A labelled. D, Denaturing gel analysis of the disintegration assay products with
strand A labelled. EXTENDED DATA FIGURE 6 CAS1–CAS2 CAN INTEGRATE VARIOUS LENGTHS OF DOUBLE-STRANDED DNA WITH BLUNT- OR 3′-OVERHANG ENDS INTO A SUPERCOILED TARGET PLASMID. A, Integration
assays using the indicated lengths of protospacer DNA. B, Integration assays using varying 5′ or 3′ overhang lengths. C, D, A comparison of integration assays using pCRISPR or
Nb.BbvCI-nicked pCRISPR target. E, Integration assay using different target plasmids with or without a CRISPR locus. The green arrows correspond to the relaxed product of each target and the
cyan arrows correspond to the band X product. The data presented in A–E are representative of at least three replicates. EXTENDED DATA FIGURE 7 CAS1 TYROSINE MUTANTS SUPPORT INTEGRATION
ACTIVITY _IN VITRO_. A, A close-up of the Cas1 active site with the tyrosine residues labelled in blue. B, Structure-based sequence alignment of Cas1 proteins, highlighting the tyrosine
residues mutated to alanine in this study. C, Radiolabelled protospacer integration assay of Cas1 tyrosine mutants complexed with wild-type Cas2. The gel presented in C is representative of
at least three replicates. EXTENDED DATA FIGURE 8 HIGH-THROUGHPUT SEQUENCING OF INTEGRATION PRODUCTS REVEALS SEQUENCE-SPECIFIC INTEGRATION. A, Schematic of the workflow for high-throughput
sequencing analysis of the integration sites. B, Raw map of the total reads along pCRISPR before collapsing into single peaks of protospacer–pCRISPR junctions depicted in Fig. 4. C, Same as
B, except for the pUC19 target. D, Sequence of the leader-end of the CRISPR locus in _E. coli_. E, F, WebLogo analysis from the −5 to +5 positions surrounding the protospacer integration
sites on the plus (E) and minus (F) of pCRISPR. The arrow points to the nucleotide that is covalently joined to the protospacer. G, H, Same as E, F, except for the pUC19 target. EXTENDED
DATA FIGURE 9 CAS1–CAS2 CORRECTLY ORIENTS THE PROTOSPACER DNA DURING INTEGRATION. A–F, Mapped integration sites along the CRISPR locus of pCRISPR when using protospacer DNA with nucleotide
ends ‘wild-type’ 3′ C and 3′ T (A), 3′ A and 3′ T (C), and 3′ C and 3′ C (E). The red arrow in C and E points to the nucleotide change in the protospacer DNA compared to the ‘wild-type’
sequence in A. The protospacer DNA 3′ nucleotide and the CRISPR locus strand biases in A, C, E are plotted in B, D and F, respectively, as percentages of integration events within the CRISPR
locus. The black and clear bars represent the (−) and (+) strands of the CRISPR locus, respectively. NS corresponds to not significant and _P_ < 0.0001 by chi-square test. The _n_ values
for B, D and F are 5,623, 5,685 and 12,453 reads along the CRISPR locus, respectively. EXTENDED DATA FIGURE 10 MODEL OF THE CRISPR–CAS ADAPTIVE IMMUNITY PATHWAY IN _E. COLI_. Mature
double-stranded protospacers bearing a 3′ C-OH are site-specifically integrated into the leader-end of the CRISPR locus. Correct protospacer integration (left) results in the 5′G/3′C as the
first nucleotide of the spacer, proximal to the leader. After transcription of the CRISPR locus and subsequent crRNA processing, foreign DNA destruction is initiated by strand-specific
recognition of the 3′-TTC-5′ PAM sequence in the target strand by the crRNA-guided Cascade complex. Incorrect protospacer integration (right) cannot initiate foreign DNA destruction due to
the inability for the crRNA to recognize the strand with the 3′-TTC-5′ PAM. Thus, foreign DNA interference during CRISPR–Cas adaptive immunity relies on the Cas1–Cas2 complex for correctly
orienting the protospacer during integration. POWERPOINT SLIDES POWERPOINT SLIDE FOR FIG. 1 POWERPOINT SLIDE FOR FIG. 2 POWERPOINT SLIDE FOR FIG. 3 POWERPOINT SLIDE FOR FIG. 4 POWERPOINT
SLIDE FOR FIG. 5 RIGHTS AND PERMISSIONS Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Nuñez, J., Lee, A., Engelman, A. _et al._ Integrase-mediated spacer acquisition during
CRISPR–Cas adaptive immunity. _Nature_ 519, 193–198 (2015). https://doi.org/10.1038/nature14237 Download citation * Received: 06 November 2014 * Accepted: 15 January 2015 * Published: 18
February 2015 * Issue Date: 12 March 2015 * DOI: https://doi.org/10.1038/nature14237 SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get
shareable link Sorry, a shareable link is not currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative