Abstract
The ability to construct defined genetic mutations in many bacteria is difficult and limited. Transposon mutagenesis is often highly efficient, but is not site specific, thus selections are often needed to identify mutants of interest. The construction of arrayed mutant libraries would help to fill this need, though these libraries are costly and time consuming. To enable easier construction of arrayed libraries we developed a workflow and methodology using a hierarchical barcoding scheme to identify mutants within a multiwell plate. We applied this method to the marine Alphaproteobacterium Ruegeria pomeroyi DSS-3 and created a library with over 2,800 disrupted genes.
Introduction
Heterotrophic marine bacteria shape global biogeochemical cycles and drive nutrient cycling in the surface oceans (Karl & Church, 2014). While there has been much progress over the last two decades in understanding the genetic underpinnings of these cycles, many gaps remain. The explosion of omics data collection over the past two decades has provided extensive data at the community level, enabling hypothesis generation (Faure et al., 2021). More generally, the number of genes with unknown function in bacteria remains high. Even with the most recent computational tools, about 30% of microbial genes have an unknown function (Vanni et al., 2022).
Transposon mutant libraries and next generation sequencing have enabled systems level investigations of bacterial genetics and physiology. Barcoded transposons and the associated Bar-seq workflow has further simplified these genome level analyses (Wetmore et al., 2015). In Bar-seq, each transposon possesses a 20-bp random barcode. After an initial Tn-Seq to identify the genomic location and barcode sequence of the transposon, only amplification of the 20-bp barcode is needed for subsequent experiments, greatly speeding and simplifying the workflow. In fact, this method was used to provide phenotypic data for 11,779 proteins with unknown function (Price et al., 2018).
Starting with a barcoded library, Shiver et al. described a technique to construct an ordered library of mutants in a strict anaerobic bacterium (Shiver et al., 2021). This technique used a grouped pooling approach, where the location of each mutant in the plate is deconvoluted computationally. Similarly, we describe a method for constructing an arrayed transposon mutant library that relies on the 20-bp barcode for identification. However, our method uses a hierarchical barcoding scheme to identify the row, column, and plate where each mutant originated. A single PCR reaction is performed with each mutant arrayed in a 384-well plate as template. A series of 16 and 24 primers that correspond to each row and column of a 384-well plate. The products from each plate are then pooled and amplified with Illumina indices. Each amplicon thus contains the information to identify the 20-bp barcode, column and row position, and plate from which the barcode originated, in a technique dubbed PlateSeq. We applied PlateSeq to the marine Alphaproteobacterium Ruegeria pomeroyi DSS-3 and obtained arrayed mutants for over 2800 genes, demonstrating feasibility of the methods and providing a useful set of genetic mutants for research. An overview and flow chart of the methods used for the Tn-Seq and subsequent arraying and plate mapping is given in Figure 1.
Materials and methods
Media
½ YTSS broth is prepared by adding 2 g of yeast extract, 1.25 g of tryptone, 20 g of sea salts to 1 L of deionized water (dH2O).
Luria-Bertani broth (LB) is prepared by adding 10 g of tryptone, 5 g yeast extract, and 10 g NaCl.
100 mg L-1 kanamycin (kan) in dH2O
150 mM Diaminopimelic acid (DAP)
For solid media, add 15 g/L agar.
384-well plates with 80 μL of ½ YTSS with 100 mg L-1 kan
384-well plates with 60 μL of ½ YTSS with 100 mg L-1 kan
Sterile 80% glycerol
Bacterial strains
R. pomeroyi is stored in 20% glycerol at −80 C and grown on ½ YTSS medium.
E. coli WM3064 with barcoded pkmw7 is stored in 20% glycerol stock at −80 C. and grown on LB with 100 mg L-1 kan
R. pomeroyi Tn-5 library is stored in 20% glycerol stock at −80 C and grown on ½ YTSS with solid media with on 100 mg L-1 kan.
Reagents
0.25% xylene cyanole dye is prepared by adding 0.0125 g of xylene cyanole and 3.53 mL 85% glycerol to 1.47 mL dH2O and filtering through a 0.2 μM filter.
25 mM Deoxynucleotide (dNTP) Solution Mix
5x OneTaq Standard Reaction Buffer (New England Biolabs)
OneTaq DNA polymerase (New England Biolabs)
Q5 High-Fidelity 2X Master Mix (New England Biolabs)
1X OneTaq Master Mix is prepared according to manufacturer protocol. 1X OneTaq Master Mix
DNA gel extraction kit
PCR purification kit
NEBNext Ultra II FS DNA Library Prep with Sample Purification Beads (NEB #E6177)
80% Ethanol (freshly prepared)
Nuclease-free water
0.1X TE (pH 8.0)
Primers
TA_Adaptor_Top - /5Phos/CTCACCGCTCTTGTAGCTGTCTCTTATACACATCTCCGAGCCCACGAGAC
TA_Adaptor_Bottom - CTACAAGAGCGGTGAGT
2815: 5’-CCTACACGACGCTCTTCCGATCTGTCTCGTGGGCTCGGAGATGTGTATAAG-3’
2814: 5’-GAGTTCAGACGTGTGCTCTTCCGATCTTCACTCACCGCCCTGCAGGGATGTCCACGAGGTCTC-3’
BSUp and BSDn Primers Listed in Supplementary table 1
Unique Dual Index (UDI) Illumina Adapters
Consumables
384-well polypropylene plates
384-pin replicator
384-well polypropylene PCR plates
Ultracold compatible aluminum seals
Polypropylene PCR microplate seals
Equipment
Opentrons OT-2 liquid handler
Opentrons 8-Channel Electronic Pipette P20
Multidrop 384 well plate dispenser
Thermo Scientific E1-ClipTip Electronic Multichannel Pipettes 16-channel
Microplate film sealing roller
Genetix Qpix2 colony-picker
Thermocycler
1. LIBRARY CONSTRUCTION
We constructed the barcoded transposon library by conjugating R. pomeroyi with E. coli WM3064 containing the pKMW7 Tn5 library (strain APA766) (Wetmore et al., 2015). A filter mating protocol was used here (Stoudenmire et al., 2019), though other methods may be suitable for different strains.
Filter mating
From isolated colonies, grow cultures of E. coli donor in 5 mL LB broth with 100 mg L-1 kan supplemented with DAP (0.3 mM) at 37° C and R. pomeroyi recipient in 5 mL ½ YTSS at 30° C for approximately 16 hours.
Dilute cultures 100-fold in 5 mL of respective broth with antibiotic and supplement and incubate in respective incubation temperature until they have reached mid-log phase.
Centrifuge 500 μL of E. coli donor with ½ YTSS at 4,000 x g for 2 min to remove antibiotic, discard supernatant and wash twice.
Resuspend in 500 μL of ½ YTSS.
Combine 500 μL of R. pomeroyi mid-log culture with the 500 μL of the washed E. coli donor.
Run mating mix through membrane filter and wash with 2-3 times that volume of ½ YTSS supplemented with DAP (0.3 mM).
Remove membrane filter from filter flask and place in ½ YTSS. Vortex to remove the cell from the filter and discard the filter.
Centrifuge mix at 4,000 x g for 2 min, resuspend in 1 mL of 1/2 YTSS and wash twice to remove DAP.
Plate 500 μL (or a volume that obtains isolated colonies) on ½ YTSS with kan.
Incubate plate at 30° C for approximately 48 hours.
Collect transconjugant colonies by scraping the plate or washing with ½ YTSS broth. Mix with 50 % glycerol and store at −80° C.
2. TRANSPOSON MAPPING PREPARATION
Genomic DNA extraction of library
Genomic DNA can be extracted with the user’s method of choice. We prefer the method below because it is fast and yields high purity DNA.
Pellet 1 mL of the transposon library by centrifugation at about 5,000 x g for 5 min and decant the supernatant.
Resuspend the pellet in sterile dH2O for a total volume of 500 μL.
Add 500 μL of phenol/chloroform/isoamyl alcohol (25:24:1) and pipette to mix.
Centrifuge mixture at 12,000 x g for 5 min.
Transfer the aqueous layer to a clean 1.5 mL tube, avoiding the white film and organic layer.
Purify using a column-based purification kit and elute in 30 μL of elution buffer.
Quantify genomic DNA (gDNA) by QuBit and ensure genomic DNA is clean-with 260/230 and 260/280 being greater than 1.8. Store gDNA at −20 ° C.
Transposon sequencing library (TnSeq) preparation
Our group adapted a transposon insertion sequencing library preparation protocol(Transposon Insertion Sequencing (Tn-Seq) Library Preparation Protocol - Includes UMI for PCR Duplicate Removal, n.d.) and scaled it down by half the total reaction amount. Instead of inserting UMIs into sequenced regions, our 20-bp inline barcodes identifies the transposon insertion before subsequent PCR amplification… Our top adapotor’s top and bottom primer sequences also differ in that…
Part 1: DNA Fragmentation, end-repair, and adaptor ligation
Before beginning:
Fully thaw and quickly vortex all reagents to mix; briefly spin-down reagents.
Once thawed and mixed, keep all reagents on ice.
Adaptor annealing:
To anneal the adaptor, mix 15 μL adaptor top F Primer (100 μM), 15 μL 2730 adaptor bottom R Primer (100 μM), and 70 μL nuclease-free H2O in a 0.2-mL PCR tube.
Run the following annealing program in thermocycler.
Enzymatic fragmentation, end-repair, and ligation:
Aliquot 50 ng of library gDNA into a 0.2mL PCR tube and use 1X TE to bring total volume to 13 μL.
To fragment DNA add 3.5 μL Ultra II FS Buffer and 1 μL Ultra II FS Enzyme Mix. Pipette up and down several times to mix.
Run the following program in a thermocycler with heated lid set to 75°C:
Sample can be stored at −20 °C. However, this is not recommended because yield is often decreased.
Add 15 μL Ultra II Ligation Mix, 0.5 μL Ligation Enhancer, and 1.25 μL Palani adaptor to reaction, mix well.
Run the following NEBNext Ligation program in thermocycler with heated lid off.
DNA Cleanup and Size Selection with Sample Purification Beads:
Transfer all 34.25 μL of adaptor-ligated reaction to a 1.5mL microcentrifuge tube and add 15.75 μL 0.1x TE (dilute provided 1x TE 1:10) to bring volume to 50 μL. Pipette to mix.
Briefly vortex NEBNext Sample Purification Beads and add 15 μL to sample. Pipette to mix.
Incubate at room temperature for 5 min, briefly spin down, then place in magnetic tube rack.
Once solution becomes clear (about 3-5 min), transfer supernatant (65 μL) to a new 1.5 mL microcentrifuge tube.
Add an additional 7.5 μL magnetic beads and pipette to mix. Incubate at room temperature for 5 min, briefly spin down, then place on magnetic tube rack.
Once the solution becomes clear (about 3-5 min), remove and discard supernatant.
Add 200 μL 80% EtOH, avoiding disturbing the bead pellet. Let sit for 30 sec, then remove and discard EtOH.
Repeat step (7) once more.
Open tube cap to dry the bead pellet at room temperature for up to 5 min. Carefully aspirate by pipette any remaining droplets if present, without disturbing bead pellet. Bead pellet is dry when dark brown, glossy, and no remaining droplets are visible. Do not over dry- if bead pellet becomes light brown and cracked, sample recovery will fail.
Remove from magnetic rack and add 10 μL EB (warmed to 55°C). Pipette to mix and briefly spin down. Incubate for 2 min at room temperature, then place back on magnetic rack.
Once clear, transfer supernatant to new 1.5 mL microcentrifuge tube. Proceed to Part 2: Transposon Enrichment PCR or store sample at −20° C
Part 2: Transposon Enrichment PCR
To amplify the transposon junction, set up the following reaction in a 0.2 mL PCR tube.
Run the following NEBNext Enrichment PCR program in thermocycler.
Perform 1.2x Magnetic Bead DNA Cleanup as follows.
Transfer all 25 μL of enrichment PCR to a 1.5mL microcentrifuge tube and add 25 μL 0.1x TE (dilute provided 1x TE 1:10) to bring volume to 50 μL. Pipette to mix.
Briefly vortex NEBNext Sample Purification Beads and add 60 μL to sample. Pipette to mix.
Incubate at RT for 5 min, briefly spin down, then place in magnetic tube rack.
Once the solution becomes clear (about 3-5 min), remove and discard supernatant.
Add 200 μL 80% EtOH, avoiding disturbing the bead pellet. Let sit for 30 sec., then remove and discard EtOH.
Repeat step (5) once more.
Open tube cap to dry the bead pellet at room temperature for up to 5 min. Carefully aspirate by pipette any remaining droplets if present, without disturbing bead pellet. Bead pellet is dry when dark brown, glossy, and no remaining droplets are visible. Do not over dry- if bead pellet becomes light brown and cracked, sample recovery will fail.
Remove from magnetic rack and add 10 μL Buffer EB (warmed to 55°C). Pipette to mix and briefly spin down. Incubate for 2 min at room temperature, then place back on magnetic rack.
Once clear, transfer supernatant to new 1.5 mL microcentrifuge tube.
Quantify cleaned-up enriched DNA by Qubit, using 1 μL. Record concentration. Proceed to Part 3 or store sample at −20 °C.
Part 3: Indexing PCR for Illumina Sequencing
To anneal Unique Dual Indexes to enriched DNA, set up the following reaction in 0.2mL PCR tube:
Run the following NEBNext Index PCR program in the thermocycler.
Perform 1.2x Magnetic Bead DNA Cleanup as follows:
Transfer all 25 μL of indexing PCR to a 1.5 mL microcentrifuge tube and add 25 μL 0.1x TE (dilute provided 1x TE 1:10) to bring volume to 50 μL.
Briefly vortex NEBNext Sample Purification Beads and add 60 μL to sample.
Incubate at room temperature for 5 min, then place in magnetic tube rack.
Once the solution becomes clear (about 3-5 min), remove and discard supernatant.
Add 200 μL 80% EtOH, let sit for 30 sec, then remove and discard EtOH.
Repeat step (5) once more.
Open tube cap to dry the bead pellet at room temperature for up to 5 min. Bead pellet is dry when no remaining droplets surround it. It should be dark brown and glossy. Do not over dry. If it becomes light brown and cracked, sample recovery will fail
Remove from magnetic rack and add 10 μL EB (warmed to 55°C), incubate for 2 min at room temperature, then place back on magnetic rack.
Once clear, transfer supernatant to new 1.5 mL microcentrifuge tube.
Quantify cleaned-up indexed DNA by Qubit. Concentration should be higher than the recorded concentration of cleaned-up enriched DNA that was obtained in Part 2, step 10 of this protocol.
Assess the size and quality of cleaned-up indexed DNA by Bioanalyzer before sending for Illumina sequencing.
3. LIBRARY ARRAYING AND PLATE MAPPING (Figure 1)
Spread the transposon mutant library onto agar plates so that isolated colonies are obtained after 2-3 days of incubation at 30° C.
Inoculate each well with a single colony from agar plate and incubate at 30 ° C for 2-3 days until growth is visible in wells. This original microplate will serve as inoculum for the replicate microplate and for the subsequent PCR protocol.
Replicate the 384-well culture plate into a plate with 60 μL of ½ YTSS with 100 mg L-1 kan using a 384-pin replicator and incubate at 30° C for 2-3 days.
Seal the original microplate with aluminum seal and store in - 80° C.
Add 20 μL of 70 % glycerol solution with a multidrop reagent dispenser to each well of the replicate microplates.
Seal replicate microplate with aluminum seal and invert 3-5 times to mix glycerol with culture in each well and store at −80° C. The replicate microplate serves as long-term storage of bacterial glycerol cultures.
Primer plate preparation and PCR 1
In a deep 384-well plate, combine the 16 BSUp and 24 BSDown primers so that a unique primer pair is in each 384-well at a final concentration of 0.25 μM.
Distribute 5-10 μL of the primers into 384-well PCR plates using the Opentrons liquid handler.
Dehydrate the primer aliquots by incubating at 72° C until liquid is no longer visible. The aliquoted plates can be stored with desiccation until usage.
Aliquot 5-10 μL of the 1X OneTaq master mix into each well of the 384-well primer PCR plate. Keep on ice or store at 4 ° C until needed.
Thaw the stored 384-well culture plates from the −80° C and then briefly centrifuge to remove culture from the aluminum seal.
Remove the aluminum seal slowly to avoid cross contamination by splatter.
Sterilize the 384-pin replicator with 70% isopropyl alcohol and flame to evaporate alcohol. Allow replicator to cool and dry before proceeding to the next step.
Stamp replicator into the 384-well culture plate and lightly mix within wells to pick up settled cells.
Stamp replicator into the 384-well primer PCR plate containing 5 μL of 1X OneTaq. Ensure that pins reach the bottom of each well in the PCR microplate.
Repeat the above steps to make a duplicate PCR microplate.
Seal the duplicate plates with polypropylene PCR plate seals.
Briefly centrifuge the plates, so that contents reach the bottom of the wells.
Place PCR plates in thermocycler and run the following program:
Using a multi-channel pipette or the Opentrons liquid handler, transfer 5 μL of the completed PCR in each well to a 96-well plate. Collect all the reactions that correspond to one 384-well PCR plate in a 1.5-2 mL microcentrifuge tube.
Continue this process until all “Step 1” PCRs have been consolidated from one 384-well PCR plate to one microcentrifuge tube accordingly and store at –20° C.
Resolve 5 μL of the unpurified pooled Step 1 PCR on a 2% agarose gel to ensure size of about 120 bp.
Purify the consolidated pool using a spin column purification kit.
PCR 2
Amplify 300-600 ng of the “Step 1” product to add unique dual indexes and Illumina adapters with Q5 polymerase using the following program.
Run “Step 2” PCR product on a 2% agarose gel. The product size should be about 250 bp.
Excise the gel fragment and store in a 1.5 mL microcentrifuge tube.
Gel purify using a column purification kit.
Quantify purified DNA by Qubit.
For Illumina sequencing, pool samples at equal concentrations.
4. COMPUTATIONAL MAPPING
Illumina paired end .fastq reads are processed using the FEBA program described previously (Wetmore et al., 2015), the code is available in bitbucket (https://bitbucket.org/berkeleylab/feba/src). The FEBA workflow provides a table output with the barcodes and genomic insertion locations (Tnpool.txt converted to a .csv file). These tables are then used by our in-house programs that identify the plate location for each barcode. These programs are made up of the KEIO.py script, keio.R script and the process.R script. The scripts for the latter analysis can be found in the GitHub repository (https://github.com/ravinpoudel/KEIO.git). For this analysis it is recommended to use an anaconda environment (anaconda.com) to run the KEIO.py and associated R scripts.
1.Run Illumina sequencing reads received as .fastq files through the FEBA program.
2.Convert TnPool.txt FEBA output file to .csv by changing the file extension.
3.Install Anaconda and create a conda environment with the following dependencies: VSEARCH, pybedtool, NMSLib, Biopython, and Pandas using the following code on the command prompt.
5. Inside the keio conda environment install the KEIO program from https://github.com/ravinpoudel/KEIO.git with the following commands
6. In the keio environment run KEIO.py with the following inputs: -- fastq: the forward Illumina sequencing file for the 384-well arrayed library. –upstreamFASTA: a .fasta file with the unique 16 BSup multiplexing primer sequences. –downstreamrcFASTA: a .fasta file with the unique 24 BSDown primers reverse complement sequences. This program returns the BSUp and BSDown primer pairs with their associated 20 bp random barcode as a results.csv file.
4. Download and install R (https://cran.r-project.org)
5. Create a results directory
6. Run the R script keio.R using the KEIO output.csv as the input. This program returns a .csv file output summarized.csv with the BSUp, BSDown and the 20 bp random barcode associated with the primers and their count statistics.
7. Download a general feature format file (.gff) and save it as a .csv file by changing the extension.
8. Critical step. Update the process.R script to tailor the gff.csv input to the desired organisms gff file.
9. Run the process.R script with the following inputs: the gff.csv file, the summarized.csv files and the Tnpool.csv FEBA output. This program outputs a final results.csv file. This file contains the 384 well plate 20 bp random barcode and associated statistics with the transposon location within the genome and genomic location annotations.
This pipeline will result in a final .csv file with the results for the 384-well plate random barcoded libraries. The final results.csv file will contain the well location for every mutant mapped with its corresponding 20 bp random barcode across all libraries. The results.csv file includes the 20 bp random barcode statistics as well as the transposon location for each mutant per well with its locus tag and other functional annotations.
5. REARRAYING
The arrayed 384-well libraries can be re-arrayed into 96- or 384-well plates to minimize the work required for genome wide screens. The Opentrons liquid handler is used to cherry-pick from the original plate into a new plate with other unique mutants.
Compile desired well locations from the arrayed 384-well libraries and allocate new well locations in destination source plates. Arrange this in a .csv format.
Prepare a .py script for the cherry-picking protocol, following the OT-2 Python Protocol API format.
Import .csv file into .py script. Perform a practice run of final .py as Python protocol on Opentrons liquid handler for further optimization.
Thaw glycerol 384-well source plate(s).
Aliquot 300 μL of ½ YTSS in deep 96-well plate(s). These will serve as destination microplates. Briefly spin down thawed source microplates to bring contents to the bottom of the wells.
Carefully remove foil on source microplates and cover with lids
Place source plate(s), destination plate(s), and tip rack (s) in designed slots of deck in Opentrons liquid handler.
Proceed to run cherry-picking protocol on Opentrons liquid handler.
Once cherry-picking is complete, prepare serial dilutions of source microplates in ½ YTSS at dilution factor necessary to obtain isolated colonies. Spot 10-20 μL of each series onto ½ YTSS kan.
Incubate spot plate (s) at 30° C for approximately 48 hours.
Inoculate isolated colonies into 384-well plates containing 80 μL of ½ YTSS kan. Incubate at 30° C for approximately 48 hours and replicate onto new 384-well plate containing 60 μL of ½ YTSS kan and store at –80° C.
Incubate replicated plate at 30° C for approximately 48 hours and save with 20 μL of 70% glycerol solution at –80° C. Follow previously described library arraying and plate mapping workflow (Sections 3 and 4) to confirm sequencing of rearrayed mutants.
RESULTS
Using the methods described above, a total of four pooled transposon mutant libraries of R. pomeroyi were constructed to ensure a rich diversity of mutants. Random barcode transposon-site sequencing (RB-TnSeq) revealed 88.8% of all R. pomeroyi genes had a transposon insertion, with the transposon inserted in the same orientation as the gene 56.1% of the time. A total of 270,510 20-bp barcodes mapped to a transposon insertion in the genome. The pooled libraries were arrayed into 384-well plates inoculated with isolated colonies by an robotic colony picker, resulting in a total of 125 plates. The plate mapping protocol dubbed PlateSeq described here identified 27,448 of the 48,000 possible mutants in the arrayed plates (Table 1), while the remainder had barcodes that were not present in the TnSeq. This suggests that deeper sequencing in the TnSeq could lead to identification of more arrayed mutants. Regardless, the arrayed mutants represent 73.7% of all R. pomeroyi genes with a median of three mutants per gene.
PlateSeq analysis shows of the total arrayed library transposon mutants 90% map to the chromosome and 9.9% map to the megaplasmid of R. pomeroyi (Figure 2A). The arrayed libraries have a median of 1 barcode and an average of 1.6 barcodes sequenced per well (Table 1). In 66.7% of wells sequenced, only one unique barcode was mapped, indicating a single mutant in those wells (Table 1). In wells with multiple mapped barcodes the most abundant barcode accounts for an average of 91% of total reads for the well (Table 1). While the second most abundant barcode accounts for an average of 20% or less of the barcode reads of their well. Sanger sequencing of select arrayed mutants was used to verify that the barcode identified through NGS was in fact present in the well. Isolated colonies from 206 wells confirmed that 88.4% of wells matched the most abundant barcode from the NGS data (Figure 2B).
Isolates from 8 of the remaining wells matched the second most abundant barcode found by NGS. These results confirm that there was in fact more than mutant present in many wells, though the correct mutant could still be isolated from a majority of wells.
Acknowledgements
This work was supported by the USDA Agricultural Research Services (agreement #6066-21310-005-28-S), Simon’s Foundation, and National Science Foundation Center for Chemical Currencies on a Microbial Planet (OCE 2019589).