Abstract
Multiple methods have been introduced over the past 30 years to identify the genomic insertion sites of transposable elements and other DNA elements that integrate into genomes. However, each of these methods suffer from limitations that can frustrate attempts to map multiple insertions in a single genome and to map insertions in genomes of high complexity that contain extensive repetitive DNA. I introduce a new method for transposon mapping that is simple to perform, can accurately map multiple insertions per genome, and generates long sequence reads that facilitate mapping to complex genomes. The method, called TagMap, for Tagmentation-based Mapping, relies on a modified Tn5 tagmentation protocol with a single tagmentation adaptor followed by PCR using primers specific to the tranposable element and the adaptor sequence. Several minor modifications to normal tagmentation reagents and protocols allow easy and rapid preparation of TagMap libraries. Short read sequencing starting from the adaptor sequence generates oriented reads that flank and are oriented toward the transposable element insertion site. The convergent orientation of adjacent reads at the insertion site allows straightforward prediction of the precise insertion site(s). A Linux shell script is provided to identify insertion sites from fastq files.