Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Strong spurious transcription likely a cause of DNA insert bias in typical metagenomic clone libraries

Kathy N. Lam, Trevor C. Charles
doi: https://doi.org/10.1101/013763
Kathy N. Lam
1Department of Biology, University of Waterloo, Waterloo, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Trevor C. Charles
1Department of Biology, University of Waterloo, Waterloo, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

ABSTRACT

Background Clone libraries provide researchers with a powerful resource with which to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, as well as allowed the mining of novel enzymes for specific functions of interest. These libraries are often constructed by cloning large-inserts (∼30 kb) into a cosmid or fosmid vector. Recently, there have been reports of GC bias in fosmid metagenomic clone libraries, and it was speculated that the bias may be a result of fragmentation and loss of AT-rich sequences during the cloning process. However, evidence in the literature suggests that transcriptional activity or gene product toxicity may play a role in library bias.

Results To explore the possible mechanisms responsible for sequence bias in clone libraries, and in particular whether fragmentation is involved, we constructed a cosmid clone library from a human microbiome sample, and sequenced DNA from three different steps of the library construction process: crude extract DNA, size-selected DNA, and cosmid library DNA. We confirmed a GC bias in the final constructed cosmid library, and we provide strong evidence that the sequence bias is not due to fragmentation and loss of AT-rich sequences but is likely occurring after the DNA is introduced into E. coli. To investigate the influence of strong constitutive transcription, we searched the sequence data for consensus promoters and found that rpoD/σ70 promoter sequences were underrepresented in the cosmid library. Furthermore, when we examined the reference genomes of taxa that were differentially abundant in the cosmid library relative to the original sample, we found that the bias appears to be more closely correlated with the number of rpoD/σ70 consensus sequences in the genome than with simple GC content.

Conclusions The GC bias of metagenomic clone libraries does not appear to be due to DNA fragmentation. Rather, analysis of promoter consensus sequences provides support for the hypothesis that strong constitutive transcription from sequences recognized as rpoD/σ70 consensus-like in E. coli may lead to plasmid instability or loss of insert DNA. Our results suggest that despite widespread use of E. coli to propagate foreign DNA, the effects of in vivo transcriptional activity may be under-appreciated. Further work is required to tease apart the effects of transcription from those of gene product toxicity.

  • LIST OF ABBREVIATIONS

    CE,
    crude extract;
    SS,
    size-selected;
    CL,
    cosmid library;
    F,
    forward reads;
    R,
    reverse reads.
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
    Back to top
    PreviousNext
    Posted January 15, 2015.
    Download PDF

    Supplementary Material

    Email

    Thank you for your interest in spreading the word about bioRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Strong spurious transcription likely a cause of DNA insert bias in typical metagenomic clone libraries
    (Your Name) has forwarded a page to you from bioRxiv
    (Your Name) thought you would like to see this page from the bioRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Strong spurious transcription likely a cause of DNA insert bias in typical metagenomic clone libraries
    Kathy N. Lam, Trevor C. Charles
    bioRxiv 013763; doi: https://doi.org/10.1101/013763
    Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
    Citation Tools
    Strong spurious transcription likely a cause of DNA insert bias in typical metagenomic clone libraries
    Kathy N. Lam, Trevor C. Charles
    bioRxiv 013763; doi: https://doi.org/10.1101/013763

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Molecular Biology
    Subject Areas
    All Articles
    • Animal Behavior and Cognition (4095)
    • Biochemistry (8786)
    • Bioengineering (6493)
    • Bioinformatics (23386)
    • Biophysics (11766)
    • Cancer Biology (9167)
    • Cell Biology (13290)
    • Clinical Trials (138)
    • Developmental Biology (7422)
    • Ecology (11386)
    • Epidemiology (2066)
    • Evolutionary Biology (15119)
    • Genetics (10413)
    • Genomics (14024)
    • Immunology (9145)
    • Microbiology (22108)
    • Molecular Biology (8793)
    • Neuroscience (47445)
    • Paleontology (350)
    • Pathology (1423)
    • Pharmacology and Toxicology (2483)
    • Physiology (3711)
    • Plant Biology (8063)
    • Scientific Communication and Education (1433)
    • Synthetic Biology (2215)
    • Systems Biology (6021)
    • Zoology (1251)