bioRxiv
New Results

Depth and evenness of sequence coverage are associated with assembly quality, genome structure, and choice of sequencing platform in archived plastid genomes

Nils Jenke, Michael Gruenstaeudl
doi: https://doi.org/10.1101/2022.05.06.490930
Nils Jenke
1 Institut für Bioinformatik, Freie Universität Berlin, 14195 Berlin, Germany
Michael Gruenstaeudl
2 Institut für Biologie, Freie Universität Berlin, 14195 Berlin, Germany
For correspondence: m.gruenstaeudl@fu-berlin.de

ABSTRACT

In plastid genomes, the depth and evenness of sequence coverage are considered important indicators of assembly quality. However, the precise effects that sequencing depth and evenness have on the assembly of these genomes, as well as any differences across individual genome sections, have yet to be evaluated. This investigation aims to identify the impact of sequencing depth and evenness on the assembly of plastid genomes and to determine how both metrics relate to plastid genome structure. Specifically, we assess whether sequencing evenness and reduced sequencing depth are significantly correlated with, or differ significantly among, individual genome sections, assembly quality metrics, the sequencing platforms employed, and the software tools used for genome assembly. To that end, we retrieve published plastid genomes together with their sequence reads and genome metadata from public databases, measure sequencing depth and evenness across their sequences, and test several hypotheses on genome assembly and structure using non-parametric statistical tests. The results show significant differences in sequencing depth across the four structural partitions and between the coding and non-coding sections of the plastid genomes, a significant correlation between sequencing evenness and the number of ambiguous nucleotides per genome, and significant differences in sequencing evenness among sequencing platforms. Based on these results, we conclude that the observed differences and correlations are not a product of chance alone but possibly genuine manifestations of sequencing depth and evenness during the assembly of these genomes.
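
To make the two metrics concrete, the following minimal sketch computes mean sequencing depth and one possible evenness score from a list of per-base read depths. It is illustrative only and not the authors' pipeline: the evenness formula (mean of depths capped at the genome-wide average, divided by that average) is an assumption chosen for simplicity, and the function names, partition coordinates, and depth values are hypothetical.

```python
from statistics import mean


def mean_depth(depths):
    """Average number of reads covering each position."""
    return mean(depths)


def evenness(depths):
    """Assumed evenness score in [0, 1]; 1.0 means perfectly uniform coverage.

    Each per-base depth is capped at the overall mean, so positions with
    excess coverage cannot compensate for positions with too little.
    """
    avg = mean(depths)
    if avg == 0:
        return 0.0  # no reads mapped anywhere
    capped = [min(d, avg) for d in depths]
    return sum(capped) / (avg * len(depths))


def depth_by_partition(depths, partitions):
    """Mean depth per genome partition.

    partitions maps a partition name to a half-open (start, end) index range.
    """
    return {name: mean(depths[start:end])
            for name, (start, end) in partitions.items()}


if __name__ == "__main__":
    # Hypothetical toy genome of 12 positions split into the four
    # structural partitions of a typical plastid genome.
    toy_depths = [30, 30, 30, 30, 60, 60, 20, 20, 20, 60, 60, 60]
    toy_partitions = {
        "LSC": (0, 4),
        "IRb": (4, 6),
        "SSC": (6, 9),
        "IRa": (9, 12),
    }
    print("mean depth:", mean_depth(toy_depths))
    print("evenness:", round(evenness(toy_depths), 3))
    print("per partition:", depth_by_partition(toy_depths, toy_partitions))
```

In practice the per-base depths would come from mapping the archived sequence reads back to the published genome; per-partition values like these could then be compared with a non-parametric test, as the abstract describes.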

Competing Interest Statement

The authors have declared no competing interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted May 06, 2022.
bioRxiv 2022.05.06.490930; doi: https://doi.org/10.1101/2022.05.06.490930
Subject Area

  • Genomics