Complete vertebrate mitogenomes reveal widespread gene duplications and repeats
Abstract
Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. As part of the Vertebrate Genomes Project (VGP) we have developed mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (>10 kbp, PacBio or Nanopore) and short (100-300 bp, Illumina) reads. Our pipeline led to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We have observed that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we have identified errors, missing sequences, and incomplete genes in those references, particularly in repeat regions. Our assemblies have also identified novel gene region duplications, shedding new light on mitochondrial genome evolution and organization.
Competing Interest Statement
V. C., S. M. and D. F. are employees of Oxford Nanopore Technologies Limited. J. K. is Chief Scientific Officer of Pacific Biosciences.
Footnotes
↵+ First author
Author list.
Subject Area
- Biochemistry (11753)
- Bioengineering (8754)
- Bioinformatics (29205)
- Biophysics (14975)
- Cancer Biology (12102)
- Cell Biology (17414)
- Clinical Trials (138)
- Developmental Biology (9423)
- Ecology (14185)
- Epidemiology (2067)
- Evolutionary Biology (18309)
- Genetics (12246)
- Genomics (16805)
- Immunology (11870)
- Microbiology (28098)
- Molecular Biology (11598)
- Neuroscience (60979)
- Paleontology (452)
- Pathology (1871)
- Pharmacology and Toxicology (3238)
- Physiology (4960)
- Plant Biology (10427)
- Synthetic Biology (2886)
- Systems Biology (7341)
- Zoology (1651)