Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

An improved genome assembly uncovers prolific tandem repeats in Atlantic cod

Ole K. Tørresen, Bastiaan Star, Sissel Jentoft, William B. Reinar, Harald Grove, Jason R. Miller, View ORCID ProfileBrian P. Walenz, James Knight, Jenny M. Ekholm, Paul Peluso, Rolf B. Edvardsen, Ave Tooming-Klunderud, Morten Skage, Sigbjørn Lien, Kjetill S. Jakobsen, View ORCID ProfileAlexander J. Nederbragt
doi: https://doi.org/10.1101/060921
Ole K. Tørresen
1Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo., Oslo, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bastiaan Star
1Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo., Oslo, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sissel Jentoft
1Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo., Oslo, Norway.
2Department of Natural Sciences, University of Agder, Kristiansand, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
William B. Reinar
1Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo., Oslo, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Harald Grove
3Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, NO-1432, Ås, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jason R. Miller
4J. Craig Venter Institute, 9704 Medical Center Drive, 20850, Rockville, MD, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Brian P. Walenz
5Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Brian P. Walenz
James Knight
6Yale School of Medicine, Yale University, 06520, New Haven, CT, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jenny M. Ekholm
7Pacific Biosciences, Menlo Park, CA, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Paul Peluso
7Pacific Biosciences, Menlo Park, CA, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rolf B. Edvardsen
8Institute of Marine Research, P.O. Box 1870, Nordnes, NO-5817, Bergen, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ave Tooming-Klunderud
1Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo., Oslo, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Morten Skage
1Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo., Oslo, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sigbjørn Lien
3Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, NO-1432, Ås, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kjetill S. Jakobsen
1Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo., Oslo, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexander J. Nederbragt
1Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo., Oslo, Norway.
9Biomedical Informatics Research Group, Department of informatics, University of Oslo., Oslo, Norway.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alexander J. Nederbragt
  • For correspondence: lex.nederbragt@ibv.uio.no
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Background: The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies.

Results: By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21 % of the TRs across the assembly, 19 % in the promoter regions and 12 % in the coding sequences are heterozygous in the sequenced individual.

Conclusions: The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted December 19, 2016.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
Ole K. Tørresen, Bastiaan Star, Sissel Jentoft, William B. Reinar, Harald Grove, Jason R. Miller, Brian P. Walenz, James Knight, Jenny M. Ekholm, Paul Peluso, Rolf B. Edvardsen, Ave Tooming-Klunderud, Morten Skage, Sigbjørn Lien, Kjetill S. Jakobsen, Alexander J. Nederbragt
bioRxiv 060921; doi: https://doi.org/10.1101/060921
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
Ole K. Tørresen, Bastiaan Star, Sissel Jentoft, William B. Reinar, Harald Grove, Jason R. Miller, Brian P. Walenz, James Knight, Jenny M. Ekholm, Paul Peluso, Rolf B. Edvardsen, Ave Tooming-Klunderud, Morten Skage, Sigbjørn Lien, Kjetill S. Jakobsen, Alexander J. Nederbragt
bioRxiv 060921; doi: https://doi.org/10.1101/060921

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4235)
  • Biochemistry (9136)
  • Bioengineering (6784)
  • Bioinformatics (24000)
  • Biophysics (12129)
  • Cancer Biology (9534)
  • Cell Biology (13778)
  • Clinical Trials (138)
  • Developmental Biology (7635)
  • Ecology (11702)
  • Epidemiology (2066)
  • Evolutionary Biology (15513)
  • Genetics (10644)
  • Genomics (14326)
  • Immunology (9482)
  • Microbiology (22839)
  • Molecular Biology (9090)
  • Neuroscience (48993)
  • Paleontology (355)
  • Pathology (1482)
  • Pharmacology and Toxicology (2570)
  • Physiology (3846)
  • Plant Biology (8331)
  • Scientific Communication and Education (1471)
  • Synthetic Biology (2296)
  • Systems Biology (6192)
  • Zoology (1301)