Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Chromosome-length genome assembly and structural variations of the primal Basenji dog (Canis lupus familiaris) genome

View ORCID ProfileRichard J. Edwards, View ORCID ProfileMatt A. Field, View ORCID ProfileJames M. Ferguson, View ORCID ProfileOlga Dudchenko, View ORCID ProfileJens Keilwagen, View ORCID ProfileBenjamin D. Rosen, Gary S. Johnson, Edward Rice, LaDeanna Hillier, View ORCID ProfileJillian M. Hammond, Samuel G. Towarnicki, Arina Omer, View ORCID ProfileKsenia Skvortsova, View ORCID ProfileOzren Bogdanovic, View ORCID ProfileRobert A. Zammit, View ORCID ProfileErez Lieberman Aiden, Wesley C. Warren, View ORCID ProfileJ. William O. Ballard
doi: https://doi.org/10.1101/2020.11.11.379073
Richard J. Edwards
1School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney NSW 2052, Australia. , ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Richard J. Edwards
  • For correspondence: Richard.edwards@unsw.edu.au o.bogdanovic@garvan.org.au s.towanicki@unsw.edu.au
Matt A. Field
2Centre for Tropical Bioinformatics and Molecular Biology, Australian Institute of Tropical Health and Medicine, James Cook University, Cairns, QLD 4878, Australia.
3John Curtin School of Medical Research, Australian National University, Canberra, ACT 2600, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Matt A. Field
  • For correspondence: matt.field@jcu.edu.au matt.field@jcu.edu.au
James M. Ferguson
4Kinghorn Center for Clinical Genomics, Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia. ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for James M. Ferguson
  • For correspondence: j.ferguson@garvan.org.au j.hammond@garvan.org.au
Olga Dudchenko
5The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA , , ,
6Department of Computer Science, Rice University, Houston, TX, USA. , , ,
7Center for Theoretical and Biological Physics, Rice University, Houston, TX, USA. ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Olga Dudchenko
  • For correspondence: Olga.Dudchenko@bcm.edu Arina.Omer@bcm.edu Zane.Colaric@bcm.edu erez@erez.com Olga.Dudchenko@bcm.edu Arina.Omer@bcm.edu Zane.Colaric@bcm.edu erez@erez.com Olga.Dudchenko@bcm.edu erez@erez.com
Jens Keilwagen
8Julius Kühn-Institut, Erwin-Baur-Str. 27 06484 Quedlinburg, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jens Keilwagen
  • For correspondence: jens.keilwagen@julius-kuehn.de
Benjamin D. Rosen
9Animal Genomics and Improvement Laboratory, Agricultural Research Service USDA, Beltsville, MD 20705.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Benjamin D. Rosen
  • For correspondence: ben.rosen@usda.gov
Gary S. Johnson
10College of Agriculture, Food & Natural Resources, University of Missouri, MO 65211 USA , ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: johnsongs@missouri.edu Edward.rice@missouri.edu warrenwc@missouri.edu
Edward Rice
10College of Agriculture, Food & Natural Resources, University of Missouri, MO 65211 USA , ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: johnsongs@missouri.edu Edward.rice@missouri.edu warrenwc@missouri.edu
LaDeanna Hillier
11Genome Sciences, University of Washington, WA 98195 USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: lhillier@uw.edu
Jillian M. Hammond
4Kinghorn Center for Clinical Genomics, Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia. ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jillian M. Hammond
  • For correspondence: j.ferguson@garvan.org.au j.hammond@garvan.org.au
Samuel G. Towarnicki
1School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney NSW 2052, Australia. , ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: Richard.edwards@unsw.edu.au o.bogdanovic@garvan.org.au s.towanicki@unsw.edu.au
Arina Omer
5The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA , , ,
6Department of Computer Science, Rice University, Houston, TX, USA. , , ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: Olga.Dudchenko@bcm.edu Arina.Omer@bcm.edu Zane.Colaric@bcm.edu erez@erez.com Olga.Dudchenko@bcm.edu Arina.Omer@bcm.edu Zane.Colaric@bcm.edu erez@erez.com
Ksenia Skvortsova
12Genomics and Epigenetics Division, Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia. ,
13St Vincent’s Clinical School, Faculty of Medicine, University of New South Wales, Sydney, NSW, 2010, Australia,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ksenia Skvortsova
  • For correspondence: k.skvortsova@garvan.org.au o.bogdanovic@garvan.org.au k.skvortsova@garvan.org.au
Ozren Bogdanovic
1School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney NSW 2052, Australia. , ,
12Genomics and Epigenetics Division, Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia. ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ozren Bogdanovic
  • For correspondence: Richard.edwards@unsw.edu.au o.bogdanovic@garvan.org.au s.towanicki@unsw.edu.au k.skvortsova@garvan.org.au o.bogdanovic@garvan.org.au
Robert A. Zammit
14Vineyard Veterinary Hospital, 703 Windsor Rd, Vineyard, NSW, 2765.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Robert A. Zammit
  • For correspondence: Robert@vineyardvet.com.au
Erez Lieberman Aiden
5The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA , , ,
6Department of Computer Science, Rice University, Houston, TX, USA. , , ,
7Center for Theoretical and Biological Physics, Rice University, Houston, TX, USA. ,
15Broad Institute of MIT and Harvard, Cambridge, MA, USA.
16Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech University, Shanghai, China.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Erez Lieberman Aiden
  • For correspondence: jwoballard@gmail.com warrenwc@missouri.edu erez@erez.com Olga.Dudchenko@bcm.edu Arina.Omer@bcm.edu Zane.Colaric@bcm.edu erez@erez.com Olga.Dudchenko@bcm.edu Arina.Omer@bcm.edu Zane.Colaric@bcm.edu erez@erez.com Olga.Dudchenko@bcm.edu erez@erez.com erez@erez.com erez@erez.com
Wesley C. Warren
10College of Agriculture, Food & Natural Resources, University of Missouri, MO 65211 USA , ,
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jwoballard@gmail.com warrenwc@missouri.edu erez@erez.com johnsongs@missouri.edu Edward.rice@missouri.edu warrenwc@missouri.edu
J. William O. Ballard
14Vineyard Veterinary Hospital, 703 Windsor Rd, Vineyard, NSW, 2765.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for J. William O. Ballard
  • For correspondence: jwoballard@gmail.com warrenwc@missouri.edu erez@erez.com Robert@vineyardvet.com.au
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Basenjis are considered an ancient dog breed of central African origins that still live and hunt with tribesmen in the African Congo. Nicknamed the barkless dog, Basenjis possess unique phylogeny, geographical origins and traits make understanding their genome structure relative to more modern dog breeds of great interest. Here, we report the de novo assemblies of two Basenji: a female, China, and a male, Wags. We conduct pairwise comparisons and report structural variations between assembled genomes of three dog breeds: Basenji (CanFam_Bas), Boxer (CanFam3.1) and German Shepherd Dog (GSD) (CanFam_GSD). We then align representative whole genome sequences from 58 dog breeds and show the importance of genome reference when assessing variation among dog breeds.

Results Here we present two high quality Basenji genome assemblies, CanFam_Bas (China) and Wags. CanFam_Bas is superior to CanFam v3,1 is terms of genome contiguity and comparable overall to the high quality CanFam_GSD assembly. The increasing number of available canid reference genomes allows us to examine the impact the choice of reference genome makes with regard to reference genome quality and breed relatedness. By aligning short read data from 58 representative dog breeds to three reference genomes, we demonstrate how the choice of reference genome significantly impacts both read mapping and variant detection. Further, we generate a conservative list of structural variant calls using a consensus of both Pacific Bioscience and Oxford Nanopore long reads to identify large structural breed differences. Collectively this work highlights the importance the choice of reference genome makes in canid variation studies.

Conclusions The growing number of high-quality canid reference genomes means the choice of reference genome is an increasingly critical decision in subsequent canid variant analyses. The basal position of the Basenji makes it suitable for variant analysis for targeted applications of specific dog breeds. However, as is increasingly being employed in other model organisms, we believe more comprehensive analyses across the entire family of canids is more suited to a pangenome approach.

Competing Interest Statement

JF has received travel and accommodation expenses to speak at Oxford Nanopore Technologies conferences. Otherwise, the authors declare that the submitted work was carried out in the absence of any professional or financial relationships that could potentially be construed as a conflict of interest.

Footnotes

  • http://www.slimsuite.unsw.edu.au/research/basenji/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted November 11, 2020.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Chromosome-length genome assembly and structural variations of the primal Basenji dog (Canis lupus familiaris) genome
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Chromosome-length genome assembly and structural variations of the primal Basenji dog (Canis lupus familiaris) genome
Richard J. Edwards, Matt A. Field, James M. Ferguson, Olga Dudchenko, Jens Keilwagen, Benjamin D. Rosen, Gary S. Johnson, Edward Rice, LaDeanna Hillier, Jillian M. Hammond, Samuel G. Towarnicki, Arina Omer, Ksenia Skvortsova, Ozren Bogdanovic, Robert A. Zammit, Erez Lieberman Aiden, Wesley C. Warren, J. William O. Ballard
bioRxiv 2020.11.11.379073; doi: https://doi.org/10.1101/2020.11.11.379073
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Chromosome-length genome assembly and structural variations of the primal Basenji dog (Canis lupus familiaris) genome
Richard J. Edwards, Matt A. Field, James M. Ferguson, Olga Dudchenko, Jens Keilwagen, Benjamin D. Rosen, Gary S. Johnson, Edward Rice, LaDeanna Hillier, Jillian M. Hammond, Samuel G. Towarnicki, Arina Omer, Ksenia Skvortsova, Ozren Bogdanovic, Robert A. Zammit, Erez Lieberman Aiden, Wesley C. Warren, J. William O. Ballard
bioRxiv 2020.11.11.379073; doi: https://doi.org/10.1101/2020.11.11.379073

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3688)
  • Biochemistry (7783)
  • Bioengineering (5673)
  • Bioinformatics (21267)
  • Biophysics (10574)
  • Cancer Biology (8170)
  • Cell Biology (11929)
  • Clinical Trials (138)
  • Developmental Biology (6757)
  • Ecology (10394)
  • Epidemiology (2065)
  • Evolutionary Biology (13853)
  • Genetics (9702)
  • Genomics (13063)
  • Immunology (8136)
  • Microbiology (19976)
  • Molecular Biology (7841)
  • Neuroscience (43032)
  • Paleontology (318)
  • Pathology (1278)
  • Pharmacology and Toxicology (2258)
  • Physiology (3350)
  • Plant Biology (7221)
  • Scientific Communication and Education (1311)
  • Synthetic Biology (2000)
  • Systems Biology (5533)
  • Zoology (1127)