Repeat associated mechanisms of genome evolution and function revealed by the Mus caroli and Mus pahari genomes

  1. Paul Flicek1,8
  1. 1European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom;
  2. 2Earlham Institute, Norwich Research Park, Norwich NR4 7UH, United Kingdom;
  3. 3Yale University Medical School, Computational Biology and Bioinformatics Program, New Haven, Connecticut 06520, USA;
  4. 4Department of Biomolecular Engineering, University of California, Santa Cruz, California 95064, USA;
  5. 5University of Cambridge, Cancer Research UK Cambridge Institute, Robinson Way, Cambridge CB2 0RE, United Kingdom;
  6. 6Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California 92092, USA;
  7. 7Department of Zoology, Faculty of Science, Charles University in Prague, 128 44 Prague, Czech Republic;
  8. 8Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom;
  9. 9Wellcome Trust Centre for Human Genetics, Oxford OX3 7BN, United Kingdom;
  10. 10Department of Medicine, College of Medicine, University of Arizona, Tuscon, Arizona 85724, USA;
  11. 11Institute of Mathematics and Computer Science, University of Greifswald, Greifswald 17487, Germany;
  12. 12School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, United Kingdom;
  13. 13Institut des Sciences de l'Evolution de Montpellier, Université Montpellier/CNRS, 34095 Montpellier, France;
  14. 14Bioturing Inc, San Diego, California 92121, USA
  • Corresponding authors: duncan.odom{at}cruk.cam.ac.uk, flicek{at}ebi.ac.uk
  • Abstract

    Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.234096.117.

    • Freely available online through the Genome Research Open Access option.

    • Received December 29, 2017.
    • Accepted March 5, 2018.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server