Islands of retroelements are major components of Drosophila centromeres

PLoS Biol. 2019 May 14;17(5):e3000241. doi: 10.1371/journal.pbio.3000241. eCollection 2019 May.

Abstract

Centromeres are essential chromosomal regions that mediate kinetochore assembly and spindle attachments during cell division. Despite their functional conservation, centromeres are among the most rapidly evolving genomic regions and can shape karyotype evolution and speciation across taxa. Although significant progress has been made in identifying centromere-associated proteins, the highly repetitive centromeres of metazoans have been refractory to DNA sequencing and assembly, leaving large gaps in our understanding of their functional organization and evolution. Here, we identify the sequence composition and organization of the centromeres of Drosophila melanogaster by combining long-read sequencing, chromatin immunoprecipitation for the centromeric histone CENP-A, and high-resolution chromatin fiber imaging. Contrary to previous models that heralded satellite repeats as the major functional components, we demonstrate that functional centromeres form on islands of complex DNA sequences enriched in retroelements that are flanked by large arrays of satellite repeats. Each centromere displays distinct size and arrangement of its DNA elements but is similar in composition overall. We discover that a specific retroelement, G2/Jockey-3, is the most highly enriched sequence in CENP-A chromatin and is the only element shared among all centromeres. G2/Jockey-3 is also associated with CENP-A in the sister species D. simulans, revealing an unexpected conservation despite the reported turnover of centromeric satellite DNA. Our work reveals the DNA sequence identity of the active centromeres of a premier model organism and implicates retroelements as conserved features of centromeric DNA.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Centromere / genetics*
  • Centromere Protein A / genetics
  • Chromatin / metabolism
  • DNA Transposable Elements / genetics
  • DNA, Satellite / genetics
  • Drosophila / embryology
  • Drosophila / genetics*
  • Drosophila Proteins / genetics
  • Embryo, Nonmammalian / metabolism
  • Genome, Insect
  • Retroelements / genetics*
  • Terminal Repeat Sequences / genetics

Substances

  • Centromere Protein A
  • Chromatin
  • Cid protein, Drosophila
  • DNA Transposable Elements
  • DNA, Satellite
  • Drosophila Proteins
  • Retroelements

Associated data

  • Dryad/10.5061/dryad.rb1bt3j