Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

From telomere to telomere: the transcriptional and epigenetic state of human repeat elements

Savannah J. Hoyt, Jessica M. Storer, Gabrielle A. Hartley, Patrick G. S. Grady, Ariel Gershman, Leonardo G. de Lima, Charles Limouse, Reza Halabian, Luke Wojenski, Matias Rodriguez, View ORCID ProfileNicolas Altemose, View ORCID ProfileLeighton J. Core, Jennifer L. Gerton, View ORCID ProfileWojciech Makalowski, Daniel Olson, Jeb Rosen, Arian F. A. Smit, View ORCID ProfileAaron F. Straight, View ORCID ProfileMitchell R. Vollger, View ORCID ProfileTravis J. Wheeler, Michael C. Schatz, Evan E. Eichler, Adam M. Phillippy, View ORCID ProfileWinston Timp, Karen H. Miga, View ORCID ProfileRachel J. O’Neill
doi: https://doi.org/10.1101/2021.07.12.451456
Savannah J. Hoyt
1Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jessica M. Storer
2Institute for Systems Biology, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gabrielle A. Hartley
1Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Patrick G. S. Grady
1Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ariel Gershman
3Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Leonardo G. de Lima
4Stowers Institute for Medical Research, Kansas City, MO, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Charles Limouse
5Department of Biochemistry, Stanford University, Stanford, California, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Reza Halabian
6Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Luke Wojenski
1Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matias Rodriguez
6Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nicolas Altemose
7Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nicolas Altemose
Leighton J. Core
1Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
8Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Leighton J. Core
Jennifer L. Gerton
4Stowers Institute for Medical Research, Kansas City, MO, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wojciech Makalowski
6Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Wojciech Makalowski
Daniel Olson
9Department of Computer Science, University of Montana, Missoula, MT, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeb Rosen
2Institute for Systems Biology, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Arian F. A. Smit
2Institute for Systems Biology, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Aaron F. Straight
5Department of Biochemistry, Stanford University, Stanford, California, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Aaron F. Straight
Mitchell R. Vollger
11Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mitchell R. Vollger
Travis J. Wheeler
9Department of Computer Science, University of Montana, Missoula, MT, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Travis J. Wheeler
Michael C. Schatz
10Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Evan E. Eichler
11Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
12Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Adam M. Phillippy
13Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Winston Timp
3Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
14Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Winston Timp
Karen H. Miga
15UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rachel J. O’Neill
1Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
8Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
16Department of Genetics and Genome Sciences, UConn Health, Farmington, C, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rachel J. O’Neill
  • For correspondence: rachel.oneill@uconn.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Mobile elements and highly repetitive genomic regions are potent sources of lineage-specific genomic innovation and fingerprint individual genomes. Comprehensive analyses of large, composite or arrayed repeat elements and those found in more complex regions of the genome require a complete, linear genome assembly. Here we present the first de novo repeat discovery and annotation of a complete human reference genome, T2T-CHM13v1.0. We identified novel satellite arrays, expanded the catalog of variants and families for known repeats and mobile elements, characterized new classes of complex, composite repeats, and provided comprehensive annotations of retroelement transduction events. Utilizing PRO-seq to detect nascent transcription and nanopore sequencing to delineate CpG methylation profiles, we defined the structure of transcriptionally active retroelements in humans, including for the first time those found in centromeres. Together, these data provide expanded insight into the diversity, distribution and evolution of repetitive regions that have shaped the human genome.

Competing Interest Statement

KHM has received travel funds to speak at symposia organized by Oxford Nanopore. WT has two patents (8,748,091 and 8,394,584) licensed to Oxford Nanopore Technologies. All other authors declare that they have no competing interests.

Footnotes

  • https://github.com/marbl/CHM13

  • https://www.ncbi.nlm.nih.gov/bioproject/559484

  • https://github.com/marbl/CHM13-issues

  • genome.ucsc.edu/cgi-bin/hgTracks?genome=t2t-chm13-v1.0&hubUrl= http://t2t.gi.ucsc.edu/chm13/hub/hub.txt

  • https://resgen.io/paper-data/T2T-Nurk-et-al-2021/views/t2t-identity

  • https://gitlab.com/SJHoyt/t2t_transposable-elements/Repeat_annotations/Repeatmasker_and_polishing/RepeatLibrary_NewRepeatEntries.embl

  • https://gitlab.com/SJHoyt/t2t_transposable-elements

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted July 12, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
From telomere to telomere: the transcriptional and epigenetic state of human repeat elements
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
From telomere to telomere: the transcriptional and epigenetic state of human repeat elements
Savannah J. Hoyt, Jessica M. Storer, Gabrielle A. Hartley, Patrick G. S. Grady, Ariel Gershman, Leonardo G. de Lima, Charles Limouse, Reza Halabian, Luke Wojenski, Matias Rodriguez, Nicolas Altemose, Leighton J. Core, Jennifer L. Gerton, Wojciech Makalowski, Daniel Olson, Jeb Rosen, Arian F. A. Smit, Aaron F. Straight, Mitchell R. Vollger, Travis J. Wheeler, Michael C. Schatz, Evan E. Eichler, Adam M. Phillippy, Winston Timp, Karen H. Miga, Rachel J. O’Neill
bioRxiv 2021.07.12.451456; doi: https://doi.org/10.1101/2021.07.12.451456
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
From telomere to telomere: the transcriptional and epigenetic state of human repeat elements
Savannah J. Hoyt, Jessica M. Storer, Gabrielle A. Hartley, Patrick G. S. Grady, Ariel Gershman, Leonardo G. de Lima, Charles Limouse, Reza Halabian, Luke Wojenski, Matias Rodriguez, Nicolas Altemose, Leighton J. Core, Jennifer L. Gerton, Wojciech Makalowski, Daniel Olson, Jeb Rosen, Arian F. A. Smit, Aaron F. Straight, Mitchell R. Vollger, Travis J. Wheeler, Michael C. Schatz, Evan E. Eichler, Adam M. Phillippy, Winston Timp, Karen H. Miga, Rachel J. O’Neill
bioRxiv 2021.07.12.451456; doi: https://doi.org/10.1101/2021.07.12.451456

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4229)
  • Biochemistry (9109)
  • Bioengineering (6753)
  • Bioinformatics (23944)
  • Biophysics (12103)
  • Cancer Biology (9498)
  • Cell Biology (13745)
  • Clinical Trials (138)
  • Developmental Biology (7617)
  • Ecology (11664)
  • Epidemiology (2066)
  • Evolutionary Biology (15479)
  • Genetics (10620)
  • Genomics (14297)
  • Immunology (9467)
  • Microbiology (22796)
  • Molecular Biology (9078)
  • Neuroscience (48894)
  • Paleontology (355)
  • Pathology (1479)
  • Pharmacology and Toxicology (2566)
  • Physiology (3824)
  • Plant Biology (8309)
  • Scientific Communication and Education (1467)
  • Synthetic Biology (2290)
  • Systems Biology (6172)
  • Zoology (1297)