Mapping and Initial Analysis of Human Subtelomeric Sequence Assemblies

  1. Harold Riethman1,4,
  2. Anthony Ambrosini1,
  3. Carlos Castaneda1,2,
  4. Jeffrey Finklestein1,3,
  5. Xue-Lan Hu1,
  6. Uma Mudunuri1,
  7. Sheila Paul1, and
  8. Jun Wei1
  1. 1 The Wistar Institute, Philadelphia, Pennsylvania 19104, USA

Abstract

Physical mapping data were combined with public draft and finished sequences to derive subtelomeric sequence assemblies for each of the 41 genetically distinct human telomere regions. Sequence gaps that remain on the reference telomeres are generally small,well-defined,and for the most part,restricted to regions directly adjacent to the terminal (TTAGGG)n tract. Of the 20.66 Mb of subtelomeric DNA analyzed, 3.01 Mb are subtelomeric repeat sequences (Srpt),and an additional 2.11 Mb are segmental duplications. The subtelomeric sequence assemblies are enriched >25-fold in short,internal (TTAGGG)n-like sequences relative to the rest of the genome; a total of 114 (TTAGGG)n-like islands were found,55 within Srpt regions,35 within one-copy regions,11 at one-copy/Srpt or Srpt/segmental duplication boundaries,and 13 at the telomeric ends of assemblies. Transcripts were annotated in each assembly,noting their mapping coordinates relative to their respective telomere and whether they originate in duplicated DNA or single-copy DNA. A total of 697 transcripts were found in 15.53 Mb of one-copy DNA,76 transcripts in 2.11 Mb of segmentally duplicated DNA,and 168 transcripts in 3.01 Mb of Srpt sequence. This overall transcript density is similar (within ∼10%) to that found genome-wide. Zinc finger-containing genes and olfactory receptor genes are duplicated within and between multiple telomere regions.

Footnotes

  • [Supplemental material is available online at www.genome.org. Detailed maps,subtelomeric assemblies (FASTA format),and transcript annotations are also available at our laboratory Web site (http://www.wistar. upenn.edu/Riethman.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1245004.

  • 4 Corresponding author. E-MAIL Riethman{at}wistar.upenn.edu; FAX (215) 898-3868.

  • 2 Present Address: Program in Molecular Biophysics, Johns Hopkins University, Philadelphia, PA 19104, USA;

  • 3 Present Address: Cell and Molecular Biology Program, University of Pennsylvania, Baltimore, MD 21218, USA.

    • Accepted November 4, 2003.
    • Received February 6, 2003.
| Table of Contents

Preprint Server