Experimental annotation of the human pathogen Candida albicans coding and noncoding transcribed regions using high-resolution tiling arrays

Genome Biol. 2010;11(7):R71. doi: 10.1186/gb-2010-11-7-r71. Epub 2010 Jul 9.

Abstract

Background: Compared to other model organisms and despite the clinical relevance of the pathogenic yeast Candida albicans, no comprehensive analysis has been done to provide experimental support of its in silico-based genome annotation.

Results: We have undertaken a genome-wide experimental annotation to accurately uncover the transcriptional landscape of the pathogenic yeast C. albicans using strand-specific high-density tiling arrays. RNAs were purified from cells growing under conditions relevant to C. albicans pathogenicity, including biofilm, lab-grown yeast and serum-induced hyphae, as well as cells isolated from the mouse caecum. This work provides a genome-wide experimental validation for a large number of predicted ORFs for which transcription had not been detected by other approaches. Additionally, we identified more than 2,000 novel transcriptional segments, including new ORFs and exons, non-coding RNAs (ncRNAs) as well as convincing cases of antisense gene transcription. We also characterized the 5' and 3' UTRs of expressed ORFs, and established that genes with long 5' UTRs are significantly enriched in regulatory functions controlling filamentous growth. Furthermore, we found that genomic regions adjacent to telomeres harbor a cluster of expressed ncRNAs. To validate and confirm new ncRNA candidates, we adapted an iterative strategy combining both genome-wide occupancy of the different subunits of RNA polymerases I, II and III and expression data. This comprehensive approach allowed the identification of different families of ncRNAs.

Conclusions: In summary, we provide a comprehensive expression atlas that covers relevant C. albicans pathogenic developmental stages in addition to the discovery of new ORF and non-coding genetic elements.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions / genetics
  • 5' Untranslated Regions / genetics
  • Candida albicans / genetics*
  • Candida albicans / growth & development
  • Chromosomes, Fungal / genetics
  • DNA, Fungal / genetics*
  • DNA, Intergenic / genetics*
  • DNA-Directed RNA Polymerases / metabolism
  • Gene Expression Profiling
  • Gene Expression Regulation, Fungal
  • Genome, Fungal / genetics
  • Humans
  • Molecular Sequence Annotation*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Open Reading Frames / genetics*
  • Pseudogenes / genetics
  • RNA, Antisense / genetics
  • RNA, Fungal / genetics
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • RNA, Untranslated / genetics
  • Telomere / metabolism
  • Transcription, Genetic*

Substances

  • 3' Untranslated Regions
  • 5' Untranslated Regions
  • DNA, Fungal
  • DNA, Intergenic
  • RNA, Antisense
  • RNA, Fungal
  • RNA, Messenger
  • RNA, Untranslated
  • DNA-Directed RNA Polymerases