Next generation sequencing provides rapid access to the genome of Puccinia striiformis f. sp. tritici, the causal agent of wheat stripe rust

PLoS One. 2011;6(8):e24230. doi: 10.1371/journal.pone.0024230. Epub 2011 Aug 31.

Abstract

Background: The wheat stripe rust fungus (Puccinia striiformis f. sp. tritici, PST) is responsible for significant yield losses in wheat production worldwide. In spite of its economic importance, the PST genomic sequence is not currently available. Fortunately, Next Generation Sequencing (NGS) has radically improved sequencing speed and efficiency and greatly reduced costs compared with traditional sequencing technologies. We used Illumina sequencing to rapidly access the genomic sequence of the highly virulent PST race 130 (PST-130).

Methodology/principal findings: We obtained nearly 80 million high-quality paired-end reads (>50x coverage) that were assembled into 29,178 contigs (64.8 Mb), which provide an estimated coverage of at least 88% of the PST genes and are available through GenBank. Extensive micro-synteny with the Puccinia graminis f. sp. tritici (PGTG) genome and high sequence similarity with annotated PGTG genes support the quality of the PST-130 contigs. We characterized the transposable elements present in the PST-130 contigs and, using an ab initio gene prediction program, identified and tentatively annotated 22,815 putative coding sequences. We provide examples of the use of comparative approaches to improve gene annotation for both PST and PGTG and to identify candidate effectors. Finally, the assembled contigs provided an inventory of PST repetitive elements, which were annotated and deposited in Repbase.
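
The assembly figures quoted above can be related to one another with some simple arithmetic. The following is a minimal sketch, not the authors' method: the function names are hypothetical, the read length is not stated in the abstract and is therefore left as a parameter, and the assembled size (64.8 Mb) is used as a stand-in for the true genome size, which the abstract does not give.

```python
# Figures quoted in the abstract.
N_CONTIGS = 29_178       # number of assembled contigs
ASSEMBLY_BP = 64.8e6     # total assembled length in bp (64.8 Mb)
N_READS = 80e6           # "nearly 80 million" reads, so treat as an upper bound


def mean_contig_length(total_bp, n_contigs):
    """Average contig length in bp."""
    return total_bp / n_contigs


def fold_coverage(n_reads, read_length_bp, target_bp):
    """Fold coverage = total sequenced bases / target size."""
    return n_reads * read_length_bp / target_bp


def min_read_length(n_reads, target_bp, coverage):
    """Read length (bp) needed to reach a given fold coverage."""
    return coverage * target_bp / n_reads


if __name__ == "__main__":
    # Mean contig length implied by the abstract (about 2,221 bp).
    print(round(mean_contig_length(ASSEMBLY_BP, N_CONTIGS)))

    # ASSUMPTION: assembly size used in place of genome size.
    # Read length at which 80 million reads would give 50x (40.5 bp).
    print(min_read_length(N_READS, ASSEMBLY_BP, 50))
```

Under these assumptions the contigs average roughly 2.2 kb, and any mean read length above about 40 bp would be consistent with the reported >50x coverage. A larger true genome size would raise that threshold proportionally.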

Conclusions/significance: The assembly of the PST-130 genome and the predicted proteins provide useful resources to rapidly identify and clone PST genes and their regulatory regions. Although automatic gene prediction has limitations, we show that a comparative genomics approach using multiple rust species can greatly improve the quality of gene annotation in these species. The PST-130 sequence will also be useful for comparative studies within PST as more races are sequenced. This study illustrates the power of NGS for rapid and efficient access to genomic sequence in non-model organisms.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Basidiomycota / genetics*
  • Contig Mapping
  • DNA Transposable Elements / genetics
  • DNA, Fungal / genetics
  • Ergosterol / biosynthesis
  • Fungal Proteins / metabolism
  • Genes, Fungal / genetics
  • Genome, Fungal / genetics*
  • Molecular Sequence Data
  • Plant Diseases / microbiology*
  • Repetitive Sequences, Nucleic Acid / genetics
  • Sequence Analysis, DNA / methods*
  • Synteny / genetics
  • Triticum / microbiology*

Substances

  • DNA Transposable Elements
  • DNA, Fungal
  • Fungal Proteins
  • Ergosterol

Associated data

  • GENBANK/HQ698552
  • GENBANK/HQ698553
  • GENBANK/HQ698554
  • GENBANK/HQ698555
  • GENBANK/HQ698556
  • GENBANK/HQ698557
  • GENBANK/HQ698558
  • GENBANK/HQ698559
  • GENBANK/HQ698560
  • GENBANK/HQ698561
  • GENBANK/JN033203
  • GENBANK/JN033204
  • GENBANK/JN033205
  • GENBANK/JN033206
  • GENBANK/JN033207
  • GENBANK/JN033208
  • GENBANK/JN033209
  • GENBANK/JN033210
  • GENBANK/JN033211