Multiple multilocus DNA barcodes from the plastid genome discriminate plant species equally well

PLoS One. 2008 Jul 30;3(7):e2802. doi: 10.1371/journal.pone.0002802.

Abstract

A universal barcode system for land plants would be a valuable resource, with potential utility in fields as diverse as ecology, floristics, law enforcement and industry. However, the application of plant barcoding has been constrained by a lack of consensus regarding the most variable and technically practical DNA region(s). We compared eight candidate plant barcoding regions from the plastome and one from the mitochondrial genome for how well they discriminated the monophyly of 92 species in 32 diverse genera of land plants (N = 251 samples). The plastid markers comprise portions of five coding (rpoB, rpoC1, rbcL, matK and 23S rDNA) and three non-coding (trnH-psbA, atpF-atpH, and psbK-psbI) loci. Our survey included several taxonomically complex groups, and in all cases we examined multiple populations and species. The regions differed in their ability to discriminate species, and in ease of retrieval, in terms of amplification and sequencing success. Single locus resolution ranged from 7% (23S rDNA) to 59% (trnH-psbA) of species with well-supported monophyly. Sequence recovery rates were related primarily to amplification success (85-100% for plastid loci), with matK requiring the greatest effort to achieve reasonable recovery (88% using 10 primer pairs). Several loci (matK, psbK-psbI, trnH-psbA) were problematic for generating fully bidirectional sequences. Setting aside technical issues related to amplification and sequencing, combining the more variable plastid markers provided clear benefits for resolving species, although with diminishing returns, as all combinations assessed using four to seven regions had only marginally different success rates (69-71%; values that were approached by several two- and three-region combinations). This performance plateau may indicate fundamental upper limits on the precision of species discrimination that is possible with DNA barcoding systems that include moderate numbers of plastid markers. Resolution to the contentious debate on plant barcoding should therefore involve increased attention to practical issues related to the ease of sequence recovery, global alignability, and marker redundancy in multilocus plant DNA barcoding systems.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • DNA, Chloroplast / metabolism
  • DNA, Mitochondrial / genetics
  • DNA, Plant / chemistry
  • DNA, Ribosomal / chemistry
  • Genes, Plant
  • Genetic Markers
  • Genetic Variation
  • Genome, Plant*
  • Models, Genetic*
  • Plants / metabolism
  • Plastids / chemistry*
  • Plastids / metabolism
  • Reproducibility of Results
  • Sequence Analysis, DNA
  • Species Specificity

Substances

  • DNA, Chloroplast
  • DNA, Mitochondrial
  • DNA, Plant
  • DNA, Ribosomal
  • Genetic Markers