Whole-genome sequencing of multiple Arabidopsis thaliana populations

Nat Genet. 2011 Aug 28;43(10):956-63. doi: 10.1038/ng.911.

Abstract

The plant Arabidopsis thaliana occurs naturally in many different habitats throughout Eurasia. As a foundation for identifying genetic variation contributing to adaptation to diverse environments, a 1001 Genomes Project to sequence geographically diverse A. thaliana strains has been initiated. Here we present the first phase of this project, based on population-scale sequencing of 80 strains drawn from eight regions throughout the species' native range. We describe the majority of common small-scale polymorphisms as well as many larger insertions and deletions in the A. thaliana pan-genome, their effects on gene function, and the patterns of local and global linkage among these variants. The action of processes other than spontaneous mutation is identified by comparing the spectrum of mutations that have accumulated since A. thaliana diverged from its closest relative 10 million years ago with the spectrum observed in the laboratory. Recent species-wide selective sweeps are rare, and potentially deleterious mutations are more common in marginal populations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Arabidopsis / genetics*
  • Chromosome Mapping
  • Chromosomes, Plant
  • DNA, Plant / genetics
  • Genetic Loci
  • Genetics, Population*
  • Genome, Plant*
  • Geography
  • Linkage Disequilibrium
  • Mutation
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Selection, Genetic
  • Sequence Analysis, DNA / methods*

Substances

  • DNA, Plant