ABSTRACT
Characterization of genetic variations in maize has been challenging, mainly due to deterioration of collinearity between individual genomes in the species and the fact the B73 genome used as the reference only represents a fraction of all haplotypes. An international consortium of maize research groups combined resources to develop the maize haplotype version 3 (HapMap3), built from whole genome sequencing data from 916 maize lines, covering pre-domestication and domesticated Zea mays varieties across the world. A new computational pipeline was set up to process over 7 trillion bp of sequencing data, and a set of population genetics filters were applied to identify over 60 million variant sites in regions where collinearity is largely preserved in the maize species.