ABSTRACT
Plant breeding, like other fields in applied quantitative genetics, has embraced large-scale phenotype data as a way to rapidly and accurately create the next generations of crops. High-throughput phenotyping technologies and large-scale multi-environment breeding trials generate an unprecedented scale of data for breeders to use to make selections and crosses in breeding programs. However, the statistical foundation of multi-trait breeding is based on the multivariate linear mixed effect model, a tool notorious for its fragility when applied to more than a handful of traits. We present MegaLMM, a statistical framework and associated software package that facilitates flexible mixed model analyses on a virtually unlimited number of traits. Using three examples with real plant data, we show that MegaLMM can efficiently model the genetic architecture of thousands of traits at once while significantly improving genetic value prediction accuracy.
Competing Interest Statement
The authors have declared no competing interest.