TY - JOUR T1 - Mega-scale linear mixed models for genomic predictions with thousands of traits JF - bioRxiv DO - 10.1101/2020.05.26.116814 SP - 2020.05.26.116814 AU - Daniel Runcie AU - Hao Cheng AU - Lorin Crawford Y1 - 2020/01/01 UR - http://biorxiv.org/content/early/2020/05/29/2020.05.26.116814.abstract N2 - Plant breeding, like other fields in applied quantitative genetics, has embraced large-scale phenotype data as a way to rapidly and accurately create the next generations of crops. High-throughput phenotyping technologies and large-scale multi-environment breeding trials generate an unprecedented scale of data for breeders to use to make selections and crosses in breeding programs. However, the statistical foundation of multi-trait breeding is based on the multivariate linear mixed effect model, a tool notorious for its fragility when applied to more than a handful of traits. We present MegaLMM, a statistical framework and associated software package that facilitates flexible mixed model analyses on a virtually unlimited number of traits. Using three examples with real plant data, we show that MegaLMM can efficiently model the genetic architecture of thousands of traits at once while significantly improving genetic value prediction accuracy.Competing Interest StatementThe authors have declared no competing interest. ER -