TY - JOUR T1 - Genome-wide association study in two cohorts from a multi-generational mouse advanced intercross line highlights the difficulty of replication JF - bioRxiv DO - 10.1101/387613 SP - 387613 AU - Xinzhu Zhou AU - Celine L. St. Pierre AU - Natalia M. Gonzales AU - Riyan Cheng AU - Apurva Chitre AU - Greta Sokoloff AU - Abraham A. Palmer Y1 - 2019/01/01 UR - http://biorxiv.org/content/early/2019/05/21/387613.abstract N2 - Replication is considered to be critical for genome-wide association studies (GWAS) in humans, but is not routinely performed in model organisms. We explored replication using an advanced intercross line (AIL) which is the simplest possible multigenerational intercross. We re-genotyped a previously published cohort of LG/J x SM/J AIL mice (F34; n=428) using a denser marker set and also genotyped a novel cohort of AIL mice (F39-43; n=600) for the first time. We identified 110 significant loci in the F34 cohort, 36 of which were new discoveries attributable to the denser marker set; we also identified 27 novel significant loci in the F39-43 cohort. For traits measured in both cohorts (locomotor activity, body weight, and coat color), the genetic correlations were high, although, the F39-43 cohort showed systematically lower SNP-heritability estimates. We then attempted to replicate loci identified in either F34 or F39-43 in the other cohort. Albino coat color was robustly replicated; we observed only partial replication of associations for locomotor activity and body weight. Finally, we performed a mega-analysis of locomotor activity and body weight by combining F34 and F39-43 cohorts (n=1,028), which identified four novel loci. The incomplete replication was inconsistent with simulations we performed to estimate our power to replicate. This may reflect: 1) false positives errors in the discovery cohort, 2) environmental or genetic heterogeneity between the two samples, or 3) the systematic over estimation of the effect sizes at significant loci (“Winner’s Curse”). Our results demonstrate that it is difficult to replicate GWAS results even when using similarly sized discovery and replication cohorts drawn from the same population. ER -