TY - JOUR T1 - Multivariate association between single-nucleotide polymorphisms in Alzgene linkage regions and structural changes in the brain: discovery, refinement and validation JF - bioRxiv DO - 10.1101/088310 SP - 088310 AU - Elena Szefer AU - Donghuan Lu AU - Farouk Nathoo AU - Mirza Faisal Beg AU - Jinko Graham AU - for the Alzheimers Disease Neuroimaging Initiative † Y1 - 2017/01/01 UR - http://biorxiv.org/content/early/2017/08/02/088310.abstract N2 - Both genetic variants and brain region abnormalities are recognized to play a role in cognitive decline. We explore the association between single-nucleotide polymorphisms (SNPs) in linkage regions for Alzheimer’s disease and rates of decline in brain structure using data from the Alzheimers Disease Neuroimaging Initiative (ADNI).In an initial discovery stage, we assessed the presence of linear association between the minor allele counts of 75,845 SNPs in the Alzgene linkage regions and predicted rates of change in structural MRI measurements for 56 brain regions using an RV test. In a second, refinement stage, we reduced the number of SNPs using a bootstrap-enhanced sparse canonical correlation analysis (SCCA) with a fixed tuning parameter. Each SNP was assigned an importance measure proportional to the number of times it was estimated to have a nonzero coefficient in repeated re-sampling from the ADNI-1 sample. We created refined lists of SNPs based on importance probabilities greater than 50% and 90%, respectively. In a third, validation stage, we assessed the multivariate association between these refined lists of SNPs and the rates of structural change in the independent ADNI-2 study dataset.There was strong statistical evidence for linear association between the SNPs in the Alzgene linkage regions and the 56 imaging phenotypes in both the ADNI-1 and ADNI-2 samples (p < 0.0001). The bootstrap-enhanced SCCA identified 1,694 priority SNPs with importance probabilities > 50% and 22 SNPs with importance probabilities > 90%. The 1,694 prioritized SNPs in the ADNI-1 data were associated with imaging phenotypes in the ADNI-2 data (p = 0.0021).This manuscript presents an analysis that addresses challenges in current imaging genetics studies such as biased sampling designs and high-dimensional data with low-signal. Genes corresponding to priority SNPs having the highest contribution in the validation data have previously been implicated or hypothesized to be implicated in AD, including GCLC, IDE, and STAMBP1andFAS. We hypothesize that the effect sizes of the 1,694 SNPs in the priority set are likely small, but further investigation within this set may advance understanding of the missing heritability in late-onset Alzheimers disease. Multivariate analysis; Linkage regions; Imaging genetics; Endophenotypes; Inverse probability weighting; Variable importance probabilities ER -