PT - JOURNAL ARTICLE
AU - Gad Abraham
AU - Yixuan Qiu
AU - Michael Inouye
TI - FlashPCA2: principal component analysis of biobank-scale genotype datasets
AID - 10.1101/094714
DP - 2016 Jan 01
TA - bioRxiv
PG - 094714
4099 - http://biorxiv.org/content/early/2016/12/17/094714.short
4100 - http://biorxiv.org/content/early/2016/12/17/094714.full
AB - Motivation: Principal component analysis (PCA) is a crucial step in quality control of genomic data and a common approach for understanding population genetic structure. With the advent of large genotyping studies involving hundreds of thousands of individuals, standard approaches are no longer computationally feasible. We present FlashPCA2, a tool that can perform PCA on 1 million individuals faster than competing approaches, while requiring substantially less memory. Availability: https://github.com/gabraham/flashpca Contact: gad.abraham{at}unimelb.edu.au