TY - JOUR
T1 - Inferring the ancestry of parents and grandparents from genetic data
JF - bioRxiv
DO - 10.1101/308494
SP - 308494
AU - Pei, Jingwen
AU - Nielsen, Rasmus
AU - Wu, Yufeng
Y1 - 2018/01/01
UR - http://biorxiv.org/content/early/2018/04/27/308494.abstract
N2 - Inference of admixture proportions is a classical statistical problem in population genetics. Standard methods implicitly assume that both parents of an individual have the same admixture fraction. However, this is rarely the case in real data. In this paper, we show that the distribution of admixture tract lengths in a genome contains information about the admixture proportions of the ancestors of an individual. We develop a Hidden Markov Model (HMM) framework for estimating the admixture proportions of the immediate ancestors of an individual, i.e., a type of splitting of an individualâ€™s admixture proportions into further subsets of ancestral proportions in the ancestors. Based on a genealogical model for admixture tracts, we develop an efficient algorithm for computing the sampling probability of the genome from a single individual as a function of the admixture proportions of the ancestors of this individual. This allows us to perform probabilistic inference of admixture proportions of ancestors, using only the genome of an extant individual. We perform extensive simulations to quantify the error in the estimation of ancestral admixture proportions under various conditions. As an illustration, we also apply the method on real data from the 1000 Genomes Project.
ER -