RT Journal Article SR Electronic T1 On the Number of Siblings and p-th Cousins in a Large Population Sample JF bioRxiv FD Cold Spring Harbor Laboratory SP 145599 DO 10.1101/145599 A1 Vladimir Shchur A1 Rasmus Nielsen YR 2017 UL http://biorxiv.org/content/early/2017/06/02/145599.abstract AB The number of individuals in a random sample with close relatives in the sample is a quantity of interest when designing Genome Wide Association Studies (GWAS) and other cohort based genetic, and non-genetic, studies. In this paper, we develop expressions for the distribution and expectation of the number of p-th cousins in sample from a population of size N under two dioecious generalizations of the Wright-Fisher model. We also develop simple asymptotic expressions for large values of N. For example, the expected proportion of individuals with at least one p-th cousin in a sample of K indivi-dals, for a non-monogamous generalization of the Wright-Fisher model, is approximately eāˆ’(22pāˆ’1)N/K. Our results show that a substantial fraction of individuals in the sample will have at least a second cousin if the sampling fraction (K/N) is on the order of 10āˆ’2. This confirms that, for large cohort samples, relatedness among individuals cannot easily be ignored.