TY - JOUR
T1 - Bayesian estimation of population size and overlap from random subsamples
JF - bioRxiv
DO - 10.1101/2021.07.06.451319
SP - 2021.07.06.451319
AU - Johnson, Erik K.
AU - Larremore, Daniel B.
Y1 - 2021/01/01
UR - http://biorxiv.org/content/early/2021/07/07/2021.07.06.451319.abstract
N2 - Counting the number of species, items, or genes that are shared between two sets is a simple calculation when sampling is complete. However, when only partial samples are available, quantifying the overlap between two sets becomes an estimation problem. Furthermore, to calculate normalized measures of Î˛-diversity, such as the Jaccard and Sorenson-Dice indices, one must also estimate the total sizes of the sets being compared. Previous efforts to address these problems have assumed knowledge of total population sizes and then used Bayesian methods to produce unbiased estimates with quantified uncertainty. Here, we address populations of unknown size and show that this produces systematically better estimatesâ€”both in terms of central estimates and quantification of uncertainty in those estimates. We further show how to use species count data to refine estimates of population size in a Bayesian joint model of populations and overlap.Competing Interest StatementThe authors have declared no competing interest.
ER -