Abstract
Although Siberia was inhabited by modern humans at an early stage, there is still debate over whether this area remained habitable during the extremely cold period of the Last Glacial Maximum or whether it was subsequently repopulated by peoples with a recent shared ancestry. Previous studies of the genetic history of Siberian populations were hampered by the extensive admixture that appears to have taken place among these populations, since commonly used methods assume a tree-like population history and at most single admixture events. We therefore developed a new method based on the covariance of ancestry components, which we validated with simulated data, in order to investigate this potentially complex admixture history and to distinguish the effects of shared ancestry from prehistoric migrations and contact. We furthermore adapted a previously devised method of admixture dating for use with multiple events of gene flow, and applied these methods to whole-genome genotype data from over 500 individuals belonging to 20 different Siberian ethnolinguistic groups. The results of these analyses indicate that there have indeed been multiple layers of admixture detectable in most of the Siberian populations, with considerable differences in the admixture histories of individual populations, and with the earliest events dated to not more than 4500 years ago. Furthermore, most of the populations of Siberia included here, even those settled far to the north, can be shown to have a southern origin. These results provide support for a recent population replacement in this region, with the northward expansions of different populations possibly being driven partly by the advent of pastoralism, especially reindeer domestication. These newly developed methods to analyse multiple admixture events should aid in the investigation of similarly complex population histories elsewhere.
Author Summary We developed a new method that lets us disentangle complex admixture events and date the individual episodes of gene flow. We applied this method to a large dataset of genome-wide SNP genotypes from a linguistically diverse sample of populations covering the extent of Siberia. Our results demonstrate that these populations have been in intense contact with each other, with most of them carrying more than two, and up to six different ancestries. Nevertheless, as shown by the dates we estimate, these various admixture events have taken place over a relatively short time, with the oldest dating to not more than 4500 years ago. Our results emphasize the enormous impact that contact has had on the genetic history of these Siberian populations, and the methods developed here will be of use in the investigation of admixture events in populations around the world.