Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Fine-scale Inference of Ancestry Segments without Prior Knowledge of Admixing Groups

View ORCID ProfileMichael Salter-Townshend, View ORCID ProfileSimon Myers
doi: https://doi.org/10.1101/376137
Michael Salter-Townshend
University College Dublin;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Michael Salter-Townshend
  • For correspondence: michael.salter-townshend@ucd.ie
Simon Myers
Oxford University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Simon Myers
  • Abstract
  • Info/History
  • Metrics
  • Data Supplements
  • Preview PDF
Loading

Abstract

We present an algorithm for inferring ancestry segments and characterizing admixture events, which involve an arbitrary number of genetically differentiated groups coming together. This allows inference of the demographic history of the species, properties of admixing groups, identification of signatures of natural selection, and may aid disease gene mapping. The algorithm employs nested hidden Markov models to obtain local ancestry estimation along the genome for each admixed individual. In a range of simulations, the accuracy of these estimates equals or exceeds leading existing methods that return local ancestry. Moreover, and unlike these approaches, we do not require any prior knowledge of the relationship between sub-groups of donor reference haplotypes and the unseen mixing ancestral populations. Instead, our approach infers these in terms of conditional "copying probabilities". In application to the Human Genome Diversity Panel we corroborate many previously inferred admixture events (e.g. an ancient admixture event in the Kalash). We further identify novel events such as complex 4-way admixture in San-Khomani individuals, and show that Eastern European populations possess 1-5% ancestry from a group resembling modern-day central Asians. We also identify evidence of recent natural selection favouring sub-Saharan ancestry at the HLA region, across North African individuals. We make available an R and C ++ software library, which we term MOSAIC (which stands for MOSAIC Organises Segments of Ancestry In Chromosomes).

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
  • Posted July 25, 2018.

Download PDF

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Fine-scale Inference of Ancestry Segments without Prior Knowledge of Admixing Groups
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
Share
Fine-scale Inference of Ancestry Segments without Prior Knowledge of Admixing Groups
Michael Salter-Townshend, Simon Myers
bioRxiv 376137; doi: https://doi.org/10.1101/376137
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
Fine-scale Inference of Ancestry Segments without Prior Knowledge of Admixing Groups
Michael Salter-Townshend, Simon Myers
bioRxiv 376137; doi: https://doi.org/10.1101/376137

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (814)
  • Biochemistry (1124)
  • Bioengineering (716)
  • Bioinformatics (5722)
  • Biophysics (1943)
  • Cancer Biology (1381)
  • Cell Biology (1957)
  • Clinical Trials (71)
  • Developmental Biology (1337)
  • Ecology (2048)
  • Epidemiology (1096)
  • Evolutionary Biology (4331)
  • Genetics (3042)
  • Genomics (3923)
  • Immunology (836)
  • Microbiology (3289)
  • Molecular Biology (1220)
  • Neuroscience (8382)
  • Paleontology (62)
  • Pathology (169)
  • Pharmacology and Toxicology (304)
  • Physiology (401)
  • Plant Biology (1138)
  • Scientific Communication and Education (318)
  • Synthetic Biology (469)
  • Systems Biology (1596)
  • Zoology (210)