Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Assessment of the performance of different hidden Markov models for imputation in animal breeding

Andrew Whalen, Gregor Gorjanc, View ORCID ProfileRoger Ros-Freixedes, John M Hickey
doi: https://doi.org/10.1101/227157
Andrew Whalen
The Roslin Institute, University of Edinburgh
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: awhalen@gmail.com
Gregor Gorjanc
The Roslin Institute, University of Edinburgh
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Roger Ros-Freixedes
The Roslin Institute, University of Edinburgh
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Roger Ros-Freixedes
John M Hickey
The Roslin Institute, University of Edinburgh
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

In this paper we review the performance of various hidden Markov model-based imputation methods in animal breeding populations. Traditionally, heuristic-based imputation methods have been used for imputation in large animal populations due to their computational efficiency, scalability, and accuracy. However, recent advances in the area of human genetics have increased the ability of probabilistic hidden Markov model methods to perform accurate phasing and imputation in large populations. These advances may enable these methods to be useful for routine use in large animal populations. To test this, we evaluate here the accuracy and computational cost of several methods in a series of simulated populations and a real animal population. We first tested single-step (diploid) imputation, which performs both phasing and imputation. Then we tested pre-phasing followed by haploid imputation. We tested four diploid imputation methods (fastPHASE, Beagle v4.0, IMPUTE2, and MaCH), three phasing methods, (SHAPEIT2, HAPI-UR, and Eagle2), and three haploid imputation methods (IMPUTE2, Beagle v4.1, and minimac3). We found that performing pre-phasing and haploid imputation was faster and more accurate than diploid imputation. In particular, we found that pre-phasing with Eagle2 or HAPI-UR and imputing with minimac3 or IMPUTE2 gave the highest accuracies in both simulated and real data.

Copyright 
The copyright holder for this preprint is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
  • Posted November 30, 2017.

Download PDF

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Assessment of the performance of different hidden Markov models for imputation in animal breeding
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
Share
Assessment of the performance of different hidden Markov models for imputation in animal breeding
Andrew Whalen, Gregor Gorjanc, Roger Ros-Freixedes, John M Hickey
bioRxiv 227157; doi: https://doi.org/10.1101/227157
del.icio.us logo Digg logo Reddit logo Technorati logo Twitter logo CiteULike logo Connotea logo Facebook logo Google logo Mendeley logo
Citation Tools
Assessment of the performance of different hidden Markov models for imputation in animal breeding
Andrew Whalen, Gregor Gorjanc, Roger Ros-Freixedes, John M Hickey
bioRxiv 227157; doi: https://doi.org/10.1101/227157

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (546)
  • Biochemistry (746)
  • Bioengineering (449)
  • Bioinformatics (4351)
  • Biophysics (1331)
  • Cancer Biology (899)
  • Cell Biology (1264)
  • Clinical Trials (43)
  • Developmental Biology (850)
  • Ecology (1460)
  • Epidemiology (705)
  • Evolutionary Biology (3444)
  • Genetics (2337)
  • Genomics (3025)
  • Immunology (486)
  • Microbiology (1951)
  • Molecular Biology (762)
  • Neuroscience (5795)
  • Paleontology (36)
  • Pathology (110)
  • Pharmacology and Toxicology (185)
  • Physiology (241)
  • Plant Biology (811)
  • Scientific Communication and Education (225)
  • Synthetic Biology (352)
  • Systems Biology (1197)
  • Zoology (148)