New Results

Cyrius: accurate CYP2D6 genotyping using whole genome sequencing data

Xiao Chen, Fei Shen, Nina Gonzaludo, Alka Malhotra, Cande Rogert, Ryan J Taft, David R Bentley, Michael A Eberle
doi: https://doi.org/10.1101/2020.05.05.077966
Xiao Chen
1 Illumina Inc., 5200 Illumina Way, San Diego, CA, USA
Fei Shen
1 Illumina Inc., 5200 Illumina Way, San Diego, CA, USA
Nina Gonzaludo
1 Illumina Inc., 5200 Illumina Way, San Diego, CA, USA
Alka Malhotra
1 Illumina Inc., 5200 Illumina Way, San Diego, CA, USA
Cande Rogert
1 Illumina Inc., 5200 Illumina Way, San Diego, CA, USA
Ryan J Taft
1 Illumina Inc., 5200 Illumina Way, San Diego, CA, USA
David R Bentley
2 Illumina Cambridge Ltd., Illumina Centre, 19 Granta Park, Great Abington, Cambridge, UK
Michael A Eberle
1 Illumina Inc., 5200 Illumina Way, San Diego, CA, USA
For correspondence: meberle@illumina.com

Abstract

Responsible for the metabolism of 25% of all drugs, CYP2D6 is a critical component of personalized medicine initiatives. Genotyping CYP2D6 is challenging because of its sequence similarity with its pseudogene paralog CYP2D7 and the high number and variety of common structural variants (SVs). Here we describe a novel bioinformatics method, Cyrius, that accurately genotypes CYP2D6 using whole-genome sequencing (WGS) data. Using a validation data set consisting of reference samples with diverse genotypes as well as PacBio long-read data, we show that Cyrius outperforms existing methods (96.5% concordance with truth genotypes, compared with 83.8-86.6%). After we implemented improvements identified from the comparison against the truth data, Cyrius’s accuracy rose to 99.3%. Using Cyrius, we built a haplotype frequency database from 2504 ethnically diverse samples and estimate that SV-containing star alleles are more frequent than previously reported. Cyrius will be a useful tool for pharmacogenomics applications with WGS and will help bring the promise of precision medicine one step closer to reality.
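
The concordance figures quoted in the abstract are the fraction of samples whose called genotype matches the truth genotype. The sketch below illustrates one way such a fraction could be computed; it is not the authors' evaluation code. The function names, the `*1/*4` diplotype string format, the toy sample IDs, and the choice to count a missing call as a mismatch are all assumptions made for illustration.

```python
from typing import Dict, Tuple


def normalize(diplotype: str) -> Tuple[str, ...]:
    """Return an order-insensitive form of a diplotype string such as '*1/*4'.

    Assumption: the two haplotypes are separated by '/', and their order
    carries no meaning, so '*4/*1' and '*1/*4' are the same genotype.
    """
    return tuple(sorted(part.strip() for part in diplotype.split("/")))


def concordance(calls: Dict[str, str], truth: Dict[str, str]) -> float:
    """Fraction of truth samples whose call matches the truth genotype.

    Assumption: a sample with no call counts as a mismatch.
    """
    if not truth:
        raise ValueError("truth set is empty")
    matches = sum(
        1
        for sample, expected in truth.items()
        if sample in calls and normalize(calls[sample]) == normalize(expected)
    )
    return matches / len(truth)


# Toy data, invented for illustration only (not taken from the paper).
truth = {"S1": "*1/*4", "S2": "*2/*5", "S3": "*1/*1", "S4": "*4/*10"}
calls = {"S1": "*4/*1", "S2": "*2/*5", "S3": "*1/*2", "S4": "*4/*10"}

print(f"{concordance(calls, truth):.1%}")
```

In this toy set, three of the four calls agree with the truth once haplotype order is ignored, so the script reports 75.0%.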

Competing Interest Statement

XC, FS, NG, AM, CR, RJT, DRB and MAE are employees of Illumina Inc.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted May 19, 2020.

Subject Area

  • Bioinformatics