Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Phylovar: Towards scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data

Mohammadamin Edrisi, Monica V. Valecha, Sunkara B. V. Chowdary, Sergio Robledo, Huw A. Ogilvie, David Posada, Hamim Zafar, Luay Nakhleh
doi: https://doi.org/10.1101/2022.01.16.476509
Mohammadamin Edrisi
1Department of Computer Science, Rice University, Texas, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: edrisi@rice.edu nakhleh@rice.edu hamim@iitk.ac.in
Monica V. Valecha
2CINBIO, Universidade de Vigo, Vigo, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sunkara B. V. Chowdary
3Department of Computer Science & Engineering, Indian Institute of Technology Kanpur, Kanpur, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sergio Robledo
4University of Houston, Texas, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Huw A. Ogilvie
1Department of Computer Science, Rice University, Texas, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Posada
2CINBIO, Universidade de Vigo, Vigo, Spain
5Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, Vigo, Spain
6Department of Biochemistry, Genetics, and Immunology, Universidade de Vigo, Vigo, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hamim Zafar
3Department of Computer Science & Engineering, Indian Institute of Technology Kanpur, Kanpur, India
7Department of Biological Sciences & Bioengineering, Institute of Technology Kanpur, Kanpur, India
8Mehta Family Centre for Engineering in Medicine, Indian Institute of Technology Kanpur, Kanpur, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: edrisi@rice.edu nakhleh@rice.edu hamim@iitk.ac.in
Luay Nakhleh
1Department of Computer Science, Rice University, Texas, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: edrisi@rice.edu nakhleh@rice.edu hamim@iitk.ac.in
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Single-nucleotide variants (SNVs) are the most common variations in the human genome. Recently developed methods for SNV detection from single-cell DNA sequencing (scDNAseq) data, such as SCIΦ and scVILP, leverage the evolutionary history of the cells to overcome the technical errors associated with single-cell sequencing protocols. Despite being accurate, these methods are not scalable to the extensive genomic breadth of single-cell whole-genome (scWGS) and whole-exome sequencing (scWES) data.

Here we report on a new scalable method, Phylovar, which extends the phylogeny-guided variant calling approach to sequencing datasets containing millions of loci. Through benchmarking on simulated datasets under different settings, we show that, Phylovar outperforms SCIΦ in terms of running time while being more accurate than Monovar (which is not phylogeny-aware) in terms of SNV detection. Furthermore, we applied Phylovar to two real biological datasets: an scWES triple-negative breast cancer data consisting of 32 cells and 3375 loci as well as an scWGS data of neuron cells from a normal human brain containing 16 cells and approximately 2.5 million loci. For the cancer data, Phylovar detected somatic SNVs with high or moderate functional impact that were also supported by bulk sequencing dataset and for the neuron dataset, Phylovar identified 5745 SNVs with non-synonymous effects some of which were associated with neurodegenerative diseases. We implemented Phylovar and made it publicly available at https://github.com/mae6/Phylovar.git.

Competing Interest Statement

The authors have declared no competing interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted January 18, 2022.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Phylovar: Towards scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Phylovar: Towards scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data
Mohammadamin Edrisi, Monica V. Valecha, Sunkara B. V. Chowdary, Sergio Robledo, Huw A. Ogilvie, David Posada, Hamim Zafar, Luay Nakhleh
bioRxiv 2022.01.16.476509; doi: https://doi.org/10.1101/2022.01.16.476509
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Phylovar: Towards scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data
Mohammadamin Edrisi, Monica V. Valecha, Sunkara B. V. Chowdary, Sergio Robledo, Huw A. Ogilvie, David Posada, Hamim Zafar, Luay Nakhleh
bioRxiv 2022.01.16.476509; doi: https://doi.org/10.1101/2022.01.16.476509

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Cancer Biology
Subject Areas
All Articles
  • Animal Behavior and Cognition (3513)
  • Biochemistry (7358)
  • Bioengineering (5334)
  • Bioinformatics (20290)
  • Biophysics (10032)
  • Cancer Biology (7753)
  • Cell Biology (11323)
  • Clinical Trials (138)
  • Developmental Biology (6442)
  • Ecology (9962)
  • Epidemiology (2065)
  • Evolutionary Biology (13340)
  • Genetics (9363)
  • Genomics (12594)
  • Immunology (7717)
  • Microbiology (19055)
  • Molecular Biology (7452)
  • Neuroscience (41085)
  • Paleontology (300)
  • Pathology (1232)
  • Pharmacology and Toxicology (2140)
  • Physiology (3169)
  • Plant Biology (6867)
  • Scientific Communication and Education (1275)
  • Synthetic Biology (1899)
  • Systems Biology (5320)
  • Zoology (1089)