Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Detection of pathogenic splicing events from RNA-sequencing data using dasper

David Zhang, Regina H. Reynolds, Sonia Garcia-Ruiz, Emil K Gustavsson, Sid Sethi, Sara Aguti, Ines A. Barbosa, Jack J. Collier, Henry Houlden, Robert McFarland, Francesco Muntoni, Monika Oláhová, Joanna Poulton, Michael Simpson, View ORCID ProfileRobert D.S. Pitceathly, Robert W. Taylor, Haiyan Zhou, Charu Deshpande, View ORCID ProfileJuan A. Botia, Leonardo Collado-Torres, Mina Ryten
doi: https://doi.org/10.1101/2021.03.29.437534
David Zhang
1Institute of Child Health, University College London (UCL), UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Regina H. Reynolds
2Department of Neurodegenerative diseases, UCL Queen Square Institute of Neurology, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sonia Garcia-Ruiz
1Institute of Child Health, University College London (UCL), UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Emil K Gustavsson
1Institute of Child Health, University College London (UCL), UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sid Sethi
2Department of Neurodegenerative diseases, UCL Queen Square Institute of Neurology, London, UK
3Astex Pharmaceuticals, 436 Cambridge Science Park, Cambridge, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sara Aguti
1Institute of Child Health, University College London (UCL), UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ines A. Barbosa
4Department of Medical & Molecular Genetics, King’s College London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jack J. Collier
5Wellcome Centre for Mitochondrial Research, Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Henry Houlden
2Department of Neurodegenerative diseases, UCL Queen Square Institute of Neurology, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Robert McFarland
5Wellcome Centre for Mitochondrial Research, Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Francesco Muntoni
1Institute of Child Health, University College London (UCL), UK
6NIHR Great Ormond Street Hospital Biomedical Research Centre, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Monika Oláhová
5Wellcome Centre for Mitochondrial Research, Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joanna Poulton
7Nuffield Department of Women’s & Reproductive Health, University of Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael Simpson
4Department of Medical & Molecular Genetics, King’s College London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Robert D.S. Pitceathly
8Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Robert D.S. Pitceathly
Robert W. Taylor
5Wellcome Centre for Mitochondrial Research, Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Haiyan Zhou
1Institute of Child Health, University College London (UCL), UK
6NIHR Great Ormond Street Hospital Biomedical Research Centre, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Charu Deshpande
9Centre for Genomic Medicine, Manchester, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Juan A. Botia
10Departamento de Ingeniería de la Información y las Comunicaciones, Universidad de Murcia, 30100 Murcia, Spain
2Department of Neurodegenerative diseases, UCL Queen Square Institute of Neurology, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Juan A. Botia
Leonardo Collado-Torres
11Lieber Institute for Brain Development, Baltimore, MD, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mina Ryten
1Institute of Child Health, University College London (UCL), UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: mina.ryten@ucl.ac.uk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Although next-generation sequencing technologies have accelerated the discovery of novel gene-to-disease associations, many patients with suspected Mendelian diseases still leave the clinic without a genetic diagnosis. An estimated one third of these patients will have disorders caused by mutations impacting splicing. RNA-sequencing has been shown to be a promising diagnostic tool, however few methods have been developed to integrate RNA-sequencing data into the diagnostic pipeline. Here, we introduce dasper, an R/Bioconductor package that improves upon existing tools for detecting aberrant splicing by using machine learning to incorporate disruptions in exon-exon junction counts as well as coverage. dasper is designed for diagnostics, providing a rank-based report of how aberrant each splicing event looks, as well as including visualization functionality to facilitate interpretation. We validate dasper using 16 patient-derived fibroblast cell lines harbouring pathogenic variants known to impact splicing. We find that dasper is able to detect pathogenic splicing events with greater accuracy than existing LeafCutterMD or z-score approaches. Furthermore, by only applying a broad OMIM gene filter (without any variant-level filters), dasper is able to detect pathogenic splicing events within the top 10 most aberrant identified for each patient. Since using publicly available control data minimises costs associated with incorporating RNA-sequencing into diagnostic pipelines, we also investigate the use of 504 GTEx fibroblast samples as controls. We find that dasper leverages publicly available data effectively, ranking pathogenic splicing events in the top 25. Thus, we believe dasper can increase diagnostic yield for a pathogenic splicing variants and enable the efficient implementation of RNA-sequencing for diagnostics in clinical laboratories.

Competing Interest Statement

The authors have declared no competing interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted March 30, 2021.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Detection of pathogenic splicing events from RNA-sequencing data using dasper
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Detection of pathogenic splicing events from RNA-sequencing data using dasper
David Zhang, Regina H. Reynolds, Sonia Garcia-Ruiz, Emil K Gustavsson, Sid Sethi, Sara Aguti, Ines A. Barbosa, Jack J. Collier, Henry Houlden, Robert McFarland, Francesco Muntoni, Monika Oláhová, Joanna Poulton, Michael Simpson, Robert D.S. Pitceathly, Robert W. Taylor, Haiyan Zhou, Charu Deshpande, Juan A. Botia, Leonardo Collado-Torres, Mina Ryten
bioRxiv 2021.03.29.437534; doi: https://doi.org/10.1101/2021.03.29.437534
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Detection of pathogenic splicing events from RNA-sequencing data using dasper
David Zhang, Regina H. Reynolds, Sonia Garcia-Ruiz, Emil K Gustavsson, Sid Sethi, Sara Aguti, Ines A. Barbosa, Jack J. Collier, Henry Houlden, Robert McFarland, Francesco Muntoni, Monika Oláhová, Joanna Poulton, Michael Simpson, Robert D.S. Pitceathly, Robert W. Taylor, Haiyan Zhou, Charu Deshpande, Juan A. Botia, Leonardo Collado-Torres, Mina Ryten
bioRxiv 2021.03.29.437534; doi: https://doi.org/10.1101/2021.03.29.437534

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3698)
  • Biochemistry (7809)
  • Bioengineering (5689)
  • Bioinformatics (21330)
  • Biophysics (10595)
  • Cancer Biology (8199)
  • Cell Biology (11961)
  • Clinical Trials (138)
  • Developmental Biology (6777)
  • Ecology (10419)
  • Epidemiology (2065)
  • Evolutionary Biology (13900)
  • Genetics (9726)
  • Genomics (13094)
  • Immunology (8164)
  • Microbiology (20058)
  • Molecular Biology (7871)
  • Neuroscience (43147)
  • Paleontology (321)
  • Pathology (1280)
  • Pharmacology and Toxicology (2264)
  • Physiology (3362)
  • Plant Biology (7246)
  • Scientific Communication and Education (1315)
  • Synthetic Biology (2010)
  • Systems Biology (5547)
  • Zoology (1132)