Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Fusion detection and quantification by pseudoalignment

View ORCID ProfilePáll Melsted, Shannon Hateley, Isaac Charles Joseph, Harold Pimentel, Nicolas L Bray, Lior Pachter
doi: https://doi.org/10.1101/166322
Páll Melsted
University of Iceland;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Páll Melsted
  • For correspondence: pmelsted@gmail.com
Shannon Hateley
Department of Molecular and Cell Biology, University of California, Berkeley;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Isaac Charles Joseph
Graduate Program in Computational Biology, University of California, Berkeley;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Harold Pimentel
UC Berkeley;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nicolas L Bray
Innovative Genomics Initiative, University of California, Berkeley;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lior Pachter
Division of Biology and Biological Engineering, California Institute of Technology, Pasadena
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

RNA sequencing in cancer cells is a powerful technique to detect chromosomal rearrangements, allowing for de novo discovery of actively expressed fusion genes. Here we focus on the problem of detecting gene fusions from raw sequencing data, assembling the reads to define fusion transcripts and their associated breakpoints, and quantifying their abundances. Building on the pseudoalignment idea that simplifies and accelerates transcript quantification, we introduce a novel approach to fusion detection based on inspecting paired reads that cannot be pseudoaligned due to conflicting matches. The method and software, called pizzly, filters false positives, assembles new transcripts from the fusion reads, and reports candidate fusions. With pizzly, fusion detection from raw RNA-Seq reads can be performed in a matter of minutes, making the program suitable for the analysis of large cancer gene expression databases and for clinical use. pizzly is available at https://github.com/pmelsted/pizzly

Copyright 
The copyright holder for this preprint is the author/funder. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
  • Posted July 20, 2017.

Download PDF

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Fusion detection and quantification by pseudoalignment
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
Share
Fusion detection and quantification by pseudoalignment
Páll Melsted, Shannon Hateley, Isaac Charles Joseph, Harold Pimentel, Nicolas L Bray, Lior Pachter
bioRxiv 166322; doi: https://doi.org/10.1101/166322
del.icio.us logo Digg logo Reddit logo Technorati logo Twitter logo CiteULike logo Connotea logo Facebook logo Google logo Mendeley logo
Citation Tools
Fusion detection and quantification by pseudoalignment
Páll Melsted, Shannon Hateley, Isaac Charles Joseph, Harold Pimentel, Nicolas L Bray, Lior Pachter
bioRxiv 166322; doi: https://doi.org/10.1101/166322

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (620)
  • Biochemistry (860)
  • Bioengineering (516)
  • Bioinformatics (4762)
  • Biophysics (1503)
  • Cancer Biology (1030)
  • Cell Biology (1448)
  • Clinical Trials (52)
  • Developmental Biology (974)
  • Ecology (1633)
  • Epidemiology (808)
  • Evolutionary Biology (3690)
  • Genetics (2513)
  • Genomics (3266)
  • Immunology (602)
  • Microbiology (2416)
  • Molecular Biology (895)
  • Neuroscience (6488)
  • Paleontology (42)
  • Pathology (124)
  • Pharmacology and Toxicology (220)
  • Physiology (287)
  • Plant Biology (893)
  • Scientific Communication and Education (247)
  • Synthetic Biology (386)
  • Systems Biology (1323)
  • Zoology (162)