Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Prowler: A novel trimming algorithm for Oxford Nanopore sequence data

Simon Lee, Loan T Nguyen, Ben J Hayes, Elizabeth Ross
doi: https://doi.org/10.1101/2021.05.09.443332
Simon Lee
1Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia QLD 4069, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Loan T Nguyen
1Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia QLD 4069, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ben J Hayes
1Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia QLD 4069, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Elizabeth Ross
1Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia QLD 4069, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: e.ross@uq.edu.au
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Motivation Quality control (QC) tools are critical in DNA sequencing analysis because they increase the accuracy of sequence alignments and thus the reliability of results. Oxford Nanopore Technologies (ONT) QC is currently rudimentary, generally based on whole read average quality. This results in discarding reads that contain regions of high quality sequence. Here we propose Prowler, a multi-window approach inspired by algorithms used to QC short read data. Importantly, we retain the phase and read length information by optionally replacing trimmed sections with Ns.

Results Prowler was applied to mammalian and bacterial datasets, to assess effects on alignment and assembly respectively. Compared to Nanofilt, alignments of data QC’ed with Prowler had lower error rates and more mapped reads. Assemblies of Prowler QC’ed data had a lower error rate than Nanofilt QCed data however this came at some cost to assembly contiguity.

Availability and implementation Prowler is implemented in Python and is available at: https://github.com/ProwlerForNanopore/ProwlerTrimmer

Contact e.ross{at}uq.edu.au

Supplementary information Supplementary data are available at Bioinformatics online.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://drive.google.com/file/d/1ye7l6Zj7gySrWjO1om2cIUWeI8dk6-B1/view?usp=sharing

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted May 10, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Prowler: A novel trimming algorithm for Oxford Nanopore sequence data
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Prowler: A novel trimming algorithm for Oxford Nanopore sequence data
Simon Lee, Loan T Nguyen, Ben J Hayes, Elizabeth Ross
bioRxiv 2021.05.09.443332; doi: https://doi.org/10.1101/2021.05.09.443332
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Prowler: A novel trimming algorithm for Oxford Nanopore sequence data
Simon Lee, Loan T Nguyen, Ben J Hayes, Elizabeth Ross
bioRxiv 2021.05.09.443332; doi: https://doi.org/10.1101/2021.05.09.443332

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3476)
  • Biochemistry (7313)
  • Bioengineering (5288)
  • Bioinformatics (20170)
  • Biophysics (9966)
  • Cancer Biology (7693)
  • Cell Biology (11242)
  • Clinical Trials (138)
  • Developmental Biology (6409)
  • Ecology (9907)
  • Epidemiology (2065)
  • Evolutionary Biology (13260)
  • Genetics (9345)
  • Genomics (12541)
  • Immunology (7664)
  • Microbiology (18918)
  • Molecular Biology (7411)
  • Neuroscience (40844)
  • Paleontology (298)
  • Pathology (1224)
  • Pharmacology and Toxicology (2124)
  • Physiology (3137)
  • Plant Biology (6832)
  • Scientific Communication and Education (1268)
  • Synthetic Biology (1890)
  • Systems Biology (5294)
  • Zoology (1083)