FindPeaks 3.1: a tool for identifying areas of enrichment from massively parallel short-read sequencing technology

Anthony P Fejes; Gordon Robertson; Mikhail Bilenky; Richard Varhol; Matthew Bainbridge; Steven J M Jones

doi:10.1093/bioinformatics/btn305

Bioinformatics. 2008 Aug 1;24(15):1729-30. doi: 10.1093/bioinformatics/btn305. Epub 2008 Jul 3.

Authors

Anthony P Fejes¹, Gordon Robertson, Mikhail Bilenky, Richard Varhol, Matthew Bainbridge, Steven J M Jones

Affiliation

¹ Genome Sciences Centre, BC Cancer Agency, Suite 100, 570 West 7th Avenue, Vancouver, British Columbia, Canada. afejes@bcgsc.ca

Abstract

Summary: Next-generation sequencing can provide insight into protein-DNA association events on a genome-wide scale, and is being used increasingly in genomics and metagenomics research. However, few software applications are available for interpreting these experiments. We present an efficient application for chromatin immunoprecipitation sequencing (ChIP-Seq) data that includes novel functionality for identifying areas of gene enrichment and transcription factor binding site locations, as well as for estimating DNA fragment size distributions in enriched areas. FindPeaks can generate UCSC-compatible custom 'WIG' track files from aligned-read files produced by short-read sequencing technologies. The software runs on any platform that provides a Java Runtime Environment. Memory requirements are proportional to the number of sequencing reads analyzed; typically, 4 GB permits processing of up to 40 million reads.
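The general idea of turning aligned reads into a coverage track and flagging enriched regions can be illustrated with a minimal sketch. This is not the FindPeaks implementation, which is written in Java and described in its manual; the function names (`coverage_from_reads`, `call_peaks`, `to_wig`), the fixed 200 bp fragment length, the simple height threshold, and the input format of (chromosome, 1-based 5' coordinate, strand) tuples are all assumptions made for illustration.

```python
from collections import defaultdict


def coverage_from_reads(reads, fragment_length=200):
    """Extend each read to an assumed fragment length and pile up depth.

    reads: iterable of (chrom, pos, strand), where pos is the 1-based
    coordinate of the read's 5' end (an assumed input format).
    Returns {chrom: [(start, span, depth), ...]} for runs with depth > 0.
    """
    diffs = defaultdict(lambda: defaultdict(int))
    for chrom, pos, strand in reads:
        if strand == "+":
            start, end = pos, pos + fragment_length - 1
        else:
            start, end = max(1, pos - fragment_length + 1), pos
        diffs[chrom][start] += 1      # coverage rises at the fragment start
        diffs[chrom][end + 1] -= 1    # and falls just past the fragment end

    runs = {}
    for chrom, d in diffs.items():
        positions = sorted(d)
        depth, out = 0, []
        for here, nxt in zip(positions, positions[1:]):
            depth += d[here]
            if depth > 0:
                out.append((here, nxt - here, depth))
        runs[chrom] = out
    return runs


def call_peaks(runs, min_height=2):
    """Merge adjacent runs whose depth is at least min_height.

    Returns (chrom, start, end, max_depth) tuples with inclusive coordinates.
    """
    peaks = []
    for chrom in sorted(runs):
        current = None
        for start, span, depth in runs[chrom]:
            end = start + span - 1
            if depth >= min_height and current and current[2] + 1 == start:
                current = (chrom, current[1], end, max(current[3], depth))
                continue
            if current:
                peaks.append(current)
            current = (chrom, start, end, depth) if depth >= min_height else None
        if current:
            peaks.append(current)
    return peaks


def to_wig(runs, name="coverage"):
    """Serialize coverage runs as a variableStep WIG track (1-based starts)."""
    lines = [f'track type=wiggle_0 name="{name}"']
    for chrom in sorted(runs):
        for start, span, depth in runs[chrom]:
            lines.append(f"variableStep chrom={chrom} span={span}")
            lines.append(f"{start} {depth}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    example = [("chr1", 100, "+"), ("chr1", 150, "+"), ("chr1", 400, "-")]
    cov = coverage_from_reads(example)
    print(call_peaks(cov))
    print(to_wig(cov), end="")
```

The sketch stores only the positions where coverage changes rather than a per-base array, so its memory use grows with the number of reads and not with genome size. For scale, the figure quoted above (4 GB for up to 40 million reads) corresponds to roughly 100 bytes per read.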

Availability: The FindPeaks 3.1 package and manual, containing algorithm descriptions, usage instructions and examples, are available at http://www.bcgsc.ca/platform/bioinfo/software/findpeaks. Source files for FindPeaks 3.1 are available for academic use.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Binding Sites
Chromatin Immunoprecipitation / methods*
Chromosome Mapping / methods*
Pattern Recognition, Automated / methods*
Sequence Analysis, DNA / methods*
Software*
Transcription Factors / genetics*

Substances

Transcription Factors