MOIRAI: a compact workflow system for CAGE analysis

BMC Bioinformatics. 2014 May 16:15:144. doi: 10.1186/1471-2105-15-144.

Abstract

Background: Cap analysis of gene expression (CAGE) is a sequencing based technology to capture the 5' ends of RNAs in a biological sample. After mapping, a CAGE peak on the genome indicates the position of an active transcriptional start site (TSS) and the number of reads correspond to its expression level. CAGE is prominently used in both the FANTOM and ENCODE project but presently there is no software package to perform the essential data processing steps.

Results: Here we describe MOIRAI, a compact yet flexible workflow system designed to carry out the main steps in data processing and analysis of CAGE data. MOIRAI has a graphical interface allowing wet-lab researchers to create, modify and run analysis workflows. Embedded within the workflows are graphical quality control indicators allowing users assess data quality and to quickly spot potential problems. We will describe three main workflows allowing users to map, annotate and perform an expression analysis over multiple samples.

Conclusions: Due to the many built in quality control features MOIRAI is especially suitable to support the development of new sequencing based protocols.

Availiability: The MOIRAI source code is freely available at http://sourceforge.net/projects/moirai/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Profiling / methods*
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • K562 Cells
  • Molecular Sequence Annotation
  • Software*
  • Workflow