PT - JOURNAL ARTICLE
AU - Sung-Huan Yu
AU - Jörg Vogel
AU - Konrad U. Förstner
TI - ANNOgesic: A pipeline to translate bacterial/archaeal RNA-Seq data into high-resolution genome annotations
AID - 10.1101/143081
DP - 2017 Jan 01
TA - bioRxiv
PG - 143081
4099 - http://biorxiv.org/content/early/2017/05/29/143081.short
4100 - http://biorxiv.org/content/early/2017/05/29/143081.full
AB - To understand the gene regulation of an organism of interest, a comprehensive genome annotation is essential. While some features, such as coding sequences, can be computationally predicted with high accuracy based purely on the genomic sequence, others, such as promoter elements or noncoding RNAs, are harder to detect. RNA-Seq has proven to be an efficient method to identify these genomic features and to improve genome annotations. However, processing and integrating RNA-Seq data to generate high-resolution annotations is challenging and time-consuming, and it requires numerous different steps. We have constructed a powerful and modular pipeline called ANNOgesic that provides the required analyses and simplifies RNA-Seq-based bacterial and archaeal genome annotation. It predicts and annotates numerous features, including small non-coding RNAs, with high precision. The software is available under an open source license (ISCL) at https://pythonhosted.org/ANNOgesic/.