PT - JOURNAL ARTICLE AU - Marek Cmero AU - Nadia M Davidson AU - Alicia Oshlack TI - Fast and accurate differential transcript usage by testing equivalence class counts AID - 10.1101/501106 DP - 2018 Jan 01 TA - bioRxiv PG - 501106 4099 - http://biorxiv.org/content/early/2018/12/19/501106.short 4100 - http://biorxiv.org/content/early/2018/12/19/501106.full AB - RNA sequencing has enabled high-throughput and fine-grained quantitative analyses of the transcriptome. While differential gene expression is the most widely used application of this technology, RNA-seq data also has the resolution to infer differential transcript usage (DTU), which can elucidate the role of different transcript isoforms between experimental conditions, cell types or tissues. DTU has typically been inferred from exon-count data, which has issues with assigning reads unambiguously to counting bins, and requires alignment of reads to the genome. Recently, approaches have emerged that use transcript quantifications estimates directly for DTU. Transcript counts can be inferred from ‘pseudo’ or lightweight aligners, which are significantly faster than traditional genome alignment. However, recent evaluations show lower sensitivity in DTU analysis. Transcript abundances are estimated from equivalence classes (ECs), which determine the transcripts that any given read is compatible with. Here we propose performing DTU testing directly on equivalence class read counts. We evaluate this approach on simulated human and drosophila data, as well as on a real dataset through subset testing. We find that ECs counts have similar sensitivity and false discovery rates as exon-level counts but can be generated in a fraction of the time through the use of pseudo-aligners. We posit that equivalent class counts is a natural unit on which to perform many types of analysis.