Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

tappAS: a comprehensive computational framework for the analysis of the functional impact of differential splicing

Lorena de la Fuente, Ángeles Arzalluz-Luque, Manuel Tardáguila, Manuel Tardáguila, Héctor del Risco, Cristina Martí, Sonia Tarazona, Pedro Salguero, Raymond Scott, Ana Alastrue-Agudo, Pablo Bonilla, Jeremy Newman, Lauren McIntyre, Victoria Moreno-Manzano, Ana Conesa
doi: https://doi.org/10.1101/690743
Lorena de la Fuente
1Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ángeles Arzalluz-Luque
2Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Manuel Tardáguila
3Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, Florida, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Manuel Tardáguila
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Héctor del Risco
3Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, Florida, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cristina Martí
1Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sonia Tarazona
2Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Pedro Salguero
1Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Raymond Scott
3Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, Florida, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ana Alastrue-Agudo
4Neural Regeneration Laboratory, Prince Felipe Research Center, Valencia, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Pablo Bonilla
4Neural Regeneration Laboratory, Prince Felipe Research Center, Valencia, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeremy Newman
5Genetics Institute, University of Florida, Gainesville, Florida, USA
6Department of Pathology, University of Florida, Gainesville, Florida, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lauren McIntyre
5Genetics Institute, University of Florida, Gainesville, Florida, USA
7Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, Florida, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Victoria Moreno-Manzano
4Neural Regeneration Laboratory, Prince Felipe Research Center, Valencia, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ana Conesa
3Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, Florida, USA
5Genetics Institute, University of Florida, Gainesville, Florida, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: aconesa@ufl.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Traditionally, the functional analysis of gene expression data has used pathway and network enrichment algorithms. These methods are usually gene rather than transcript centric and hence fall short to unravel functional roles associated to posttranscriptional regulatory mechanisms such as Alternative Splicing (AS) and Alternative PolyAdenylation (APA), jointly referred here as Alternative Transcript Processing (AltTP). Moreover, short-read RNA-seq has serious limitations to resolve full-length transcripts, further complicating the study of isoform expression. Recent advances in long-read sequencing open exciting opportunities for studying isoform biology and function. However, there are no established bioinformatics methods for the functional analysis of isoform-resolved transcriptomics data to fully leverage these technological advances. Here we present a novel framework for Functional Iso-Transcriptomics analysis (FIT). This framework uses a rich isoform-level annotation database of functional domains, motifs and sites –both coding and non-coding- and introduces novel analysis methods to interrogate different aspects of the functional relevance of isoform complexity. The Functional Diversity Analysis (FDA) evaluates the variability at the inclusion/exclusion of functional domains across annotated transcripts of the same gene. Parameters can be set to evaluate if AltTP partially or fully disrupts functional elements. FDA is a measure of the potential of a multiple isoform transcriptome to have a functional impact. By combining these functional labels with expression data, the Differential Analysis Module evaluates the relative contribution of transcriptional (i.e. gene level) and post-transcriptional (i.e. transcript/protein levels) regulation on the biology of the system. Measures of isoform relevance such as Minor Isoform Filtering, Isoform Switching Events and Total Isoform Usage Change contribute to restricting analysis to biologically meaningful changes. Finally, novel methods for Differential Feature Inclusion, Co-Feature Inclusion, and the combination of UTR-lengthening with Alternative Polyadenylation analyses carefully dissects the contextual regulation of functional elements resulting from differential isoforms usage. These methods are implemented in the software tappAS, a user-friendly Java application that brings FIT to the hands of non-expert bioinformaticians supporting several model and non-model species. tappAS complements statistical analyses with powerful browsing tools and highly informative gene/transcript/CDS graphs.

We applied tappAS to the analysis of two mouse Neural Precursor Cells (NPCs) and Oligodendrocyte Precursor Cells (OPCs) whose transcriptome was defined by PacBio and quantified by Illumina. Using FDA we confirmed the high potential of AltTP regulation in our system, in which 90% of multi-isoform genes presented variation in functional features at the transcript or protein level. The Differential Analysis module revealed a high interplay between transcriptional and AltTP regulation in neural development, mainly controlled by differential expression, but where AltTP acts the main driver of important neural development biological mechanisms such as vesicle trafficking, signal transduction and RNA processing. The DFI analysis revealed that, globally, AltTP increased the availability of functional features in differentiated neural cells. DFI also showed that AltTP is a mechanism for altering gene function by changing cellular localization and binding properties of proteins, via the differential inclusion of NLS, transmembrane domains or DNA binding motifs, for example. Some of these findings were experimentally validated by others and us.

In summary, we propose a novel framework for the functional analysis of transcriptomes at isoform resolution. We anticipate the tappAS tool will be an important resource for the adoption of the Functional Iso-Transcriptomics analysis by functional genomics community.

Footnotes

  • ↵b These authors jointly supervised this work.

  • http://tappas.org/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted July 04, 2019.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
tappAS: a comprehensive computational framework for the analysis of the functional impact of differential splicing
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
tappAS: a comprehensive computational framework for the analysis of the functional impact of differential splicing
Lorena de la Fuente, Ángeles Arzalluz-Luque, Manuel Tardáguila, Manuel Tardáguila, Héctor del Risco, Cristina Martí, Sonia Tarazona, Pedro Salguero, Raymond Scott, Ana Alastrue-Agudo, Pablo Bonilla, Jeremy Newman, Lauren McIntyre, Victoria Moreno-Manzano, Ana Conesa
bioRxiv 690743; doi: https://doi.org/10.1101/690743
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
tappAS: a comprehensive computational framework for the analysis of the functional impact of differential splicing
Lorena de la Fuente, Ángeles Arzalluz-Luque, Manuel Tardáguila, Manuel Tardáguila, Héctor del Risco, Cristina Martí, Sonia Tarazona, Pedro Salguero, Raymond Scott, Ana Alastrue-Agudo, Pablo Bonilla, Jeremy Newman, Lauren McIntyre, Victoria Moreno-Manzano, Ana Conesa
bioRxiv 690743; doi: https://doi.org/10.1101/690743

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (2430)
  • Biochemistry (4791)
  • Bioengineering (3333)
  • Bioinformatics (14684)
  • Biophysics (6640)
  • Cancer Biology (5172)
  • Cell Biology (7429)
  • Clinical Trials (138)
  • Developmental Biology (4367)
  • Ecology (6874)
  • Epidemiology (2057)
  • Evolutionary Biology (9926)
  • Genetics (7346)
  • Genomics (9533)
  • Immunology (4558)
  • Microbiology (12686)
  • Molecular Biology (4948)
  • Neuroscience (28348)
  • Paleontology (199)
  • Pathology (809)
  • Pharmacology and Toxicology (1392)
  • Physiology (2024)
  • Plant Biology (4504)
  • Scientific Communication and Education (977)
  • Synthetic Biology (1299)
  • Systems Biology (3917)
  • Zoology (726)