Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Mining all publicly available expression data to compute dynamic microbial transcriptional regulatory networks

View ORCID ProfileAnand V. Sastry, View ORCID ProfileSaugat Poudel, View ORCID ProfileKevin Rychel, View ORCID ProfileReo Yoo, Cameron R. Lamoureux, View ORCID ProfileSiddharth Chauhan, View ORCID ProfileZachary B. Haiman, Tahani Al Bulushi, View ORCID ProfileYara Seif, View ORCID ProfileBernhard O. Palsson
doi: https://doi.org/10.1101/2021.07.01.450581
Anand V. Sastry
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Anand V. Sastry
  • For correspondence: avsastry@eng.ucsd.edu
Saugat Poudel
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Saugat Poudel
Kevin Rychel
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kevin Rychel
Reo Yoo
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Reo Yoo
Cameron R. Lamoureux
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Siddharth Chauhan
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Siddharth Chauhan
Zachary B. Haiman
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Zachary B. Haiman
Tahani Al Bulushi
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yara Seif
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yara Seif
Bernhard O. Palsson
1Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
2Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, 2800, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Bernhard O. Palsson
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

We are firmly in the era of biological big data. Millions of omics datasets are publicly accessible and can be employed to support scientific research or build a holistic view of an organism. Here, we introduce a workflow that converts all public gene expression data for a microbe into a dynamic representation of the organism’s transcriptional regulatory network. This five-step process walks researchers through the mining, processing, curation, analysis, and characterization of all available expression data, using Bacillus subtilis as an example. The resulting reconstruction of the B. subtilis regulatory network can be leveraged to predict new regulons and analyze datasets in the context of all published data. The results are hosted at https://imodulondb.org/, and additional analyses can be performed using the PyModulon Python package. As the number of publicly available datasets increases, this pipeline will be applicable to a wide range of microbial pathogens and cell factories.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://github.com/avsastry/modulome-workflow

  • https://pymodulon.readthedocs.io/en/latest/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted July 02, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Mining all publicly available expression data to compute dynamic microbial transcriptional regulatory networks
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Mining all publicly available expression data to compute dynamic microbial transcriptional regulatory networks
Anand V. Sastry, Saugat Poudel, Kevin Rychel, Reo Yoo, Cameron R. Lamoureux, Siddharth Chauhan, Zachary B. Haiman, Tahani Al Bulushi, Yara Seif, Bernhard O. Palsson
bioRxiv 2021.07.01.450581; doi: https://doi.org/10.1101/2021.07.01.450581
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Mining all publicly available expression data to compute dynamic microbial transcriptional regulatory networks
Anand V. Sastry, Saugat Poudel, Kevin Rychel, Reo Yoo, Cameron R. Lamoureux, Siddharth Chauhan, Zachary B. Haiman, Tahani Al Bulushi, Yara Seif, Bernhard O. Palsson
bioRxiv 2021.07.01.450581; doi: https://doi.org/10.1101/2021.07.01.450581

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4397)
  • Biochemistry (9624)
  • Bioengineering (7120)
  • Bioinformatics (24937)
  • Biophysics (12665)
  • Cancer Biology (9991)
  • Cell Biology (14395)
  • Clinical Trials (138)
  • Developmental Biology (7988)
  • Ecology (12146)
  • Epidemiology (2067)
  • Evolutionary Biology (16022)
  • Genetics (10950)
  • Genomics (14778)
  • Immunology (9899)
  • Microbiology (23732)
  • Molecular Biology (9503)
  • Neuroscience (51045)
  • Paleontology (370)
  • Pathology (1544)
  • Pharmacology and Toxicology (2692)
  • Physiology (4037)
  • Plant Biology (8693)
  • Scientific Communication and Education (1512)
  • Synthetic Biology (2404)
  • Systems Biology (6456)
  • Zoology (1349)