Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Eukaryotic genomes from a global metagenomic dataset illuminate trophic modes and biogeography of ocean plankton

View ORCID ProfileHarriet Alexander, View ORCID ProfileSarah K. Hu, View ORCID ProfileArianna I. Krinos, View ORCID ProfileMaria Pachiadaki, View ORCID ProfileBenjamin J. Tully, View ORCID ProfileChristopher J. Neely, View ORCID ProfileTaylor Reiter
doi: https://doi.org/10.1101/2021.07.25.453713
Harriet Alexander
1Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA, USA, 02543
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Harriet Alexander
  • For correspondence: halexander@whoi.edu
Sarah K. Hu
2Marine Chemistry and Geochemistry, Woods Hole Oceanographic Institution, Woods Hole, MA, USA, 02543
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sarah K. Hu
Arianna I. Krinos
1Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA, USA, 02543
3MIT-WHOI Joint Program in Oceanography, Cambridge and Woods Hole, MA, 02540
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Arianna I. Krinos
Maria Pachiadaki
1Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA, USA, 02543
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Maria Pachiadaki
Benjamin J. Tully
4Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Benjamin J. Tully
Christopher J. Neely
5Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Christopher J. Neely
Taylor Reiter
6Population Health and Reproduction, University of California, Davis, Davis, CA, 95616
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Taylor Reiter
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Metagenomics is a powerful method for interpreting the ecological roles and physiological capabilities of mixed microbial communities. Yet, many tools for processing metagenomic data are not designed to consider eukaryotes, nor are they built for an increasing amount of sequence data. EukHeist is an automated pipeline to retrieve eukaryotic and prokaryotic metagenome assembled genomes (MAGs) from large-scale metagenomic datasets. We developed the EukHeist workflow to specifically process large amounts of both metagenomic and/or metatranscriptomic sequence data in an automated and reproducible fashion. Here, we applied EukHeist to the large-size fraction data (0.8-2000µm) from Tara Oceans to recover both eukaryotic and prokaryotic MAGs, which we refer to as TOPAZ (Tara Oceans Particle-Associated MAGs). The TOPAZ MAGs consisted of >900 environmentally-relevant eukaryotic MAGs and >4,000 bacterial and archaeal MAGs. The bacterial and archaeal TOPAZ MAGs expand the known marine phylogenetic diversity through the increase in coverage of likely particle- and host-associated taxa. We also demonstrate an approach to infer the putative functional mode of the recovered eukaryotic MAGs. A global survey of the TOPAZ MAGs enabled the identification of ecological cohorts, driven by specific environmental factors, and putative host-microbe associations.

Importance Despite the ecological importance of single-celled eukaryotic organisms in marine environments, the majority are difficult to cultivate in the lab. Sequencing genetic material extracted from environmental samples enables researchers to document naturally-occurring protistan communities. However, conventional sequencing methodologies cannot separate out the genomes of individual organisms. To more completely capture the entire genomic content of mixed protistan community, we can create bins of sequences that represent the same organism. We developed a pipeline that enables scientists to bin individual organisms out of metagenomic reads, and show results that provide exciting insights into what protistan communities are present in the ocean and what roles they play in the ecosystem. Here, a global survey of both eukaryotic and prokaryotic MAGs enabled the identification of ecological cohorts, driven by specific environmental factors, and putative host-microbe associations. Accessible and scalable computational tools, such as EukHeist, are likely to accelerate the identification of meaningful genetic signatures from large datasets, ultimately expanding the eukaryotic tree of life.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://osf.io/gm564/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted June 01, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Eukaryotic genomes from a global metagenomic dataset illuminate trophic modes and biogeography of ocean plankton
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Eukaryotic genomes from a global metagenomic dataset illuminate trophic modes and biogeography of ocean plankton
Harriet Alexander, Sarah K. Hu, Arianna I. Krinos, Maria Pachiadaki, Benjamin J. Tully, Christopher J. Neely, Taylor Reiter
bioRxiv 2021.07.25.453713; doi: https://doi.org/10.1101/2021.07.25.453713
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Eukaryotic genomes from a global metagenomic dataset illuminate trophic modes and biogeography of ocean plankton
Harriet Alexander, Sarah K. Hu, Arianna I. Krinos, Maria Pachiadaki, Benjamin J. Tully, Christopher J. Neely, Taylor Reiter
bioRxiv 2021.07.25.453713; doi: https://doi.org/10.1101/2021.07.25.453713

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Microbiology
Subject Areas
All Articles
  • Animal Behavior and Cognition (3688)
  • Biochemistry (7783)
  • Bioengineering (5673)
  • Bioinformatics (21267)
  • Biophysics (10574)
  • Cancer Biology (8170)
  • Cell Biology (11929)
  • Clinical Trials (138)
  • Developmental Biology (6757)
  • Ecology (10394)
  • Epidemiology (2065)
  • Evolutionary Biology (13853)
  • Genetics (9702)
  • Genomics (13063)
  • Immunology (8136)
  • Microbiology (19976)
  • Molecular Biology (7841)
  • Neuroscience (43032)
  • Paleontology (318)
  • Pathology (1278)
  • Pharmacology and Toxicology (2258)
  • Physiology (3350)
  • Plant Biology (7221)
  • Scientific Communication and Education (1311)
  • Synthetic Biology (2000)
  • Systems Biology (5533)
  • Zoology (1127)