Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Recovery of 447 Eukaryotic bins reveals major challenges for Eukaryote genome reconstruction from metagenomes

View ORCID ProfileJoao Pedro Saraiva, Alexander Bartholomäus, Rodolfo Brizola Toscan, View ORCID ProfilePetr Baldrian, Ulisses Nunes da Rocha
doi: https://doi.org/10.1101/2022.04.07.487146
Joao Pedro Saraiva
1Department of Environmental Microbiology, Helmholtz Centre for Environmental Research – UFZ GmbH, Leipzig, Saxony, 04318, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Joao Pedro Saraiva
Alexander Bartholomäus
2GFZ German Research Centre for Geosciences, Section 3.7 Geomicrobiology, Telegrafenberg, Potsdam, 14473, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rodolfo Brizola Toscan
1Department of Environmental Microbiology, Helmholtz Centre for Environmental Research – UFZ GmbH, Leipzig, Saxony, 04318, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Petr Baldrian
3Laboratory of Environmental Microbiology, Institute of Microbiology of the Czech Academy of Sciences, Videnska 1083, Praha 4, 14220, Czech Republic
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Petr Baldrian
Ulisses Nunes da Rocha
1Department of Environmental Microbiology, Helmholtz Centre for Environmental Research – UFZ GmbH, Leipzig, Saxony, 04318, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: ulisses.rocha@ufz.de
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

An estimated 8.7 million eukaryotic species exist on our planet. However, recent tools for taxonomic classification of eukaryotes only dispose of 734 reference genomes. As most Eukaryotic genomes are yet to be sequenced, the mechanisms underlying their contribution to different ecosystem processes remain untapped. Although approaches to recover Prokaryotic genomes have become common in genome biology, few studies have tackled the recovery of Eukaryotic genomes from metagenomes. This study assessed the reconstruction of Eukaryotic genomes using 215 metagenomes from diverse environments using the EukRep pipeline. We obtained 447 eukaryotic bins from 15 classes (e.g., Saccharomycetes, Sordariomycetes, and Mamiellophyceae) and 16 orders (e.g., Mamiellales, Saccharomycetales, and Hypocreales). More than 73% of the obtained eukaryotic bins were recovered from samples whose biomes were classified as host-associated, aquatic and anthropogenic terrestrial. However, only 93 bins showed taxonomic classification to (9 unique) genera and 17 bins to (6 unique) species. A total of 193 bins contained completeness and contamination measures. Average completeness and contamination were 44.64% (σ=27.41%) and 3.97% (σ=6.53%), respectively. Micromonas commoda was the most frequent taxa found while Saccharomyces cerevisiae presented the highest completeness, possibly resulting from a more significant number of reference genomes. However, mapping eukaryotic bins to the chromosomes of the reference genomes suggests that completeness measures should consider both single-copy genes and chromosome coverage. Recovering eukaryotic genomes will benefit significantly from long-read sequencing, intron removal after assembly, and improved reference genomes databases.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • Joao Pedro Saraiva: joao.saraiva{at}ufz.de, Alexander Bartholomäus: abartho{at}gfz-potsdam.de, Rodolfo Brizola Toscan: rodolfo.toscan{at}ufz.de, Petr Baldrian: baldrian{at}biomed.cas.cz

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted April 10, 2022.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Recovery of 447 Eukaryotic bins reveals major challenges for Eukaryote genome reconstruction from metagenomes
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Recovery of 447 Eukaryotic bins reveals major challenges for Eukaryote genome reconstruction from metagenomes
Joao Pedro Saraiva, Alexander Bartholomäus, Rodolfo Brizola Toscan, Petr Baldrian, Ulisses Nunes da Rocha
bioRxiv 2022.04.07.487146; doi: https://doi.org/10.1101/2022.04.07.487146
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Recovery of 447 Eukaryotic bins reveals major challenges for Eukaryote genome reconstruction from metagenomes
Joao Pedro Saraiva, Alexander Bartholomäus, Rodolfo Brizola Toscan, Petr Baldrian, Ulisses Nunes da Rocha
bioRxiv 2022.04.07.487146; doi: https://doi.org/10.1101/2022.04.07.487146

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4688)
  • Biochemistry (10380)
  • Bioengineering (7696)
  • Bioinformatics (26374)
  • Biophysics (13551)
  • Cancer Biology (10731)
  • Cell Biology (15464)
  • Clinical Trials (138)
  • Developmental Biology (8509)
  • Ecology (12844)
  • Epidemiology (2067)
  • Evolutionary Biology (16887)
  • Genetics (11416)
  • Genomics (15493)
  • Immunology (10639)
  • Microbiology (25258)
  • Molecular Biology (10241)
  • Neuroscience (54598)
  • Paleontology (402)
  • Pathology (1671)
  • Pharmacology and Toxicology (2899)
  • Physiology (4355)
  • Plant Biology (9263)
  • Scientific Communication and Education (1588)
  • Synthetic Biology (2561)
  • Systems Biology (6789)
  • Zoology (1472)