bioRxiv
New Results

Self-Supervised Deep Learning Encodes High-Resolution Features of Protein Subcellular Localization

Hirofumi Kobayashi, Keith C. Cheveralls, Manuel D. Leonetti, Loic A. Royer
doi: https://doi.org/10.1101/2021.03.29.437595
1 CZ Biohub, San Francisco, USA (all authors)
For correspondence: hirofumi.kobayashi@czbiohub.org, manuel.leonetti@czbiohub.org, loic.royer@czbiohub.org

Abstract

Elucidating the diversity and complexity of protein localization is essential to fully understand cellular architecture. Here, we present cytoself, a deep-learning approach for fully self-supervised protein localization profiling and clustering. cytoself leverages a self-supervised training scheme that does not require pre-existing knowledge, categories, or annotations. Training cytoself on images of 1,311 endogenously labeled proteins from the OpenCell database reveals a highly resolved protein localization atlas that recapitulates major scales of cellular organization, from coarse classes such as nuclear, cytoplasmic and vesicular, to the subtle localization signatures of individual protein complexes. We quantitatively validate cytoself’s ability to cluster proteins into organelles and protein complex clusters using a clustering score, and show that cytoself attains higher scores than previous unsupervised or self-supervised approaches. Finally, to better understand the inner workings of our model, we dissect the emergent features from which our clustering is derived, interpret these features in the context of the fluorescence images, and analyze the performance contributions of the different components of our approach.
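As an illustration of how a clustering score can quantify agreement between learned embeddings and known groupings, here is a minimal sketch. The abstract does not define the score the authors use, so this example swaps in the standard silhouette coefficient instead; the toy 2-D points, the labels, and the function name `silhouette_score` are all hypothetical and are not taken from cytoself.

```python
import math


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (Euclidean distance).

    NOTE: a stand-in metric for illustration only; it is an assumption,
    not the clustering score defined in the paper.
    """
    # Group point indices by their cluster label.
    clusters = {}
    for idx, label in enumerate(labels):
        clusters.setdefault(label, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("need at least two clusters")

    def mean_dist(i, members):
        # Mean distance from point i to the members of a cluster, excluding i.
        others = [j for j in members if j != i]
        if not others:
            return 0.0
        return sum(math.dist(points[i], points[j]) for j in others) / len(others)

    total = 0.0
    for i, label in enumerate(labels):
        own = clusters[label]
        if len(own) == 1:
            continue  # a singleton cluster contributes a silhouette of 0
        a = mean_dist(i, own)  # cohesion: mean distance within own cluster
        b = min(               # separation: nearest other cluster
            mean_dist(i, members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        total += (b - a) / denom if denom > 0 else 0.0
    return total / len(points)


# Toy 2-D "embeddings" with hypothetical localization labels.
embeddings = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
good = silhouette_score(embeddings, ["nuclear", "nuclear", "vesicular", "vesicular"])
bad = silhouette_score(embeddings, ["nuclear", "vesicular", "nuclear", "vesicular"])
```

Here `good` is close to 1 because each label matches a tight, well-separated group, while `bad` is negative because the labels cut across the groups. A higher score for one embedding than another, under the same reference labels, is the kind of comparison the abstract describes.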

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • New quantitative results and supporting supplementary tables and figures.

  • http://github.com/royerlab/cytoself

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted March 09, 2022.
Subject Area

  • Cell Biology