Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

SignalP 6.0 achieves signal peptide prediction across all types using protein language models

View ORCID ProfileFelix Teufel, José Juan Almagro Armenteros, Alexander Rosenberg Johansen, Magnús Halldór Gíslason, Silas Irby Pihl, View ORCID ProfileKonstantinos D. Tsirigos, View ORCID ProfileOle Winther, Søren Brunak, View ORCID ProfileGunnar von Heijne, View ORCID ProfileHenrik Nielsen
doi: https://doi.org/10.1101/2021.06.09.447770
Felix Teufel
1Section for Bioinformatics, Department of Health Technology, Technical University of Denmark; 2800 Kgs. Lyngby, Denmark
2Department of Biosystems Science and Engineering, ETH Zurich; 4058 Basel, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Felix Teufel
José Juan Almagro Armenteros
1Section for Bioinformatics, Department of Health Technology, Technical University of Denmark; 2800 Kgs. Lyngby, Denmark
3Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen; 2200 Copenhagen, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexander Rosenberg Johansen
4Department of Computer Science, Stanford University; Stanford, CA 94305, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Magnús Halldór Gíslason
5Center for Genomic Medicine, Rigshospitalet (Copenhagen University Hospital); 2100 Copenhagen, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Silas Irby Pihl
1Section for Bioinformatics, Department of Health Technology, Technical University of Denmark; 2800 Kgs. Lyngby, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Konstantinos D. Tsirigos
6EMBL-EBI, Wellcome Genome Campus; Cambridge, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Konstantinos D. Tsirigos
Ole Winther
5Center for Genomic Medicine, Rigshospitalet (Copenhagen University Hospital); 2100 Copenhagen, Denmark
7Department of Biology, Bioinformatics Centre, University of Copenhagen; 2200 Copenhagen, Denmark
8Section for Cognitive Systems, Department of Applied Mathematics and Computer Science, Technical University of Denmark; 2800 Kgs. Lyngby, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ole Winther
Søren Brunak
1Section for Bioinformatics, Department of Health Technology, Technical University of Denmark; 2800 Kgs. Lyngby, Denmark
3Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen; 2200 Copenhagen, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gunnar von Heijne
9Department of Biochemistry and Biophysics, Stockholm University; 10691 Stockholm, Sweden
10Science for Life Laboratory, Stockholm University; Box 101, 171 21 Solna, Sweden
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Gunnar von Heijne
Henrik Nielsen
1Section for Bioinformatics, Department of Health Technology, Technical University of Denmark; 2800 Kgs. Lyngby, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Henrik Nielsen
  • For correspondence: henni@dtu.dk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Signal peptides (SPs) are short amino acid sequences that control protein secretion and translocation in all living organisms. As experimental characterization of SPs is costly, prediction algorithms are applied to predict them from sequence data. However, existing methods are unable to detect all known types of SPs. We introduce SignalP 6.0, the first model capable of detecting all five SP types. Additionally, the model accurately identifies the positions of regions within SPs, revealing the defining biochemical properties that underlie the function of SPs in vivo. Results show that SignalP 6.0 has improved prediction performance, and is the first model to be applicable to metagenomic data.

SignalP 6.0 is available at https://services.healthtech.dtu.dk/service.php?SignalP-6.0

Competing Interest Statement

The downloadable version of SignalP 6.0 has been commercialized (it is licensed for a fee to commercial users). The revenue from these commercial sales is divided between the program developers and the Technical University of Denmark.

Footnotes

  • https://services.healthtech.dtu.dk/service.php?SignalP-6.0

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted June 10, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
SignalP 6.0 achieves signal peptide prediction across all types using protein language models
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
SignalP 6.0 achieves signal peptide prediction across all types using protein language models
Felix Teufel, José Juan Almagro Armenteros, Alexander Rosenberg Johansen, Magnús Halldór Gíslason, Silas Irby Pihl, Konstantinos D. Tsirigos, Ole Winther, Søren Brunak, Gunnar von Heijne, Henrik Nielsen
bioRxiv 2021.06.09.447770; doi: https://doi.org/10.1101/2021.06.09.447770
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
SignalP 6.0 achieves signal peptide prediction across all types using protein language models
Felix Teufel, José Juan Almagro Armenteros, Alexander Rosenberg Johansen, Magnús Halldór Gíslason, Silas Irby Pihl, Konstantinos D. Tsirigos, Ole Winther, Søren Brunak, Gunnar von Heijne, Henrik Nielsen
bioRxiv 2021.06.09.447770; doi: https://doi.org/10.1101/2021.06.09.447770

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4235)
  • Biochemistry (9136)
  • Bioengineering (6784)
  • Bioinformatics (24001)
  • Biophysics (12129)
  • Cancer Biology (9534)
  • Cell Biology (13778)
  • Clinical Trials (138)
  • Developmental Biology (7636)
  • Ecology (11702)
  • Epidemiology (2066)
  • Evolutionary Biology (15513)
  • Genetics (10644)
  • Genomics (14327)
  • Immunology (9483)
  • Microbiology (22841)
  • Molecular Biology (9090)
  • Neuroscience (48995)
  • Paleontology (355)
  • Pathology (1482)
  • Pharmacology and Toxicology (2570)
  • Physiology (3846)
  • Plant Biology (8331)
  • Scientific Communication and Education (1471)
  • Synthetic Biology (2296)
  • Systems Biology (6192)
  • Zoology (1301)