NoD: a Nucleolar localization sequence detector for eukaryotic and viral proteins

BMC Bioinformatics. 2011 Aug 3:12:317. doi: 10.1186/1471-2105-12-317.

Abstract

Background: Nucleolar localization sequences (NoLSs) are short targeting sequences responsible for the localization of proteins to the nucleolus. Given the large number of proteins experimentally detected in the nucleolus and the central role of this subnuclear compartment in the cell, NoLSs are likely to be important regulatory elements controlling cellular traffic. Although many proteins have been reported to contain NoLSs, the systematic characterization of this group of targeting motifs has only recently been carried out.

Results: Here, we describe NoD, a web server and a command line program that predicts the presence of NoLSs in proteins. Using the web server, users can submit protein sequences through the NoD input form and are provided with a graphical output of the NoLS score as a function of protein position. While the web server is most convenient for making prediction for just a few proteins, the command line version of NoD can return predictions for complete proteomes. NoD is based on our recently described human-trained artificial neural network predictor. Through stringent independent testing of the predictor using available experimentally validated NoLS-containing eukaryotic and viral proteins, the NoD sensitivity and positive predictive value were estimated to be 71% and 79% respectively.

Conclusions: NoD is the first tool to provide predictions of nucleolar localization sequences in diverse eukaryotes and viruses. NoD can be run interactively online at http://www.compbio.dundee.ac.uk/nod or downloaded to use locally.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Amino Acid Sequence
  • Animals
  • Cell Nucleolus / metabolism*
  • Humans
  • Neural Networks, Computer*
  • Nuclear Proteins / chemistry*
  • Proteomics / methods*
  • Regulatory Sequences, Nucleic Acid
  • Viral Proteins / metabolism*

Substances

  • Nuclear Proteins
  • Viral Proteins