Twenty thousand ORFan microbial protein families for the biologist?

Structure. 2003 Jan;11(1):7-9. doi: 10.1016/s0969-2126(02)00938-3.

Abstract

The genomes of most newly sequenced organisms contain a significant fraction of ORFs (open reading frames) that match no other sequence in the databases. We refer to these singleton ORFs as sequence ORFans. Because little can be learned about ORFans by homology, the origin and functions of ORFans remain a mystery. However, in this era of full genome sequencing, it seems that ORFans have been underemphasized. In this minireview, we draw attention to the increasing number of ORFans and to the consequences of this growth to biological research in the postgenomic era.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Animals
  • Genome, Bacterial*
  • Genome, Fungal*
  • Open Reading Frames*
  • Proteins / classification
  • Proteins / genetics

Substances

  • Proteins