Abstract
Protein compartments represent an important strategy for subcellular spatial control and compartmentalization. Encapsulins are a class of microbial protein compartments defined by the viral HK97-fold of their capsid protein, self-assembly into icosahedral shells, and dedicated cargo loading mechanism for sequestering specific enzymes. Encapsulins are often misannotated and traditional sequence-based searches yield many false positive hits in the form of phage capsids. This has hampered progress in understanding the distribution and functional diversity of encapsulins. Here, we develop an integrated search strategy to carry out a large-scale computational analysis of prokaryotic genomes with the goal of discovering an exhaustive and curated set of all HK97-fold encapsulin-like systems. We report the discovery and analysis of over 6,000 encapsulin-like systems in 31 bacterial and 4 archaeal phyla, including two novel encapsulin families as well as many new operon types that fall within the two already known families. We formulate hypotheses about the biological functions and biomedical relevance of newly identified operons which range from natural product biosynthesis and stress resistance to carbon metabolism and anaerobic hydrogen production. We conduct an evolutionary analysis of encapsulins and related HK97-type virus families and show that they share a common ancestor. We conclude that encapsulins likely evolved from HK97-type bacteriophages. Our study sheds new light on the evolutionary interplay of viruses and cellular organisms, the recruitment of protein folds for novel functions, and the functional diversity of microbial protein organelles.
Competing Interest Statement
The authors have declared no competing interest.