ABSTRACT
Microbes produce a plethora of secondary metabolites that although not essential for primary metabolism benefit them to survive in the environment, communicate, and influence differentiation. Biosynthetic gene clusters (BGCs) responsible for the production of these secondary metabolites are readily identifiable on the genome sequence of bacteria. Understanding the phylogeny and distribution of BGCs helps us to predict natural product synthesis ability of new isolates. Here, we examined the inter- and intraspecies patterns of absence/presence for all BGCs identified with antiSMASH 5.0 in 310 genomes from the B. subtilis group and assigned them to defined gene cluster families (GCFs). This allowed us to establish patterns in distribution for both known and unknown products. Further, we analyzed variations in the BGC structure of particular families encoding for natural products such as plipastatin, fengycin, iturin, mycosubtilin and bacillomycin. Our detailed analysis revealed multiple GCFs that are species or clade specific and few others that are scattered within or between species, which will guide exploration of the chemodiversity within the B. subtilis group. Uniquely, we discovered that partial deletion of BGCs and frameshift mutations in selected biosynthetic genes are conserved within phylogenetically related isolates, although isolated from around the globe. Our results highlight the importance of detailed analysis of BGCs and the remarkable phylogenetically conserved errodation of secondary metabolite biosynthetic potential in the B. subtilis group.
IMPORTANCE Members of the B. subtilis species complex are commonly recognized producers of secondary metabolites, among those the production of antifungals makes them promising biocontrol strains. However, while there are studies examining the distribution of well-known B. subtilis metabolites, this has not yet been systematically reported for the group. Here, we report the complete biosynthetic potential within the Bacillus subtilis group species to explore the distribution of the biosynthetic gene clusters and to provide an exhaustive phylogenetic conservation of secondary metabolite production supporting the chemodiversity of Bacilli. We identify that certain gene clusters acquired deletions of genes and particular frame-shift mutations rendering them inactive for secondary metabolite biosynthesis, a conserved genetic trait within phylogenetically conserved clades of certain species. The overview presented will superbly guide assigning the secondary metabolite production potential of newly isolated strains based on genome sequence and phylogenetic relatedness.
Competing Interest Statement
The authors have declared no competing interest.