Abstract
Solanum habrochaites LA1353 is a self-incompatible, highly heterozygous wild tomato that is a useful germplasm resource for the study of metabolism, reproduction and disease resistance. We generated a draft genome assembly with PacBio HiFi reads and genome annotations, which underscored the expansion of gene families associated with metabolite-production, self-incompatibility, DNA regulation and immunity. After manually curating intracellular nucleotide-binding leucine-rich repeat immune receptors (NLRs), we found that S. habrochaites LA1353 has a larger NLR inventory than other wild tomato species. A great number of heterozygous local copy number variations (CNVs) driven by haplotypic structural variations further expands the inventory, both enhancing NLR diversity and providing more opportunities for sequence evolution. The NLRs associated with local CNVs predominantly appear in the helper NLR (NRC)-related phylogenetic clades and are concentrated in a few physical NLR gene clusters. Synteny analysis points out that these genomic regions correspond to the known NLR clusters from which experimentally validated, functional NLRs, such as Hero, Mi-1.2 and Rpi-amr1, have been identified. Producing and incorporating Resistance Gene Enrichment Sequencing (RenSeq) data across wild tomato species, we reveal that the regions with local CNVs might have been shaped nearly equally by recent NLR gains and losses, along with enhanced sequence diversification that diminishes one-to-one orthology between heterozygous alleles. Our analysis suggests that these genomic regions may have accelerated evolutionary dynamics for NLR diversity generation in S. habrochaites LA1353.
Competing Interest Statement
The authors have declared no competing interest.