Abstract
Azolla is a genus of freshwater ferns that is economically important as a nitrogen-fixing biofertilizer, biofuel, bioremediator, and for potential carbon sequestration, but also contains weedy invasive species. In California, only two species are currently recognized but there may be up to six putative species, with the discrepancy being due to the difficulty in identifying taxa, hybridization, and the introduction of non-native species. Here, we report a new haplotype-resolved, chromosome-level assembly of Azolla caroliniana as part of the California Conservation Genomics Project (CCGP), using a combination of PacBio HiFi and Omni-C sequencing technologies. The assembly is 521 Mb in length, with a contig N50 of 1.6 Mb, and is scaffolded into 22 pseudo-chromosomes. The BUSCO completeness score is 87.5%, making it the most complete and most contiguous Azolla assembly to date. In combination with the previously published A. filiculoides genome, this A. caroliniana genome will be a powerful tool for understanding the population genetics and taxonomy of one of the most cryptic, economically important, and poorly circumscribed fern taxa, and for facilitating land plant genomics more broadly.
Competing Interest Statement
The authors have declared no competing interest.
Data Availability
Data generated for this study are available under NCBI BioProject PRJNA720569. Raw sequencing data for sample PRJNA777144 (NCBI BioSample SAMN41564273) are deposited in the NCBI Short Read Archive (SRA). Assembly scripts and other data for the analyses presented can be found at the following GitHub repository: www.github.com/ccgproject/ccgp_assembly