Abstract
Recent technological developments have made genome sequencing and assembly accessible to many groups. However, certain genomic features of the sequenced organisms, such as high heterozygosity, polyploidy, aneuploidy, or heterokaryosis, can challenge current standard assembly procedures and result in highly fragmented assemblies. Hence, we hypothesized that genome databases must contain a non-negligible fraction of low-quality assemblies that result from such intrinsic genomic factors. Here we present Karyon, a Python-based toolkit that uses raw sequencing data and a de novo genome assembly to assess several parameters and generate informative plots that assist in the identification of non-canonical genomic traits. Karyon includes automated de novo genome assembly and variant calling pipelines. We tested Karyon by diagnosing 35 highly fragmented, publicly available assemblies from 19 different Mucorales (Fungi) species. Our results show that 6 (17%) of the assemblies presented signs of unusual genomic configurations, suggesting that such configurations are common, at least within the Fungi.
Competing Interest Statement
The authors have declared no competing interest.