RT Journal Article SR Electronic T1 Single-molecule sequencing of long DNA molecules allows high contiguity de novo genome assembly for the fungus fly, Sciara coprophila JF bioRxiv FD Cold Spring Harbor Laboratory SP 2020.02.24.963009 DO 10.1101/2020.02.24.963009 A1 John M. Urban A1 Michael S. Foulk A1 Jacob E. Bliss A1 C. Michelle Coleman A1 Nanyan Lu A1 Reza Mazloom A1 Susan J. Brown A1 Allan C. Spradling A1 Susan A. Gerbi YR 2020 UL http://biorxiv.org/content/early/2020/02/25/2020.02.24.963009.abstract AB The lower Dipteran fungus fly, Sciara coprophila, has many unique biological features. For example, Sciara undergoes paternal chromosome elimination and maternal X chromosome nondisjunction during spermatogenesis, paternal X elimination during embryogenesis, intrachromosomal DNA amplification of DNA puff loci during larval development, and germline-limited chromosome elimination from all somatic cells. Paternal chromosome elimination in Sciara was the first observation of imprinting, though the mechanism remains a mystery. Here, we present the first draft genome sequence for Sciara coprophila to take a large step forward in aiding these studies. We approached assembling the Sciara genome using multiple sequencing technologies: PacBio, Oxford Nanopore MinION, and Illumina. To find an optimal assembly using these datasets, we generated 44 Illumina assemblies using 7 short-read assemblers and 50 long-read assemblies of PacBio and MinION sequence data using 6 long-read assemblers. We ranked assemblies using a battery of reference-free metrics, and scaffolded a subset of the highest-ranking assemblies using BioNano Genomics optical maps. RNA-seq datasets from multiple life stages and both sexes facilitated genome annotation. Moreover, we anchored nearly half of the Sciara genome sequence into chromosomes. Finally, we used the signal level of both the PacBio and Oxford Nanopore data to explore the presence or absence of DNA modifications in the Sciara genome since DNA modifications may play a role in imprinting in Sciara, as they do in mammals. These data serve as the foundation for future research by the growing community studying the unique features of this emerging model system.