RT Journal Article SR Electronic T1 The de novo reference genome and transcriptome assemblies of the wild tomato species Solanum chilense JF bioRxiv FD Cold Spring Harbor Laboratory SP 612085 DO 10.1101/612085 A1 Remco Stam A1 Tetyana Nosenko A1 Anja C. Hörger A1 Wolfgang Stephan A1 Michael Seidel A1 José M.M. Kuhn A1 Georg Haberer A1 Aurelien Tellier YR 2019 UL http://biorxiv.org/content/early/2019/04/26/612085.abstract AB Background Wild tomato species, like Solanum chilense, are important germplasm resources for enhanced biotic and abiotic stress resistance in tomato breeding. In addition, S. chilense serves as a model system to study adaptation of plants to drought and to investigate the evolution of seed banks. However to date, the absence of a well annotated reference genome in this compulsory outcrossing, very diverse species limits in-depth studies on the genes involved.Findings We generated ∼134 Gb of DNA and 157 Gb of RNA sequence data of S chilense, which yielded a draft genome with an estimated length of 914 Mb in total encoding 25,885 high-confidence (hc) predicted gene models, which show homology to known protein-coding genes of other tomato species. Approximately 71% (18,290) of the hc gene models are additionally supported by RNA-seq data derived from leaf tissue samples. A benchmarking with Universal Single-Copy Orthologs (BUSCO) analysis of predicted gene models retrieved 93.3% BUSCO genes, which is in the current range of high-quality genomes for non-inbred plants. To further verify the genome annotation completeness and accuracy, we manually inspected the NLR resistance gene family and assessed its assembly quality. We revealed the existence of unique gene families of NLRs to S. chilense. Comparative genomics analyses of S. chilense, cultivated tomato S. lycopersicum and its wild relative S. pennellii revealed similar levels of highly syntenic gene clusters between the three species.Conclusions We generated the first genome and transcriptome sequence assembly for the wild tomato species Solanum chilense and demonstrated its value in comparative genomics analyses. We make these genomes available for the scientific community as an important resource for studies on adaptation to biotic and abiotic stress in Solanaceae, on evolution of self-incompatibility, and for tomato breeding.