Comparative genomics of six Juglans species reveals disease-associated gene family contractions

Plant J. 2020 Apr;102(2):410-423. doi: 10.1111/tpj.14630. Epub 2020 Jan 16.

Abstract

Juglans (walnuts), the most speciose genus in the walnut family (Juglandaceae), represents most of the family's commercially valuable fruit and wood-producing trees. It includes several species used as rootstock for their resistance to various abiotic and biotic stressors. We present the full structural and functional genome annotations of six Juglans species and one outgroup within Juglandaceae (Juglans regia, J. cathayensis, J. hindsii, J. microcarpa, J. nigra, J. sigillata and Pterocarya stenoptera) produced using BRAKER2 semi-unsupervised gene prediction pipeline and additional tools. For each annotation, gene predictors were trained using 19 tissue-specific J. regia transcriptomes aligned to the genomes. Additional functional evidence and filters were applied to multi-exonic and mono-exonic putative genes to yield between 27 000 and 44 000 high-confidence gene models per species. Comparison of gene models to the BUSCO embryophyta dataset suggested that, on average, genome annotation completeness was 85.6%. We utilized these high-quality annotations to assess gene family evolution within Juglans, and among Juglans and selected Eurosid species. We found notable contractions in several gene families in J. hindsii, including disease resistance-related wall-associated kinase (WAK), Catharanthus roseus receptor-like kinase (CrRLK1L) and others involved in abiotic stress response. Finally, we confirmed an ancient whole-genome duplication that took place in a common ancestor of Juglandaceae using site substitution comparative analysis.

Keywords: Juglans; Juglandaceae; comparative genomics; crop genomics; disease resistance; genome annotation; phenology; walnut; whole-genome duplication; wingnut.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Disease Resistance / genetics
  • Genome, Plant / genetics*
  • Genomics*
  • Juglans / genetics*
  • Juglans / physiology
  • Stress, Physiological
  • Transcriptome*