RT Journal Article SR Electronic T1 Factors influencing gene family size variation among related species in a plant family JF bioRxiv FD Cold Spring Harbor Laboratory SP 270009 DO 10.1101/270009 A1 Peipei Wang A1 Bethany M. Moore A1 Nicholas L. Panchy A1 Fanrui Meng A1 Melissa D. Lehti-Shiu A1 Shin-Han Shiu YR 2018 UL http://biorxiv.org/content/early/2018/02/23/270009.abstract AB Gene duplication and loss contribute to gene content differences as well as phenotypic divergence across species. However, the extent to which gene content varies among closely related plant species and the factors responsible for such variation remain unclear. Here, we used the Solanaceae family as a model to investigate differences in gene family size and the likely factors contributing to these differences. We found that genes in highly variable families have high turnover rate and tend to be involved in processes that have diverged between Solanaceae species, whereas genes in low-variability families tend to have housekeeping roles. In addition, genes in high-and low-variability gene families tend to be duplicated by tandem and whole genome duplication, respectively. This finding together with the observation that genes duplicated by different mechanisms experience different selection pressures suggests that duplication mechanism impacts gene family turnover. We explored using pseudogene number as a proxy for gene loss but discovered that a substantial number of pseudogenes are actually products of pseudogene duplication, contrary to the expectation that most plant pseudogenes are remnants of once-functional duplicates. Our findings reveal complex relationships between variation in gene family size, gene functions, duplication mechanism, and evolutionary rate. The patterns of lineage-specific gene family expansion within the Solanaceae provide the foundation for a better understanding of the genetic basis underlying phenotypic diversity in this economically important family.