PT - JOURNAL ARTICLE AU - Peipei Wang AU - Bethany M. Moore AU - Nicholas L. Panchy AU - Fanrui Meng AU - Melissa D. Lehti-Shiu AU - Shin-Han Shiu TI - Factors influencing gene family size variation among related species in a plant family AID - 10.1101/270009 DP - 2018 Jan 01 TA - bioRxiv PG - 270009 4099 - http://biorxiv.org/content/early/2018/02/23/270009.short 4100 - http://biorxiv.org/content/early/2018/02/23/270009.full AB - Gene duplication and loss contribute to gene content differences as well as phenotypic divergence across species. However, the extent to which gene content varies among closely related plant species and the factors responsible for such variation remain unclear. Here, we used the Solanaceae family as a model to investigate differences in gene family size and the likely factors contributing to these differences. We found that genes in highly variable families have high turnover rate and tend to be involved in processes that have diverged between Solanaceae species, whereas genes in low-variability families tend to have housekeeping roles. In addition, genes in high-and low-variability gene families tend to be duplicated by tandem and whole genome duplication, respectively. This finding together with the observation that genes duplicated by different mechanisms experience different selection pressures suggests that duplication mechanism impacts gene family turnover. We explored using pseudogene number as a proxy for gene loss but discovered that a substantial number of pseudogenes are actually products of pseudogene duplication, contrary to the expectation that most plant pseudogenes are remnants of once-functional duplicates. Our findings reveal complex relationships between variation in gene family size, gene functions, duplication mechanism, and evolutionary rate. The patterns of lineage-specific gene family expansion within the Solanaceae provide the foundation for a better understanding of the genetic basis underlying phenotypic diversity in this economically important family.