PT - JOURNAL ARTICLE AU - Aleksia Vaattovaara AU - Benjamin Brandt AU - Sitaram Rajaraman AU - Omid Safronov AU - Andres Veidenberg AU - Markéta Luklová AU - Jaakko Kangasjärvi AU - Ari Löytynoja AU - Michael Hothorn AU - Jarkko Salojärvi AU - Michael Wrzaczek TI - Mechanistic insights into the evolution of DUF26-containing proteins in land plants AID - 10.1101/493502 DP - 2018 Jan 01 TA - bioRxiv PG - 493502 4099 - http://biorxiv.org/content/early/2018/12/11/493502.short 4100 - http://biorxiv.org/content/early/2018/12/11/493502.full AB - Large protein families are a prominent feature of plant genomes and their size variation is a key element for adaptation in plants. Here we infer the evolutionary history of a representative protein family, the DOMAIN OF UNKNOWN FUNCTION (DUF) 26-containing proteins. The DUF26 first appeared in secreted proteins. Domain duplications and rearrangements led to the emergence of CYSTEINE-RICH RECEPTOR-LIKE PROTEIN KINASES (CRKs) and PLASMODESMATA-LOCALIZED PROTEINS (PDLPs). While the DUF26 itself is specific to land plants, structural analyses of Arabidopsis PDLP5 and PDLP8 ectodomains revealed strong similarity to fungal lectins. Therefore, we propose that DUF26-containing proteins constitute a novel group of plant carbohydrate-binding proteins. Following their appearance, CRKs expanded both through tandem duplications and preferential retention of duplicates in whole genome duplication events, whereas PDLPs evolved according to the dosage balance hypothesis. Based on our findings, we suggest that the main mechanism of expansion in new gene families is small-scale duplication, whereas genome fractionation and genetic drift after whole genome multiplications drive families towards dosage balance.