RT Journal Article SR Electronic T1 Hidden diversity and macroevolutionary mode of Caulimoviridae uncovered by euphyllophyte paleoviruses JF bioRxiv FD Cold Spring Harbor Laboratory SP 170415 DO 10.1101/170415 A1 Zhen Gong A1 Guan-Zhu Han YR 2017 UL http://biorxiv.org/content/early/2017/07/31/170415.abstract AB Few viruses have been documented in plants outside angiosperms. Endogenous viral elements (paleoviruses) provide ‘molecular fossils’ for studying the deep history and macroevolution of viruses. Endogenous plant pararetroviruses (EPRVs) are widespread across angiosperms, but little is known about EPRVs in earlier branching plants. Here we use a large-scale phylogenomic approach to investigate the diversity and macroevolution of plant pararetroviruses (formally known as Caulimoviridae). We uncover an unprecedented and unappreciated diversity of EPRVs in the genomes of gymnosperms and ferns. The known angiosperm viruses only constitute a minor part of the Caulimoviridae diversity. By characterizing the distribution of EPRVs, we show that no major euphyllophyte lineages escape the activity of Caulimoviridae, raising the possibility that many exogenous Caulimoviridae remain to be discovered in euphyllophytes. We find that the copy numbers of EPRVs are generally high, suggesting that EPRVs define a unique group of repetitive elements and represent major components of euphyllophyte genomes. Phylogenetic analyses reveal an ancient monilophyte origin of Caulimoviridae and at least three independent origins of Caulimoviridae in angiosperms by cross-division transmissions. Our findings uncover the remarkable diversity of Caulimoviridae and have important implications in understanding the origin and macroevolution of plant pararetroviruses.