TY - JOUR T1 - Non-phylogenetic identification of co-evolving genes for reconstructing the archaeal Tree of Life JF - bioRxiv DO - 10.1101/2020.10.16.343293 SP - 2020.10.16.343293 AU - L. Thibério Rangel AU - Shannon M. Soucy AU - João C. Setubal AU - Johann Peter Gogarten AU - Gregory P. Fournier Y1 - 2021/01/01 UR - http://biorxiv.org/content/early/2021/01/13/2020.10.16.343293.abstract N2 - Assessing the phylogenetic compatibility between individual gene families is a crucial and often computationally demanding step in many phylogenomics analyses. Here we describe the Evolutionary Similarity Index (IES) to assess shared evolution between gene families using a weighted Orthogonal Distance Regression applied to sequence distances. This approach allows for straightforward pairing of paralogs between co-evolving gene families without resorting to multiple tests, or a priori assumptions of molecular interactions between protein products from assessed genes. The utilization of pairwise distance matrices, while less informative than phylogenetic trees, circumvents error-prone comparisons between trees whose topologies are inherently uncertain. Analyses of simulated gene family evolution datasets showed that IES was more accurate and less susceptible to noise than popular tree-based methods (Robinson-Foulds and geodesic distance) for assessing evolutionary signal compatibility, since it bypasses phylogenetic reconstruction and its inherent uncertainty. Applying IES to a real dataset of 1,322 genes from 42 archaeal genomes identified eight major clusters of gene families with compatible evolutionary trends. Four of these clusters included genes with a taxonomic distribution across all archaeal phyla, while other clusters included a subset of taxa that do not map to generally accepted archaeal clades, indicating possible shared horizontal transfers by clustered gene families. We identify one strongly connected set of 62 genes from the same cluster, occurring as both single-copy and multiple homologs per genome, with compatible phylogenetic reconstructions closely matching previously published species trees for Archaea. An IES implementation is available at https://github.com/lthiberiol/evolSimIndex.Competing Interest StatementThe authors have declared no competing interest. ER -