RT Journal Article SR Electronic T1 Phylogenomic Testing of Root Hypotheses Without a Species Tree JF bioRxiv FD Cold Spring Harbor Laboratory SP 758581 DO 10.1101/758581 A1 Fernando D. K. Tria A1 Giddy Landan A1 Tal Dagan YR 2019 UL http://biorxiv.org/content/early/2019/09/05/758581.abstract AB The determination of the last common ancestor (LCA) of a group of species plays a vital role in evolutionary theory. Traditionally, an LCA is inferred by the rooting of a fully resolved species tree. From a theoretical perspective, however, inference of the LCA amounts to the reconstruction of just one branch - the root branch - of the true unrooted species tree, and should therefore be a much easier task than the full resolution of the species tree. Discarding the reliance on a hypothesised species tree and its rooting leads us to re-evaluate what phylogenetic signal is directly relevant to LCA inference, and to recast the task as that of sampling the total evidence from all gene families at the genomic scope. Here we reformulate LCA and root inference in the framework of statistical hypothesis testing and outline an analytical procedure to formally test competing a-priori LCA hypotheses and to infer confidence sets for the earliest speciation events in the history of a group of species. Applying our methods to three demonstrative datasets we show that our inference of the metazoan LCA as well as the cyanobacterial LCA is well in agreement with the common knowledge. Inference of the proteobacteria LCA shows that it is most closely related to modern Epsilonproteobacteria, hence it was most likely characterized by a chemolithoautotrophic and anaerobic life-style. Our inference is based on data comprising between 43% (opisthokonta) and 86% (proteobacteria) of all gene families. Approaching LCA inference within a statistical framework thus renders the phylogenomic inference powerful and robust.