TY - JOUR T1 - Phylogenetic weighting does little to improve the accuracy of evolutionary coupling analyses JF - bioRxiv DO - 10.1101/736173 SP - 736173 AU - Adam J. Hockenberry AU - Claus O. Wilke Y1 - 2019/01/01 UR - http://biorxiv.org/content/early/2019/08/15/736173.abstract N2 - Homologous sequence alignments contain important information about the constraints that shape protein family evolution. Correlated changes between different residues, for instance, can be highly predictive of physical contacts within three-dimensional structures. Detecting such co-evolutionary signals via direct coupling analysis is particularly challenging given xsthe shared phylogenetic history and uneven sampling of different lineages from which protein sequences are derived. Current best practices for mitigating such effects include sequence-identity-based weighting of input sequences and post-hoc re-scaling of evolutionary coupling scores. However, numerous weighting schemes have been previously developed for other applications, and it is unknown whether any of these schemes may better account for phylogenetic artifacts in evolutionary coupling analyses. Here, we show across a dataset of 150 diverse protein families that the current best practices out-perform several alternative sequence- and tree-based weighting methods. Nevertheless, we find that sequence weighting in general provides only a minor benefit relative to post-hoc transformations that re-scale the derived evolutionary couplings. While our findings do not rule out the possibility that an as-yet-untested weighting method may show improved results, the similar predictive accuracies that we observe across distinct weighting methods suggests that there may be little room for further improvement on top of existing strategies.HHweights derived via the method of Henikoff and Henikoff [44]GSCweights derived via the method of Gerstein et al. [43]ACLweights derived via the method of Altschul et al. [38]APCAverage Product Correction/edPPVPositive Predictive Value ER -