Generalizing Bayesian phylogenetics to infer shared evolutionary events

Jamie R. Oaks; Perry L. Wood; Cameron D. Siler; Rafe M. Brown

doi:10.1101/2021.07.23.453597

Abstract

Many processes of biological diversification can simultaneously affect multiple evolutionary lineages. Examples include multiple members of a gene family diverging when a region of a chromosome is duplicated, multiple viral strains diverging at a “super-spreading” event, and a geological event fragmenting whole communities of species. It is difficult to test for patterns of shared divergences predicted by such processes, because all phylogenetic methods assume that lineages diverge independently. We introduce a Bayesian phylogenetic approach to relax the assumption of independent, bifurcating divergences by expanding the space of topologies to include trees with shared and multifurcating divergences. This allows us to jointly infer phylogenetic relationships, divergence times, and patterns of divergences predicted by processes of diversification that affect multiple evolutionary lineages simultaneously or lead to more than two descendant lineages. Using simulations, we find the new method accurately infers shared and multifurcating divergence events when they occur, and performs as well as current phylogenetic methods when divergences are independent and bifurcating. We apply our new approach to genomic data from two genera of geckos from across the Philippines to test if past changes to the islands’ landscape caused bursts of speciation. Unlike our previous analyses restricted to only pairs of gecko populations, we find evidence for patterns of shared divergences. By generalizing the space of phylogenetic trees in a way that is independent from the likelihood model, our approach opens many avenues for future research into processes of diversification across the life sciences.

Significance statement

Phylogenetic models have long assumed that lineages diverge independently. Processes of diversification that are of interest in biogeography, epidemiology, and genome evolution violate this assumption by affecting multiple evolutionary lineages. To relax the assumption of independent divergences and infer patterns of divergences predicted by such processes, we introduce a new way of conceptualizing, modeling, and inferring phylogenetic trees. We apply the new approach to genomic data from geckos distributed across the Philippines, and find support for patterns of shared divergences predicted by repeated fragmentation of the archipelago by interglacial rises in sea level.

1 Introduction

There are many processes of biological diversification that affect multiple evolutionary lineages, generating patterns of temporally clustered divergences across the tree of life. Understanding such processes of diversification has important implications across many fields and scales of biology. At the scale of genome evolution, the duplication of a chromosome segment harboring multiple members of a gene family causes multiple, simultaneous (or “shared”) divergences across the phylogenetic history of the gene family (Doyle and Egan, 2010; Jiao et al., 2011; Clark and Donoghue, 2017; Li et al., 2018). In epidemiology, when a pathogen is spread by multiple infected individuals at a social gathering, this will create shared divergences across the pathogen’s “transmission tree” (Pybus and Rambaut, 2009; Ypma et al., 2013; Klinkenberg et al., 2017). If one of these individuals infects two or more others, this will create a multifurcation (a lineage diverging into three or more descendants) in the transmission tree. At regional or global scales, when biogeographic processes fragment communities, this can cause shared divergences across multiple affected species (Hickerson et al., 2006; Leaché et al., 2007; Plouviez et al., 2009; Voje et al., 2009; Daza et al., 2010; Barber and Klicka, 2010). If the landscape is fragmented into three or more regions, this can also cause multifurcations (Hoelzer and Melnick, 1994). For example, the repeated fragmentation of the Philippines by interglacial rises in sea level since the late Pliocene (Haq et al., 1987; Rohling et al., 1998; Siddall et al., 2003; Miller et al., 2005; Spratt and Lisiecki, 2016) has been an important model to help explain remarkably high levels of microendemism and biodiversity across the archipelago (Inger, 1954; Heaney, 1985; Brown and Guttman, 2002; Evans et al., 2003; Heaney et al., 2005; Roberts, 2006; Linkem et al., 2010; Siler et al., 2010, 2011, 2012; Brown and Siler, 2014). This model predicts that recently diverged taxa across the islands should have (potentially multifurcating) divergence times clustered around the beginning of interglacial periods. We are limited in our ability to infer patterns of divergences predicted by such processes, because phylogenetic methods assume lineages diverge independently.

To formalize this assumption of independent divergences and develop ways to relax it, it is instructive to view phylogenetic inference as an exercise of statistical model selection where each topology is a separate model (Yang, 1994; Yang et al., 1995; Suchard et al., 2001). Current methods for estimating rooted phylogenies with N tips only consider tree models with N – 1 bifurcating divergences, and assume these divergences are independent, conditional on the topology (see Lewis et al., 2005, for multifurcations in unrooted trees). If, in the history leading to the tips we are studying, diversification processes affected multiple lineages simultaneously or caused them to diverge into more than two descendants, the true tree could have shared or multifurcating divergences. This would make current phylogenetic models with N – 1 independent divergence times over-parameterized, introducing unnecessary error (Figure 1). Even worse, with current methods, we lack an obvious way of using our data to test for patterns of shared or multifurcating divergences predicted by such processes.
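To give a sense of the size of the model space that current methods consider, the following minimal Python sketch counts the rooted, bifurcating, labeled topologies for N tips using the standard double-factorial result, (2N − 3)!!. It counts topologies only (not orderings of divergence times), and the function name is ours, chosen for illustration.

```python
def num_rooted_bifurcating_topologies(n_tips: int) -> int:
    """Number of rooted, bifurcating, labeled topologies: (2N - 3)!!."""
    if n_tips < 1:
        raise ValueError("need at least one tip")
    count = 1
    # Multiply the odd numbers 3, 5, ..., 2N - 3 (empty product when N <= 2).
    for odd in range(3, 2 * n_tips - 2, 2):
        count *= odd
    return count


for n in (3, 4, 9):
    # Each of these topologies carries N - 1 independent divergence times.
    print(n, num_rooted_bifurcating_topologies(n), n - 1)
```

Each topology counted here is a separate model with N – 1 divergence-time parameters; trees with shared or multifurcating divergences fall outside this set.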

Figure 1.

An example evolutionary history with shared divergences (left), and the benefits of generalizing tree space under such conditions (right). Current methods are restricted to one class of tree models, where the tree is fully bifurcating and independent divergence-time parameters are estimated for all internal nodes (center). Figure made using Gram (Version 4.0.0; Foster, 2018) and the P4 phylogenetic toolkit (Version 1.4 5742542; Foster, 2004). Middle three lizard silhouettes from pixabay.com, and others from phylopic.org; all licensed under the Creative Commons (CC0) Public Domain Dedication.

We relax the assumption of independent, bifurcating divergences by introducing a Bayesian approach to generalizing the space of tree models to allow for shared and multifurcating divergences. In our approach, we view trees with N – 1 bifurcating divergences as only one class of tree models in a greater space of trees with anywhere from 1 to N – 1 potentially shared and/or multifurcating divergences (Figure S1). We introduce reversible-jump Markov chain Monte Carlo algorithms (Metropolis et al., 1953; Hastings, 1970; Green, 1995) to sample this generalized space of trees, allowing us to jointly infer evolutionary relationships, shared and multifurcating divergences, and divergence times. We couple these algorithms with a likelihood model for directly calculating the probability of biallelic characters given a population (or species) phylogeny, while analytically integrating over all possible gene trees under a coalescent model and all possible mutational histories under a finite-sites model of character evolution (Bryant et al., 2012; Oaks, 2019). Using simulations, we find the generalized tree model accurately infers shared and multifurcating divergences while maintaining a low rate of falsely inferring such divergences. To test for patterns of shared and multifurcating divergences predicted by repeated fragmentation of the Philippines by interglacial rises in sea level (Oaks et al., 2013; Brown et al., 2013; Oaks et al., 2019), we apply the generalized tree model to genomic data from two genera of geckos codistributed across the islands.

2 Results

2.1 Simulations on fixed trees

The generalized tree model (M_G) sampled trees significantly closer (Robinson and Foulds, 1979; Kuhner and Felsenstein, 1994) to the true tree than an otherwise equivalent model that assumes independent, bifurcating divergences (M_IB), when applied to 100 data sets simulated along the species tree in Figure 2A, each with 50,000 unlinked biallelic characters (Figure 2B). From these simulated data, the generalized model consistently inferred the correct shared and multifurcating divergences with high posterior probabilities (Figure 2C). Unlike the independent-bifurcating model, the generalized approach avoids strong support for nonexistent branches that spuriously split truly multifurcating nodes (Figure 2D). Under both models, analyzing only the variable characters causes a reduction in tree accuracy (Figure 2B), but yields similar posterior probabilities for shared and multifurcating divergences (Figure 2C).
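The branch-length distance used in these comparisons (Kuhner and Felsenstein, 1994) is the square root of the sum of squared differences in branch lengths between two trees. The sketch below is one simple way to compute it, assuming each tree is stored as a dictionary that maps a clade (the set of tips descending from a branch) to that branch's length; a branch missing from one tree is treated as having length zero. The data structure, function name, and toy trees are illustrative assumptions, not the implementation used for the analyses.

```python
import math


def branch_score_distance(tree_a, tree_b):
    """Square root of the sum of squared branch-length differences.

    Each tree is a dict mapping a frozenset of tip labels (a clade) to the
    length of the branch subtending that clade. A branch that is absent from
    one tree contributes its full length to the difference.
    """
    clades = set(tree_a) | set(tree_b)
    squared = ((tree_a.get(c, 0.0) - tree_b.get(c, 0.0)) ** 2 for c in clades)
    return math.sqrt(sum(squared))


# Toy true tree: a trifurcation of tips A, B, and C.
true_tree = {
    frozenset("A"): 0.10,
    frozenset("B"): 0.10,
    frozenset("C"): 0.10,
}
# Toy sampled tree that spuriously resolves the trifurcation as ((A,B),C).
sampled_tree = {
    frozenset("A"): 0.08,
    frozenset("B"): 0.08,
    frozenset("AB"): 0.02,
    frozenset("C"): 0.10,
}
print(branch_score_distance(true_tree, sampled_tree))
```

In this toy case, the spurious internal branch and the two shortened tip branches all add to the distance, which is why arbitrarily resolving a true multifurcation is penalized.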

Figure 2.

Results of analyses of 100 data sets, each with 50,000 biallelic characters simulated on the species tree shown in (A) with divergence times in units of expected substitutions per site. (B) The square root of the sum of squared differences in branch lengths between the true tree and each posterior tree sample (Kuhner and Felsenstein, 1994); the point and bars represent the posterior mean and equal-tailed 95% credible interval, respectively. P-values are shown for Wilcoxon signed-rank tests (Wilcoxon, 1945) comparing the paired differences in tree distances between methods. (C) Violin plots of the posterior probabilities of each node and shared divergence in the true tree across the 100 simulated data sets. (D) Violin plots of the most probable incorrect root node and most probable of the three incorrect splittings of the t₃ and t₄ multifurcations. For each simulation, the mutation-scaled effective population size (N_eμ) was drawn from a gamma distribution (shape = 20, mean = 0.001) and shared across all the branches of the tree; this distribution was used as the prior in analyses. Tree plotted using Gram (Version 4.0.0, Commit 02286362; Foster, 2018) and the P4 phylogenetic toolkit (Version 1.4, Commit d9c8d1b1; Foster, 2004). Other plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).
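Because the gamma distribution above is specified by its shape and mean, the following short Python sketch shows the conversion to the shape-and-scale parameterization used by many libraries (scale = mean / shape) and draws values of N_eμ with the standard library. The seed and number of draws are arbitrary choices for illustration.

```python
import random
import statistics

SHAPE = 20.0
MEAN = 0.001
SCALE = MEAN / SHAPE  # the mean of a gamma distribution is shape * scale

rng = random.Random(1)  # arbitrary seed, for reproducibility only
# random.gammavariate takes the shape first and the scale second.
draws = [rng.gammavariate(SHAPE, SCALE) for _ in range(10_000)]

print(statistics.mean(draws))   # should be close to MEAN
print(statistics.stdev(draws))  # theoretical value is MEAN / sqrt(SHAPE)
```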

When applied to data sets of 50,000 characters simulated along a tree with independent, bifurcating divergences (Figure 3A), both the M_G and M_IB models consistently inferred the correct topology with strong support (Figure 3B), and the M_G method did not support incorrect shared or multifurcating divergences (Figure 3C). This was true whether all the characters or only the variable characters were analyzed (Figure 3B&C). Looking at the distances (Robinson and Foulds, 1979; Kuhner and Felsenstein, 1994) between the trees from the posterior samples and the true tree, there is no difference between the M_G and M_IB models when the true tree has only independent, bifurcating divergences (Figure 3D). For both models, using all the characters yields posterior samples of more accurate trees than only analyzing variable characters (Figure 3D).

Figure 3.

Results of analyses of 100 data sets, each with 50,000 biallelic characters simulated on the species tree shown in (A) with divergence times in units of expected substitutions per site. (B) The posterior probability of the true topology. (C) The posterior probability of incorrectly shared or multifurcating nodes. (D) The square root of the sum of squared differences in branch lengths between the true tree and each posterior tree sample (Kuhner and Felsenstein, 1994); the point and bars represent the posterior mean and equal-tailed 95% credible interval, respectively. P-values are shown for Wilcoxon signed-rank tests (Wilcoxon, 1945) comparing the paired differences in tree distances between methods. For each simulation, the mutation-scaled effective population size (N_eμ) was drawn from a gamma distribution (shape = 20, mean = 0.001) and shared across all the branches of the tree; this distribution was used as the prior in analyses. Tree plotted using Gram (Version 4.0.0, Commit 02286362; Foster, 2018) and the P4 phylogenetic toolkit (Version 1.4, Commit d9c8d1b1; Foster, 2004). Other plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

2.2 Simulations on random trees

When we simulated 100 data sets (each with nine species and 50,000 characters) where the true tree and divergence times were randomly drawn from the generalized tree distribution (M_G), we again found that M_G performed better than M_IB at inferring the correct tree and divergence times (Figure 4A), and generally recovered true shared and multifurcating divergences with moderate to strong support (Figure 4B&C). When the tree and divergence times were randomly drawn from an independent, bifurcating tree model (M_IB), the generalized model performed similarly to the true model (Figure S2).

Figure 4.

The performance of the M_G and M_IB tree models when applied to 100 data sets, each with 50,000 biallelic characters simulated on species trees randomly drawn from the M_G tree distribution. (A) The square root of the sum of squared differences in branch lengths between the true tree and each posterior tree sample (Kuhner and Felsenstein, 1994); the point and bars represent the posterior mean and equal-tailed 95% credible interval, respectively. P-values are shown for Wilcoxon signed-rank tests (Wilcoxon, 1945) comparing the paired differences in tree distances between methods. Violin plots show posterior probabilities of all true (B) shared divergences and (C) multifurcating nodes across all simulated trees. For each simulation, the mutation-scaled effective population size (N_eμ) was drawn from a gamma distribution (shape = 20, mean = 0.001) and shared across all the branches of the tree; this distribution was used as the prior in analyses. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Both the M_G and M_IB models accurately and precisely estimate the age of the root, tree length, and effective population size from the data sets simulated on random M_G and M_IB trees (top two rows of Figures S3, S4, and S5, respectively). Accuracy is similar with and without constant characters, but precision is higher when including constant characters.

2.3 The rate of falsely inferring shared divergences

To quantify the rate at which phycoeval incorrectly infers shared and/or multifurcating divergences, we used the results from the M_G analyses of the data sets simulated on random trees from the M_G and M_IB models. From the posterior sample of each analysis, we used sumphycoeval to calculate the proportion of samples that contained incorrectly merged neighboring divergence times. To do this, we merged all possible neighboring divergence times from the true tree, each of which creates a shared divergence or multifurcation, and counted how many posterior samples contained each divergence scenario. We found that phycoeval had a low false-positive rate for the simulated data; less than 1% (Figure 5A & S6) and 5% (Figure 5D & S7) of incorrectly merged divergence times had an approximate posterior probability greater than 0.5 when analyzing data simulated on trees sampled from the M_G and M_IB models, respectively. In all cases with moderate to strong support for falsely merged divergences, the difference in time between the merged divergences was small (< 0.005 expected substitutions per site; Figure 5B&E). There was no correlation between support for incorrectly merged divergences and their age (Figure 5C&F; the p-value for a t-test that Pearson’s correlation coefficient = 0 using all points with posterior probability > 0 was 0.11 and 0.25 for results from data simulated under M_G and M_IB, respectively).
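The bookkeeping behind this false-positive rate can be sketched as follows. The sketch assumes that, for each posterior sample, we already know which of the candidate (incorrect) merges of neighboring true divergence times it contains; identifying those merges in a sampled tree is the step performed by sumphycoeval and is not reproduced here. The function names, labels, and toy data are hypothetical.

```python
from collections import Counter


def merge_support(samples, candidate_merges):
    """Approximate posterior probability of each candidate merge.

    `samples` is a list with one set per posterior sample, holding the
    candidate merges found in that sample. `candidate_merges` is the set of
    all incorrect merges of neighboring divergence times in the true tree.
    """
    counts = Counter()
    for merges_in_sample in samples:
        counts.update(merges_in_sample & candidate_merges)
    return {m: counts[m] / len(samples) for m in candidate_merges}


def false_positive_rate(support, threshold=0.5):
    """Proportion of candidate merges with support above the threshold."""
    return sum(pp > threshold for pp in support.values()) / len(support)


# Toy example: a true tree with four divergence times has three neighboring pairs.
candidates = {("t1", "t2"), ("t2", "t3"), ("t3", "t4")}
posterior = [
    set(),
    {("t2", "t3")},
    {("t2", "t3")},
    {("t2", "t3"), ("t3", "t4")},
]
support = merge_support(posterior, candidates)
print(support)
print(false_positive_rate(support))
```

In this toy example, one of the three candidate merges has support above 0.5, so the false-positive rate is one third.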

Figure 5.

The M_G tree model has (A & D) a low false positive rate (FPR; the proportion of incorrectly merged divergence times with a posterior probability > 0.5) when applied to data simulated on trees drawn from the (A–C) M_G and (D–F) M_IB models. Support for incorrectly merged divergence times is high only when the difference between the times is small (B & E), and is not correlated with the age of the merged nodes; p-value = (C) 0.11 and (F) 0.25 for a t-test that Pearson’s correlation coefficient = 0 using all points with posterior probability > 0. Time units are expected substitutions per site. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

2.4 Convergence and mixing of MCMC chains

For all analyses of simulated data, the root age, tree length, and effective population size had a potential-scale reduction factor (PSRF; the square root of Equation 1.1 in Brooks and Gelman, 1998) less than 1.2 and effective sample size (ESS; Gong and Flegal, 2016) greater than 200. The average standard deviation of split frequencies (ASDSF) among the four MCMC chains was less than 0.017 for all analyses and less than 0.01 for most (Figure S8).
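As an illustration of the ASDSF diagnostic, the sketch below computes the standard deviation of each split's frequency across chains and averages over splits. It assumes each chain is summarized as a dictionary of split frequencies; the use of the sample standard deviation and the minimum-frequency cutoff of 0.1 are our assumptions for the example and may differ from the settings used in the analyses reported here.

```python
import statistics


def asdsf(chain_split_freqs, min_freq=0.1):
    """Average standard deviation of split frequencies across MCMC chains.

    `chain_split_freqs` has one dict per chain, mapping a split to its
    frequency in that chain's sample. Splits whose frequency is below
    `min_freq` in every chain are ignored (an assumed cutoff).
    """
    all_splits = set().union(*chain_split_freqs)
    deviations = []
    for split in all_splits:
        freqs = [chain.get(split, 0.0) for chain in chain_split_freqs]
        if max(freqs) >= min_freq:
            deviations.append(statistics.stdev(freqs))
    return statistics.mean(deviations) if deviations else 0.0


# Toy split frequencies from four chains (splits labeled by their tips).
chains = [
    {"AB": 1.0, "ABC": 0.62, "CD": 0.05},
    {"AB": 1.0, "ABC": 0.60},
    {"AB": 1.0, "ABC": 0.61, "CD": 0.04},
    {"AB": 1.0, "ABC": 0.59},
]
print(asdsf(chains))
```

Values near zero indicate that the chains agree on how often each split was sampled.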

Convergence and mixing were better under M_G than M_IB when applied to data sets simulated on trees with shared or multifurcating divergences (left column of Figure S8). When applied to data sets simulated with no shared or multifurcating divergences, MCMC performance was similar between M_G and M_IB (right column of Figure S8).

The MCMC settings used for M_G and M_IB are identical, except that the reversible-jump moves that add or remove divergence-time parameters are turned off under the latter model. Under the M_IB model, the tree topology is updated by several MCMC moves (see Sections 4.5 and 4.6.1 of the Supporting Information), which performed well when divergences were independent and bifurcating (Figure 3 and Figure S8). To further probe the improved MCMC behavior of M_G in the face of shared and multifurcating divergences, we re-ran the analyses under the M_IB model on the 100 data sets simulated along the tree in Figure 2A with more favorable MCMC settings. In these M_IB reanalyses, we ran the MCMC chains twice as long, sampled them half as frequently, and started them with the correct tree. The results were nearly identical to the original MCMC chains under the M_IB model (Figure S9), suggesting the improved mixing under M_G was not simply due to insufficient MCMC sampling effort under the M_IB model.

2.5 Simulations of linked characters

The multi-species coalescent likelihood we have coupled with our generalized tree model assumes each biallelic character is unlinked (i.e., each character evolved along a gene tree that was independent of other characters, conditional on the species tree; Bryant et al., 2012; Oaks, 2019). However, each locus comprising the gecko data sets we analyzed (see below) consists of approximately 90 contiguous nucleotides. To assess whether linked sites might bias our results, we repeated the simulations above, but with 500 loci, each with 100 linked characters. When all characters (variable and constant) are analyzed, results from the data sets simulated with linked characters are very similar to results from unlinked characters above (Figures S3–5 and 7 and S10–13). When all but one variable character per locus are discarded to avoid violating the assumption of unlinked characters, performance is greatly reduced due to the large loss of data (Figures S3–5 and 7 and S10–13). These results suggest the model is robust to linked characters and it is better to analyze all sites from multi-locus data sets, rather than reduce them to only one SNP per locus.
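To make the data loss from thinning concrete: with 500 loci of 100 characters each (50,000 characters in total), keeping one variable character per locus leaves at most 500 characters. The sketch below shows one way such thinning could be done, assuming each locus is a list of site columns of 0/1 allele states; choosing the retained site at random is our assumption, and the names and toy data are illustrative.

```python
import random


def is_variable(column):
    """A site is variable if more than one allele state is observed."""
    return len(set(column)) > 1


def one_variable_site_per_locus(loci, rng):
    """Keep a single, randomly chosen variable site from each locus.

    Loci with no variable sites contribute nothing to the thinned data.
    """
    kept = []
    for locus in loci:
        variable = [column for column in locus if is_variable(column)]
        if variable:
            kept.append(rng.choice(variable))
    return kept


# Toy data: three loci, each a list of site columns across four gene copies.
loci = [
    [(0, 0, 0, 0), (0, 1, 0, 1), (1, 1, 0, 0)],  # two variable sites
    [(1, 1, 1, 1), (0, 0, 0, 0)],                # no variable sites
    [(0, 0, 1, 0)],                              # one variable site
]
thinned = one_variable_site_per_locus(loci, random.Random(7))
print(sum(len(locus) for locus in loci), "sites reduced to", len(thinned))
```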

2.6 Testing for shared divergences in Philippine gekkonids predicted by glacial cycles

If the repeated fragmentation of the Philippines by interglacial rises in sea level generated pulses of speciation, taxa distributed across the archipelago should have divergence times clustered around the beginning of interglacial periods. We tested this prediction by applying our generalized tree model to RADseq data from species of Cyrtodactylus and Gekko collected from 27 and 26 locations across the islands, respectively (Tables S1 & S2). We analyzed each genus separately, because the rate of mutation differs between the genera, and phycoeval currently assumes a strict clock (though this is not required by the generalized tree model).

The maximum a posteriori (MAP) trees for both genera had 16 divergence times and weak to moderate support for five shared divergences (Figure 6; see Figures S14 & S15 and Table S3 for more details about shared divergences). The MAP trees of Cyrtodactylus and Gekko had three and two multifurcations, respectively. For both genera, two of the shared divergences involved three nodes, and of the remaining three that involved two nodes, one involved a trichotomous node (three descending lineages). There were no other strongly supported shared divergences that were not included in the MAP tree of either genus. Most of the shared and multifurcating divergences occurred after the late Pliocene (Figure 6 and Table S3), based on re-scaling the branch lengths of the posterior sample of trees from expected substitutions per site to millions of years using secondary calibrations (see methods).

Figure 6.

A summary of the generalized trees inferred from the (A–B) Cyrtodactylus and (C–D) Gekko RADseq data sets. The maximum a posteriori (MAP) tree is shown for both genera along with the approximate posterior probabilities of the number of divergences. Shared divergences in MAP trees indicated by dashed lines, with approximate posterior probabilities shown along the top. All clades (splits) had approximate posterior probabilities (PP) greater than 0.95 except for one indicated with a dot (PP = 0.89) within G. mindorensis (C). Approximate posterior probabilities of nodes shown in grey boxes for the root and multifurcating nodes. To illustrate timescale, branch lengths of posterior samples of trees were rescaled from expected substitutions per site to millions of years using secondary calibrations (see methods). Top photo of Cyrtodactylus sp. by CDS; bottom photo of Gekko sp. by Jason Fernandez & RMB. Created using ggplot2 (v3.3.5; Wickham, 2016), ggtree (v3.1.0; Yu et al., 2017), treeio (v1.17.0; Wang et al., 2019), deeptime (v0.0.6; Gearty, 2021), cowplot (v1.1.1; Wilke, 2020), and ggrepel (v0.9.1; Slowikowski, 2020). Links to nexus-formatted annotated trees: Cyrtodactylus & Gekko.

For both genera, the number of divergence times with the highest approximate posterior probability (0.33 for Cyrtodactylus and 0.32 for Gekko) was 17, and the 95% credible interval spanned 15–19 divergences (Figure 6). No trees with more than 22 divergence times were sampled for either genus, making the approximate posterior probability of 23 or more divergences less than 2.9×10⁻⁵ for both genera. The average standard deviation of split frequencies (0.0027 for Cyrtodactylus and 0.0009 for Gekko) and other statistics were consistent with the MCMC chains converging and mixing well (Table S4).
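The approximate posterior probabilities of the number of divergence times are sample frequencies, and a value that never appears in the posterior sample can only be bounded from above. The sketch below illustrates this with toy counts, assuming the bound is taken to be one over the number of posterior samples; under that reading, a bound of 2.9×10⁻⁵ would correspond to a sample on the order of 34,500 trees, although the sample size is not stated in this section. The function name and data are illustrative.

```python
from collections import Counter


def summarize_divergence_counts(n_divergences_per_sample):
    """Approximate posterior of the number of divergence times.

    Returns the sample frequency of each observed value, the most frequently
    sampled value, and an upper bound (1 / n) on the approximate posterior
    probability of any value that was never sampled.
    """
    n = len(n_divergences_per_sample)
    counts = Counter(n_divergences_per_sample)
    probs = {k: c / n for k, c in sorted(counts.items())}
    most_probable = max(probs, key=probs.get)
    return probs, most_probable, 1.0 / n


# Toy posterior sample: the number of divergence times in each sampled tree.
toy_sample = [16] * 3 + [17] * 5 + [18] * 2
probs, most_probable, bound = summarize_divergence_counts(toy_sample)
print(probs, most_probable, bound)
```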

3 Discussion

To relax the assumption that all processes of biological diversification affect evolutionary lineages independently, we introduced a generalized Bayesian phylogenetic approach to inferring phylogenies with shared and multifurcating divergences. Using simulations, we found this approach can accurately infer shared and multifurcating divergences from moderately sized data sets, while maintaining a low rate of incorrectly inferring such patterns of divergence. When we used the generalized approach to infer the evolutionary histories of two genera of gekkonid lizards across the Philippines, we found strong support against tree models assumed by current phylogenetic methods. The posterior probability of all trees with N – 1 independent, bifurcating divergences was less than 2.9×10⁻⁵ for both genera, suggesting that trees with shared and multifurcating divergences better explain the gekkonid sequence data. It will be interesting to see if such improvement in model fit is common as the generalized tree distribution is applied to more systems, regardless of the biological processes responsible (if any).

Despite greatly expanding the number of possible topologies, we saw better MCMC behavior under the M_G model (Figure S8), even when the M_IB chains were started with the true tree and run twice as long (Figure S9). This could be due to the generalized tree distribution providing more ways to traverse tree space. For example, when a posterior distribution restricted to trees with independent bifurcating divergences has multiple “peaks” associated with different topologies, the generalized distribution includes tree models that are special cases of these topologies. Explicitly including these “intermediate” trees could make the posterior less rugged and allow MCMC chains to more easily traverse tree space.

By accommodating multifurcations, our generalized tree approach helped avoid the “star-tree paradox,” where arbitrary resolutions of a true polytomy can be strongly supported (Figure 2D; Suzuki et al., 2002; Lewis et al., 2005). Lewis et al. (2005) found the same result by expanding the space of unrooted tree topologies to include multifurcations. Our results show that this solution to the star-tree paradox extends to rooted trees.

3.1 Robustness of coalescent models that assume unlinked characters

Our finding that the multi-species coalescent model of Bryant et al. (2012) is robust to linked characters is consistent with previous simulations using species trees with one and two tips (Oaks, 2019; Oaks et al., 2019, 2020). Our simulation results show that this robustness extends to larger trees with multifurcations and shared divergences, and suggest that discarding data to avoid linked characters can have a worse effect on inference than violating the assumption of unlinked characters. This is consistent with the findings of Chifman and Kubatko (2014) that quartet inference of splits in multi-species coalescent trees from SNP data was also robust to the violation of the assumption that characters are unlinked.

3.2 Diversification of Philippine gekkonid lizards

How the 7,100 islands of the Philippines accumulated one of the highest concentrations of terrestrial biodiversity on Earth (Catibog-Sinha and Heaney, 2006; Brown and Diesmos, 2009; Heaney and Regalado, 1998; Brown et al., 2013) has been of interest to evolutionary biologists since the founding of biogeography (Wallace, 1869; Huxley, 1868; Dickerson, 1928; Diamond and Gilpin, 1983; Brown, 2016; Lomolino et al., 2016). Since the late Pliocene, the archipelago’s five major (and several minor) aggregate island complexes were repeatedly fragmented by interglacial rises in sea level into clusters of landmasses resembling today’s islands, followed by island fusion via land bridge exposure as sea levels fell during glacial periods (Haq et al., 1987; Rohling et al., 1998; Siddall et al., 2003; Miller et al., 2005; Spratt and Lisiecki, 2016). The repeated fragmentation-fusion cycles of this insular landscape have generated a prominent hypothesis to explain the high levels of terrestrial biodiversity across the Philippines (Inger, 1954; Heaney, 1985; Brown and Guttman, 2002; Evans et al., 2003; Heaney et al., 2005; Roberts, 2006; Linkem et al., 2010; Siler et al., 2010, 2011, 2012; Brown and Siler, 2014). However, there is growing evidence that (1) older tectonic processes (~30–5 mya) of precursor paleoislands (Jansa et al., 2006; Blackburn et al., 2010; Siler et al., 2012; Brown and Siler, 2014; Brown et al., 2016), (2) dispersal events from mainland source populations (Diamond and Gilpin, 1983; Brown and Guttman, 2002; Brown and Siler, 2014; Chan and Brown, 2017), (3) repeated colonizations among islands (Siler et al., 2011; Justiniano et al., 2015; Brown et al., 2016), and (4) fine-scale in situ isolating mechanisms (Heaney et al., 2011; Linkem et al., 2011; Siler et al., 2011, 2012; Hosner et al., 2013; Brown et al., 2015), have been important causes of diversification among and within many of the islands.

Oaks et al. (2019) found support for independent divergence times among inter-island pairs of Cyrtodactylus and Gekko populations from across the Philippines, suggesting that rare, over-water colonization, perhaps mediated by rafting on vegetation, might have been a more important mechanism of isolation than sea-level fragmentation in these gekkonid lizards. Our fully phylogenetic approach to this problem has allowed us to look for shared divergences across the full evolutionary history of extant populations in these clades, finding evidence for shared divergences that were missed by the pairwise approach. These results emphasize a pitfall of previous methods for inferring shared divergences that compare pairs of populations (Hickerson et al., 2006; Huang et al., 2011; Oaks, 2014, 2019): the choice of pairs is somewhat arbitrary, and pairwise comparisons can miss more complex patterns of shared divergences in the shared ancestry of the taxa under study.

We recognize that our use of secondary calibrations to convert the timescale of the diversification of each genus into millions of years is error-prone, and should not be used to tie estimated shared divergences to specific geological or climatic events. However, given how recent most of the estimated shared divergences are for both Cyrtodactylus and Gekko (Figure 6; Table S3), it is unlikely that the error from our calibrations is great enough that the true timing of these divergences would pre-date the late Pliocene. Thus, we conclude these estimated divergences are consistent with predictions of the model of diversification based on Plio-Pleistocene interglacial fragmentation of the islands. Given that the numbers of shared or multifurcating divergence events estimated within this time frame are relatively low for both Cyrtodactylus and Gekko (6 and 5, respectively), and have weak to moderate support, our findings are also consistent with accumulating evidence that a number of complex processes of diversification have played important roles in shaping the distribution of life across the Philippines, not just paleo-island fragmentation.

A simultaneous analysis involving broader taxonomic sampling of Philippine gekkonids (e.g., Gekko, Cyrtodactylus, Pseudogekko, Lepidodactylus, and Luperosaurus; Wood, Jr. et al., 2020) would likely reveal support for an increased number of shared divergences across the archipelago, including older divergences predicted by geological processes that pre-date the Plio-Pleistocene interglacial fragmentations. When comparing our results between Cyrtodactylus and Gekko, we see some patterns suggestive of such shared diversification. For example, early divergences in both genera show patterns consistent with arrival into the archipelago, and subsequent diversification, via the Palawan Island Arc (Blackburn et al., 2010; Siler et al., 2012). Results from both genera support a pattern where a clade, sister to species endemic to the Palawan microcontinental block, began diversifying across the oceanic islands of the Philippines approximately 25–20 mya (Figure 6). Among the divergences estimated to have occurred within the last 2 my, there also appear to be regional consistencies in when and where lineages were diversifying in the Philippines, including population-level diversification for the widespread Cyrtodactylus philippinicus and Gekko mindorensis within and among the Mindoro and West Visayan faunal regions in the central Philippines (Figures 6, S14, & S15; Siler et al., 2012, 2014). Regardless of temporal concordance among divergences, the results of this work further support Philippine species within both focal clades having originated in the archipelago as a result of one or more faunal exchanges between oceanic portions of the Philippines associated historically with the Philippine mobile belt and the Palawan microcontinental block (Brown et al., 2013; Yumul et al., 2008).

Currently, broader taxonomic analyses are limited by a simplifying assumption of phycoeval that mutation rates are constant across the tree. We sought to minimize the effects of violations of this assumption by analyzing the two gekkonid genera separately. The Philippine species in each genus are closely related (the posterior mean root age in expected substitutions per site for Cyrtodactylus and Gekko was 0.012 and 0.013, respectively) and share similar natural histories, so an assumption of a similar rate of mutation across the populations we sampled within each genus seems reasonable. Future developments of phycoeval allowing the rate to vary across the phylogeny would be an obvious way to improve our current implementation and make it more generally applicable to a greater diversity of systems.

3.3 Future directions

Given that processes of co-diversification are of interest to fields as diverse as biogeography, epidemiology, and genome evolution, we hope the generalized tree model offers a statistical framework for studying these processes across the life sciences. To help achieve this, there are several ways to improve upon our current implementation of this approach. Allowing the generalized tree model and associated MCMC algorithms to be coupled with a diverse set of phylogenetic likelihood models is an obvious way to expand its applicability to more data types and systems. The independence of the tree model and MCMC algorithms from the likelihood function makes this relatively straightforward. Similarly, our approach can be extended to accommodate tips sampled through time (Stadler, 2010; Heath et al., 2014; Gavryushkina et al., 2016; Stadler et al., 2018) and “relaxed-clock” models (Drummond et al., 2006; Drummond and Suchard, 2010; Heath et al., 2011). The former would allow for fossil and epidemiological data, and the latter would allow it to be applied to diverse sets of taxa that are expected to vary in their rates of mutation.

As we alluded to above when discussing MCMC behavior, expanding the set of tree models to include all possible non-reticulating topologies with one to N – 1 divergence times could have important implications for the joint posterior distribution of phylogenetic models. We suggest posteriors that are rugged under a tree model with strictly independent and bifurcating divergences might be smoother under a generalized tree model, but more formal theoretical work to characterize this joint space is needed.

Lastly, the distribution we used over the generalized tree space (uniform over topologies with beta-distributed node heights) is motivated by mathematical convenience, rather than inspired by biological processes. Process-based models, like a generalized birth-death model, could provide additional insights. In addition to inferring phylogenies with shared or multifurcating divergences, process-based models would allow us to infer the macroevolutionary parameters that govern the rate of such divergences.

4 Methods

4.1 Generalized tree model

Let T represent a rooted, potentially multifurcating tree topology with N tips and n(t) internal nodes t = t₁, t₂, …, t_n(t), where n(t) can range from 1 (the “comb” tree) to N – 1 (fully bifurcating, independent divergences). Each internal node t is assigned to one divergence time τ, which it may share with other internal nodes in the tree. We will use τ = τ₁, …, τ_n(τ) to represent n(τ) divergence times, where n(τ) can also range from 1 to N – 1, every τ has at least one node assigned to it, and every node maps to a divergence time more recent than its parent (Figure S17).

To formalize a distribution across this space of generalized trees, we assume all possible topologies (T) are equally probable (see Figure S1 for an example of the sample space of topologies). We also assume the age of the root node follows a parametric distribution (e.g., a gamma distribution), and each of the other divergence times is beta-distributed between the present (τ₀) and the height of the youngest parent of a node mapped to the divergence time (Figure S17). This was inspired by and related to the Dirichlet distribution on divergence times of Kishino et al. (2001), but we use beta distributions to make it easier to deal with the fact that under our generalized tree model, multiple nodes can be mapped to each divergence time. For additional flexibility, we allow a distribution to be placed on the alpha parameter of the beta distributions of all the non-root divergence times, which we denote as α_τ.
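The nested structure of this prior on divergence times can be illustrated with a short sampling sketch. The Python below is not the ecoevolity implementation; the data structures (`parent`, `time_of`), the function name, and the use of 1 for the second beta parameter are assumptions made for illustration.

```python
import random


def draw_divergence_times(parent, time_of, root, root_shape, root_mean,
                          alpha=1.0, rng=None):
    """Draw one set of divergence times for a fixed generalized topology.

    parent  : dict mapping each non-root internal node to its parent node
    time_of : dict mapping each internal node to the index of its divergence time
    """
    rng = rng or random.Random()

    # Group internal nodes by the divergence time they are mapped to.
    nodes_at = {}
    for node, k in time_of.items():
        nodes_at.setdefault(k, []).append(node)

    # Root age ~ gamma; random.gammavariate takes (shape, scale).
    times = {time_of[root]: rng.gammavariate(root_shape, root_mean / root_shape)}

    def draw(k):
        if k not in times:
            # Upper bound: the youngest parent of any node mapped to time k.
            bound = min(draw(time_of[parent[n]]) for n in nodes_at[k])
            # Assumption: Beta(alpha, 1), which is uniform when alpha = 1.
            times[k] = bound * rng.betavariate(alpha, 1.0)
        return times[k]

    for k in nodes_at:
        draw(k)
    return times


# Hypothetical 5-tip tree: internal nodes "a" and "b" share divergence time 1.
parent = {"a": "root", "b": "root", "c": "a"}
time_of = {"root": 0, "a": 1, "b": 1, "c": 2}
times = draw_divergence_times(parent, time_of, "root", 10.0, 0.2,
                              rng=random.Random(1))
```

Because nodes "a" and "b" map to the same index, they receive a single shared time, bounded above by the age of their youngest parent (here, the root).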

4.2 Likelihood model

To perform Bayesian phylogenetic inference under the generalized tree model, it can be coupled with any function for calculating the probability of data evolving along a tree. This means it can be coupled with any data type and associated phylogenetic likelihood function. Even if the likelihood function does not explicitly accommodate multifurcations, these can be treated as a series of arbitrary bifurcations with branches of zero length to obtain the same likelihood of the tree.

Here, we couple the generalized tree model with a multi-species coalescent model that allows the likelihood of any species tree to be estimated directly from biallelic character data, while analytically integrating out all possible gene trees and character substitution histories along those gene trees. Below we give a brief overview of this model; for a full description of this likelihood model, please see Bryant et al. (2012), and see Oaks (2019) for a correction when only variable characters are analyzed.

4.2.1 The data

From N species for which we wish to infer a phylogeny, we assume we have collected orthologous, biallelic genetic characters. By “biallelic”, we mean that each character has at most two states, which we refer to as “red” and “green” following Bryant et al. (2012). For each character from each species, we have collected n copies of the locus, r of which are copies of the red allele. We will use n and r to denote allele counts for one character from all N species; i.e., n, r = {(n₁, r₁), (n₂, r₂), … (n_N, r_N)}. We use D to represent these allele counts across all the characters.

4.2.2 The evolution of characters

We assume each character evolved along a gene tree (g) according to a finite-characters, continuous-time Markov chain (CTMC) model, and the gene tree of each character is independent of the others, conditional on the species tree (i.e., the characters are effectively unlinked). We use u and v to denote the relative rate of mutating from the red to green state and vice versa, respectively, as a character evolves along a gene tree, forward in time (Bryant et al., 2012; Oaks, 2019). Thus, π = u/(u + v) is the stationary frequency of the green state. We denote the overall rate of mutation as μ, which we assume is constant across the tree (i.e., a “strict clock”). Because evolutionary change is the product of μ and time, when μ = 1, time is measured in units of expected substitutions per character. If a mutation rate per character per unit of time is given, then time is measured in those units (e.g., generations or years).
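As an illustration of this two-state CTMC, the sketch below computes the standard closed-form transition probabilities along a branch of duration t. The function name is hypothetical, and the rates u and v are used as given, with no additional normalization; that is an assumption of this sketch rather than a statement about how ecoevolity scales them.

```python
import math


def two_state_transition_probs(u, v, mu, t):
    """Return P[start][end] for states 'red' and 'green' after time t.

    u : relative rate red -> green; v : relative rate green -> red.
    """
    pi_green = u / (u + v)   # stationary frequency of the green state
    pi_red = 1.0 - pi_green
    decay = math.exp(-(u + v) * mu * t)
    return {
        "red": {"red": pi_red + pi_green * decay,
                "green": pi_green * (1.0 - decay)},
        "green": {"red": pi_red * (1.0 - decay),
                  "green": pi_green + pi_red * decay},
    }


# With u = v (pi = 0.5), the two states are exchangeable.
probs = two_state_transition_probs(u=1.0, v=1.0, mu=1.0, t=0.01)
```

As t grows, each row converges to the stationary frequencies (1 − π, π), regardless of the starting state.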

4.2.3 The evolution of gene trees

We assume the gene trees of each character branched according to a multi-species coalescent model within a single, shared, generalized species tree, where each branch i represents a population with a constant effective size (Nielsen and Wakeley, 2001; Rannala and Yang, 2003; Liu and Pearl, 2007; Heled and Drummond, 2010; Bryant et al., 2012). We use N_e to denote the effective population sizes for all branches in the generalized tree, with topology T and divergence times τ; where n(t) + N is equal to the number of branches in the tree.

4.2.4 The likelihood

Using the work of Bryant et al. (2012), we analytically integrate over all possible gene trees and character substitution histories to compute the likelihood of the species tree directly from all m biallelic characters under a multi-population coalescent model (Kingman, 1982a,b; Rannala and Yang, 2003),
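Because the characters are assumed to be unlinked, this likelihood factors into a product over the m characters. In the notation defined above, and with the character index i introduced here as an assumption of this sketch, it can be written as:

```latex
p(D \mid T, \tau, N_e, \mu, \pi)
  = \prod_{i=1}^{m} p(\mathbf{n}_i, \mathbf{r}_i \mid T, \tau, N_e, \mu, \pi)
```

Each factor is the probability of the allele counts for one character, with gene trees and substitution histories already integrated out.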

To accommodate multifurcations, we used recursion and Equation 19 of Bryant et al. (2012). This equation shows how to obtain the conditional probabilities at the bottom of an ancestral branch by merging the conditional probabilities at the top of its two descendant branches. At a multifurcation, we recursively apply Equation 19 of Bryant et al. (2012) to merge the conditional probabilities of each descendant branch in arbitrary order. We confirmed that this recursion returns the same likelihood as treating the multifurcation as a series of bifurcations with zero-length branches.

4.3 Bayesian inference

The joint posterior probability distribution of the tree (with potential shared and multifurcating divergences) and other model parameters is
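By Bayes' rule, this posterior is proportional to the likelihood multiplied by the priors on the tree and the other parameters. A sketch in the notation above, assuming independent priors on the effective population sizes and the stationary frequency, and treating μ and α_τ as fixed; the exact conditioning shown here is an assumption:

```latex
p(T, \tau, N_e, \pi \mid D, \mu, \alpha_\tau)
  \propto p(D \mid T, \tau, N_e, \mu, \pi) \,
          p(T, \tau \mid \alpha_\tau) \, p(N_e) \, p(\pi)
```

The normalizing constant is the marginal probability of the data, summed over all generalized topologies and integrated over all continuous parameters.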

4.3.1 Priors

We use the generalized tree distribution described above as the prior on the topology (T) and divergence times (τ). For all of our analyses below, we (1) set the alpha parameter of the beta distributions on non-root divergence times (α_τ) to 1, (2) set the mutation rate (μ) to 1, so that time is in units of expected substitutions per character, (3) assume one gamma-distributed effective population size is shared across all the branches of the species tree, and (4) set the stationary frequencies of the two character states to be equal (π = 0.5), making our CTMC model of character evolution a two-state equivalent to the “JC69” model of nucleotide substitution (Jukes and Cantor, 1969).

4.4 Approximating the posterior of generalized trees

We use Markov chain Monte Carlo (MCMC) algorithms (Metropolis et al., 1953; Hastings, 1970; Green, 1995) to sample from the joint posterior in Equation 2. To sample across trees with different numbers of divergence times during the MCMC chain, we use reversible-jump MCMC (Green, 1995). We also use univariate and multivariate Metropolis-Hastings algorithms (Metropolis et al., 1953; Hastings, 1970) to update the divergence times and effective population sizes. See the Supporting Information for details and validations of our MCMC algorithms.

4.5 Software implementation

We implemented the models and algorithms above for approximating the joint posterior distribution of generalized trees, divergence times, and other model parameters in the software package ecoevolity (Oaks, 2019; Oaks et al., 2019, 2020). The C++ source code for ecoevolity is freely available from https://github.com/phyletica/ecoevolity and includes an extensive test suite. From the C++ source code, three command-line tools are compiled for generalized tree analyses: (1) phycoeval, for performing Bayesian inference under the model described above, (2) simphycoeval, for simulating data under the model described above, and (3) sumphycoeval, for summarizing the posterior samples of generalized trees collected by phycoeval. Documentation for how to install and use the software is available at http://phyletica.org/ecoevolity/. A detailed, version-controlled history of this project, including all of the data and scripts needed to produce our results, is available as a GitHub repository (https://github.com/phyletica/phycoeval-experiments) and was archived on Zenodo (Oaks, 2021). We used multiple commits of ecoevolity for the analyses below, as we added features to the sumphycoeval tool (this history is documented in the project repository). However, all of our analyses can be replicated using Version 1.0.0 (Commit 2ed8d6ec) of ecoevolity.

4.6 Simulation-based analyses

4.6.1 Methods used for all our simulations (unless noted)

We used simphycoeval to simulate data sets of 50,000 biallelic characters from one diploid individual from nine species (i.e., two copies of each character sampled from each species). Except for our simulations of linked characters described below, the characters were unlinked (i.e., each character was simulated along an independent gene tree within the species tree). For all of our simulations and analyses, we constrained the branches of the species tree to share the same mutation-scaled, diploid effective population size (N_e μ), which we randomly drew from a gamma distribution with a shape of 20 and mean of 0.001. We used this distribution as the prior on N_e in subsequent analyses of the simulated data sets. The mean of this distribution corresponds to an average number of differences per character between individuals of 0.004, which is comparable to estimates from genomic data from populations of zooplankton (Choquet et al., 2019), stickleback fish (Hohenlohe et al., 2010), humans (Auton et al., 2015), and the gecko species we analyze here (Oaks et al., 2019).
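The arithmetic behind this prior can be made explicit with a small sketch. The names below are hypothetical; the only facts taken from the text are the shape (20), the mean (0.001), and the relationship between the mean and the expected number of differences per character (4 N_e μ = 0.004).

```python
import random

SHAPE = 20.0
MEAN = 0.001
SCALE = MEAN / SHAPE  # a gamma distribution has mean = shape * scale

# Expected differences per character between two gene copies: 4 * N_e * mu.
EXPECTED_DIVERSITY = 4.0 * MEAN


def draw_scaled_pop_size(rng):
    """Draw one mutation-scaled effective population size (N_e * mu)."""
    # random.gammavariate takes (shape, scale).
    return rng.gammavariate(SHAPE, SCALE)


rng = random.Random(0)
draws = [draw_scaled_pop_size(rng) for _ in range(20000)]
sample_mean = sum(draws) / len(draws)
```

With a shape of 20, the prior is fairly concentrated: its standard deviation is the mean divided by the square root of the shape, or roughly 0.00022.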

We analyzed each simulated data set under two models using phycoeval: the generalized tree model described above, which we denote as M_G, and an otherwise equivalent model that is constrained to the space of trees with independent, bifurcating divergences (i.e., trees with N – 1 divergence times), which we denote as M_IB. For both M_G and M_IB, we used a gamma-distributed prior on the age of the root node with a shape of 10 and mean of 0.2. For each data set we ran four independent MCMC chains for 15,000 generations, sampling every 10 generations, and retaining the last 1000 samples of each chain to approximate the posterior (4000 total samples). For each generation, nine (equal to the number of tips) MCMC moves are randomly selected in proportion to specified weights, some of which automatically call other moves after finishing to improve mixing. Each chain started from a random bifurcating topology with no shared divergences, and the root age and other divergence times drawn randomly from their respective prior distributions.

From the 4000 posterior samples collected for each simulated dataset, we used sumphycoeval to calculate the mean and 95% credible intervals of the root age, tree length, effective population size, and the number of divergence times, and to summarize the frequency of sampled topologies, splits, nodes, and shared divergences. We define a split as a branch in the tree that “splits” the tips of the tree into two non-overlapping subsets; those that do and do not descend from the branch. We define a node as a split with a particular set of splits that descend from it; this is necessary to summarize the frequency of multifurcations. We also used sumphycoeval to calculate the distance between every sampled tree and the true tree using the square root of the sum of squared differences in branch lengths (Robinson and Foulds, 1979; Kuhner and Felsenstein, 1994). To assess convergence and mixing of the chains, we used sumphycoeval to calculate the average standard deviation of split frequencies (ASDSF; Lakner et al., 2008) across the four chains with a minimum split frequency threshold of 10%, as well as the potential scale reduction factor (PSRF; the square root of Equation 1.1 in Brooks and Gelman, 1998) and effective sample size (ESS; Gong and Flegal, 2016) of the log likelihood, root age, tree length, and effective population size.
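To make two of these summaries concrete, the sketch below computes the branch-length distance between two trees and the ASDSF across chains. It is not the sumphycoeval code. It assumes trees are represented as dictionaries mapping each split (a frozenset of tip labels) to its branch length, that a split absent from a tree contributes a length of zero, that a split is included in the ASDSF if it reaches the minimum frequency in at least one chain, and that the sample standard deviation is used.

```python
import math
import statistics


def branch_length_distance(tree_a, tree_b):
    """Square root of the sum of squared differences in branch lengths."""
    splits = set(tree_a) | set(tree_b)
    return math.sqrt(sum((tree_a.get(s, 0.0) - tree_b.get(s, 0.0)) ** 2
                         for s in splits))


def asdsf(chain_split_freqs, min_freq=0.1):
    """Average standard deviation of split frequencies across chains.

    chain_split_freqs : list (one entry per chain) of dicts mapping
                        split -> frequency of that split in the chain.
    """
    splits = set().union(*chain_split_freqs)
    sds = []
    for s in splits:
        freqs = [chain.get(s, 0.0) for chain in chain_split_freqs]
        # Assumption: keep a split if any chain reaches the threshold.
        if max(freqs) >= min_freq:
            sds.append(statistics.stdev(freqs))
    return sum(sds) / len(sds) if sds else 0.0


ab = frozenset({"A", "B"})
cd = frozenset({"C", "D"})
true_tree = {ab: 0.3, cd: 0.4}
sampled_tree = {ab: 0.3}  # the sampled tree lacks the C+D split
distance = branch_length_distance(true_tree, sampled_tree)
```

Chains that agree on every split frequency give an ASDSF of zero; larger values indicate that the chains are sampling different regions of tree space.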

4.6.2 Simulations on fixed trees

We used simphycoeval to simulate 100 data sets on two fixed trees with 9 species, one with shared and multifurcating divergences (Figure 2A) and the other with only bifurcating, independent divergences (Figure 3A). We analyzed each simulated data set under models M_G and M_IB, both with and without constant characters; for the latter we specified for phycoeval to correct the likelihood for only sampling variable characters (Bryant et al., 2012; Oaks, 2019).

To explore the improved MCMC mixing under the M_G model for data sets simulated on trees with shared or multifurcating divergences (see results), we re-ran the analyses under the M_IB model on the 100 data sets simulated on the tree shown in Figure 2A. In these re-analyses under M_IB, we ran the MCMC chain twice as long (30,000 generations versus 15,000) while sampling half as frequently (every 20 generations versus 10), and started each chain with the correct tree.

4.6.3 Simulations on random trees

Using simphycoeval, we also simulated 100 data sets on trees randomly drawn from the prior distributions of the M_G and M_IB models. As above, we analyzed each simulated data set with and without constant characters under the M_G and M_IB models. We used MCMC to sample trees randomly from the prior distributions of both models. More specifically, we used simphycoeval to (1) randomly assemble a strictly bifurcating tree with no shared divergence times, (2) run an MCMC chain of topology-changing moves for a specified number of generations (we used 1000), and (3) draw the root age, other divergence times, and the effective population sizes randomly from their respective prior distributions. For each MCMC generation, nine (equal to the number of tips) topology-changing moves were randomly selected in proportion to specified weights.

Due to the nested beta (uniform) distributions on non-root divergence times, some trees sampled from M_G and M_IB will have all or most of the divergence times close to zero. This happens when one of the oldest non-root divergences is randomly assigned a time near zero. For example, the trees shown in Figure S16A–C all have eight independent, bifurcating divergences. Given such trees, it is nearly impossible to differentiate independent divergences with a finite data set. It is also not clear what an investigator would want phycoeval to infer given a true tree like Figure S16A; whereas eight independent divergences is technically correct in the synthetic world of nested beta distributions, it seems an unlikely biological explanation. To avoid such extreme scenarios, we rejected any trees that had divergence times closer than 0.001 substitutions per site. This resulted in 61 and 201 trees being rejected in order to obtain 100 trees under the M_G and M_IB models, respectively. Despite this arbitrary filtering threshold, challenging tree shapes remained in our sample for simulations. For example, see the trees in Figure S16D–F, all with eight independent, bifurcating divergences.
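The rejection step can be sketched as a simple filter on the sampled divergence times. The function names are hypothetical, and the sketch assumes the threshold applies to the gap between each pair of adjacent, distinct divergence-time parameters (nodes that share a divergence contribute a single time).

```python
def has_close_divergence_times(times, min_gap=0.001):
    """True if any two adjacent divergence times are closer than min_gap."""
    ordered = sorted(times)
    return any(b - a < min_gap for a, b in zip(ordered, ordered[1:]))


def sample_accepted(draw_times, n_wanted, min_gap=0.001):
    """Call draw_times() until n_wanted trees pass; also count rejections."""
    accepted, n_rejected = [], 0
    while len(accepted) < n_wanted:
        times = draw_times()
        if has_close_divergence_times(times, min_gap):
            n_rejected += 1
        else:
            accepted.append(times)
    return accepted, n_rejected


# Deterministic stand-in for draws from the tree prior.
candidates = iter([
    [0.0500, 0.0504, 0.2000],  # rejected: two times only 0.0004 apart
    [0.0100, 0.0200, 0.2000],  # accepted
    [0.0300, 0.1500, 0.2500],  # accepted
])
accepted, n_rejected = sample_accepted(lambda: next(candidates), n_wanted=2)
```

Counting the rejections alongside the accepted draws gives the kind of tally reported above for each model.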

4.6.4 Simulations of linked characters

The likelihood model above assumes characters are unlinked (i.e., they evolved along gene trees that are independent of one another conditional on the species tree). To assess the effect on inference of violating this assumption, we repeated the simulations and analyses above (for both fixed and random trees), but simulated 500 loci of 100 linked characters each (i.e., for each locus, 100 characters evolved along a shared gene tree). We used simphycoeval to simulate these data sets in two ways: (1) all 50,000 characters are simulated and retained, and (2) only (at most) one variable character is retained for each locus. For the latter data sets, characters are unlinked, but only (at most) 500 characters, all variable, are sampled. We analyzed all of these data sets under both the M_G and M_IB models. For data sets with only variable characters, we corrected the likelihood for not sampling constant characters (Bryant et al., 2012; Oaks, 2019).
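The second data-set type can be illustrated with a sketch of thinning linked loci to at most one variable character each. This is not how simphycoeval is implemented; the representation of a character as a tuple of allele states (with `None` for missing data) and the random choice among the variable characters of a locus are assumptions made for illustration.

```python
import random


def is_variable(character):
    """A character is variable if more than one allele state is observed."""
    observed = {allele for allele in character if allele is not None}
    return len(observed) > 1


def thin_to_one_variable_site(loci, rng=None):
    """Keep at most one variable character per locus.

    loci : list of loci, each a list of characters; a character is a
           tuple of allele states, one per sampled gene copy.
    """
    rng = rng or random.Random()
    kept = []
    for locus in loci:
        variable = [c for c in locus if is_variable(c)]
        if variable:
            # Assumption: choose uniformly among the variable characters.
            kept.append(rng.choice(variable))
    return kept


# Two hypothetical loci; the second has no variable characters.
loci = [
    [(0, 0, 0, 0), (0, 1, 0, 0)],
    [(1, 1, 1, 1), (0, 0, 0, 0)],
]
kept = thin_to_one_variable_site(loci, rng=random.Random(0))
```

Loci with no variable characters contribute nothing, which is why the thinned data sets contain at most 500 characters.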

4.7 Inference of shared divergences in Philippine gekkonids

We applied our new approach to two genera of geckos, Gekko and Cyrtodactylus, sampled across the Philippine Islands. We used the RADseq data of Oaks et al. (2019) available on the NCBI Sequence Read Archive (Bioproject PRJNA486413, SRA Study SRP158258).

4.7.1 Assembling alignments

We used ipyrad (Version 0.9.43; Eaton and Overcast, 2020) to assemble the RADseq reads into loci for both genera. All of the scripts and ipyrad parameter files we used to assemble the data are available in our gekkonid project repository (https://github.com/phyletica/gekgo) archived on Zenodo (Oaks and Wood, Jr., 2021), and the ipyrad settings are listed in Table S5. Using pycoevolity (Version 0.2.9; Commit 217dbeea; Oaks, 2019), we converted the ipyrad alignments into nexus format, and in the process, removed sites that had more than two character states. The final alignment for Cyrtodactylus contained 1702 loci and 155,887 characters from 27 individuals, after 567 characters with more than two states were removed. The final alignment for Gekko contained 1033 loci and 94,612 characters from 26 individuals, after 201 characters with more than two states were removed. Both alignments had less than 1% missing characters. The assembled data matrices for Cyrtodactylus and Gekko are available in our project repository (https://github.com/phyletica/phycoeval-experiments) and the data associated with specimens are provided in Tables S1 & S2.

4.7.2 Phylogenetic analyses

When analyzing the Cyrtodactylus and Gekko character matrices with phycoeval, we (1) fixed stationary state frequencies to be equal (π = 0.5), (2) set the mutation rate (μ) to 1 so that divergence times are in units of expected substitutions per site, (3) used an exponentially distributed prior with a mean of 0.01 for the age of the root, (4) set α_τ = 1 so that non-root divergence times are uniformly distributed between zero and the age of the youngest parent node, and (5) assumed a single diploid effective population size (N_e) shared across the branches of the tree with a gamma-distributed prior. For the gamma prior on N_e, we used a shape of 2.0 and mean of 0.0005 for Cyrtodactylus, and a shape of 4.0 and mean of 0.0002 for Gekko, based on estimates of Oaks et al. (2019) from the same and related species.

For both genera, we ran 25 independent MCMC chains for 15,000 generations, sampling the state of the chain every 10 generations. In each generation, phycoeval attempts N MCMC moves (27 and 26 for Cyrtodactylus and Gekko, respectively) randomly selected in proportion to specified weights, some of which automatically call other moves after finishing to improve mixing. For 20 of the chains, we specified for phycoeval to start from the “comb” topology (n(τ) = 1). For the remaining five chains, we had phycoeval start with a random bifurcating topology with no shared divergences (n(τ) = N – 1).

We used sumphycoeval to summarize the sampled values of all parameters and the frequency of sampled topologies, splits, nodes, and shared divergences. To assess convergence and mixing, we used sumphycoeval to calculate the average standard deviation of split frequencies (ASDSF; Lakner et al., 2008) and the potential scale reduction factor (PSRF; the square root of Equation 1.1 in Brooks and Gelman, 1998) and effective sample size (ESS; Gong and Flegal, 2016) of all parameters across all 25 MCMC chains. We present these convergence statistics in Table S4.

To plot the trees, we used sumphycoeval to scale the branch lengths of all the sampled trees so that the posterior mean root age was 23.07 million years for Cyrtodactylus (Grismer et al., 2022) and 33.76 million years for Gekko; the latter age is based on a time-calibrated phylogenetic estimate from another data set that is being prepared for publication. Our goal in scaling the branch lengths to millions of years is not to test whether shared divergence events correspond with the onset of specific interglacial periods, but rather to see if shared divergences fall within the general time frame predicted by a model of sea-level-driven diversification (i.e., within the last ≈4 million years).
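The rescaling amounts to multiplying every node height in every sampled tree by a single factor. A minimal sketch, with hypothetical names and made-up root ages used only to show the arithmetic:

```python
def root_age_scale_factor(sampled_root_ages, target_mean_root_age):
    """Factor that makes the posterior mean root age equal the target."""
    mean_root_age = sum(sampled_root_ages) / len(sampled_root_ages)
    return target_mean_root_age / mean_root_age


# Hypothetical root ages in expected substitutions per site.
sampled_root_ages = [0.010, 0.012, 0.014]
factor = root_age_scale_factor(sampled_root_ages, 23.07)

# Applying the same factor to every sample preserves relative node ages.
scaled_root_ages = [age * factor for age in sampled_root_ages]
scaled_mean = sum(scaled_root_ages) / len(scaled_root_ages)
```

Because one factor is shared across all samples, the posterior uncertainty in node ages is rescaled rather than removed, and any error in the calibration carries over proportionally to every divergence time.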

Data availability

We recorded a detailed history of all our analyses for this project in a version-controlled repository, which is publicly available at github.com/phyletica/phycoeval-experiments and archived on Zenodo (doi.org/10.5281/zenodo.5162056; Oaks, 2021). The C++ source code for ecoevolity is freely available from github.com/phyletica/ecoevolity, with detailed documentation and tutorials available at phyletica.org/ecoevolity. All of the scripts and parameter files we used to assemble the gecko datasets are available in our gekkonid project repository (github.com/phyletica/gekgo) that is archived on Zenodo (doi.org/10.5281/zenodo.5162085; Oaks and Wood, Jr., 2021). The gecko sequence reads are available on the NCBI Sequence Read Archive (Bioproject PRJNA486413, SRA Study SRP158258).

Supporting Information

1 Figures referenced in main text

Figure S1.

An example of the expanded sample space of topologies for the generalized tree distribution (M_G) versus a tree model that assumes independent, bifurcating divergences (M_IB). With four tips, there are 15 topologies for the M_IB model, all with three divergence times (within box). The M_G model has 14 additional topologies with fewer than three divergence times. Notice the internal nodes are not labeled; i.e., the rank, or order, of non-nested internal nodes does not matter. E.g., in the topology at the top-left, regardless of whether A & B or C & D diverge first, it is the same topology. However, if A & B and C & D diverge at the same time, this is a different topology (or tree model) with one fewer divergence-time parameter (the topology in the upper-left of the n(τ) = 2 section). Trees plotted using Gram (Version 4.0.0; Foster, 2018) and the P4 phylogenetic toolkit (Version 1.4, Commit 5742542; Foster, 2004).

Figure S2.

The performance of the M_G and M_IB tree models when applied to 100 data sets, each with 50,000 biallelic characters simulated on species trees randomly drawn from the M_IB tree distribution. (A) The square root of the sum of squared differences in branch lengths between the true tree and each posterior tree sample (Kuhner and Felsenstein, 1994); the point and bars represent the posterior mean and equal-tailed 95% credible interval, respectively. P-values are shown for Wilcoxon signed-rank tests (Wilcoxon, 1945) comparing the paired differences in tree distances between methods. (B) Violin plots comparing the mean posterior probabilities of true splits for each of the 100 simulated trees. For each simulation, the mutation-scaled effective population size (N_eμ) was drawn from a gamma distribution (shape = 20, mean = 0.001) and shared across all the branches of the tree; this distribution was used as the prior in analyses. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S3.

The accuracy and precision of the M_G and M_IB models at estimating the age of the root (in expected substitutions per site) from data sets with 50,000 biallelic characters simulated on species trees randomly drawn from the M_G and M_IB tree distributions. Each plotted circle and associated error bars represent the posterior mean and 95% credible interval. Estimates for which the potential-scale reduction factor was greater than 1.2 (Brooks and Gelman, 1998) or the effective sample size was less than 200 are highlighted in red. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S4.

The accuracy and precision of the M_G and M_IB models at estimating the tree length (the sum of all branch lengths in units of expected substitutions per site) from data sets with 50,000 biallelic characters simulated on species trees randomly drawn from the M_G and M_IB tree distributions. Each plotted circle and associated error bars represent the posterior mean and 95% credible interval. Estimates for which the potential-scale reduction factor was greater than 1.2 (Brooks and Gelman, 1998) or the effective sample size was less than 200 are highlighted in red. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S5.

The accuracy and precision of the M_G and M_IB models at estimating the effective population size (N_eμ) across the tree from data sets with 50,000 biallelic characters simulated on species trees randomly drawn from the M_G and M_IB tree distributions. Each plotted circle and associated error bars represent the posterior mean and 95% credible interval. Estimates for which the potential-scale reduction factor was greater than 1.2 (Brooks and Gelman, 1998) or the effective sample size was less than 200 are highlighted in red. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S6.

The M_G model has a low false positive rate (FPR; the proportion of incorrectly merged divergence times with a posterior probability > 0.5) when applied to data simulated on trees drawn from M_G with all (Row 1) or only variable (Row 2) unlinked characters, and all characters from linked loci (Row 3). Support for incorrectly merged divergence times is high only when the difference between the times is small (Column 2), and is not correlated with the age of the merged nodes (Column 3; P = 0.109, 0.106, 0.053, and 0.068, from top to bottom, for a t-test that Pearson’s correlation coefficient = 0 using all points with posterior probability > 0). When data sets with linked loci are reduced to only one variable site per locus (Row 4), the FPR increases (left) and precision decreases (right). The top row is the same as Figure 5A–C. Time units are expected substitutions per site. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S7.

The M_G model has a low false positive rate (FPR; the proportion of incorrectly merged divergence times with a posterior probability > 0.5) when applied to data simulated on trees drawn from M_IB (no shared or multifurcating divergences) with all (Row 1) or only variable (Row 2) unlinked characters, and all characters from linked loci (Row 3). Support for incorrectly merged divergence times is high only when the difference between the times is small (Column 2), and is not correlated with the age of the merged nodes (Column 3; P = 0.25, 0.29, 0.11, and 0.11, from top to bottom, for a t-test that Pearson’s correlation coefficient = 0 using all points with posterior probability > 0). When data sets with linked loci are reduced to only one variable site per locus (Row 4), the FPR increases (left) and precision decreases (right). The top row is the same as Figure 5D–F. Time units are expected substitutions per site. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S8.

Markov chain Monte Carlo (MCMC) sampling yielded better convergence and mixing among chains under the generalized tree model when there were shared or multifurcating divergences. Convergence and mixing of sampled trees is summarized using the average standard deviation of split frequencies (ASDSF; Lakner et al., 2008) across the four chains of each analysis; smaller standard deviations across chains indicate better sampling behavior. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S9.

The (A) accuracy and (B) Markov chain Monte Carlo (MCMC) sampling efficiency of the M_G tree model are better than those of M_IB, even when the MCMC chains for M_IB are started with the correct tree and are run twice as long and sampled half as frequently (“long chains”; 30,000 generations, sampling every 20). Results from 100 data sets simulated on the species tree shown in Figure 2A. (A) The square root of the sum of squared differences in branch lengths between the true tree and each posterior tree sample (Kuhner and Felsenstein, 1994); the point and bars represent the posterior mean and equal-tailed 95% credible interval, respectively. P-values are shown for Wilcoxon signed-rank tests (Wilcoxon, 1945) comparing the paired differences in tree distances between methods. (B) Violin plots of the average standard deviation of split frequencies (ASDSF; Lakner et al., 2008; smaller is better) across the four chains of each analysis. The M_G and M_IB results (left and center) are the same as those shown in Figure 2B and Figure S8. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S10.

Results of analyses of 100 data sets with linked loci simulated along the species tree shown in (A). Each simulated data set comprised 500 loci with 100 biallelic characters. P-values are shown for Mann-Whitney U tests (Mann and Whitney, 1947) comparing the differences in tree distances between methods. For each simulation, the mutation-scaled effective population size (N_eμ) was drawn from a gamma distribution (shape = 20, mean = 0.001) and shared across all the branches of the tree; this distribution was used as the prior in analyses. Tree plotted using Gram (Version 4.0.0, Commit 02286362; Foster, 2018) and the P4 phylogenetic toolkit (Version 1.4, Commit d9c8d1b1; Foster, 2004). Other plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S11.

Results of analyses of 100 data sets with linked loci simulated along the species tree shown in (A). Each simulated data set comprised 500 loci with 100 biallelic characters. P-values are shown for Mann-Whitney U tests (Mann and Whitney, 1947) comparing the differences in tree distances between methods. For each simulation, the mutation-scaled effective population size (N_eμ) was drawn from a gamma distribution (shape = 20, mean = 0.001) and shared across all the branches of the tree; this distribution was used as the prior in analyses. Tree plotted using Gram (Version 4.0.0, Commit 02286362; Foster, 2018) and the P4 phylogenetic toolkit (Version 1.4, Commit d9c8d1b1; Foster, 2004). Other plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S12.

The performance of the M_G and M_IB tree models when applied to 100 data sets with 500 loci (each with 100 linked characters) simulated on species trees randomly drawn from the M_G tree distribution. For each simulation, the mutation-scaled effective population size (N_eμ) was drawn from a gamma distribution (shape = 20, mean = 0.001) and shared across all the branches of the tree; this distribution was used as the prior in analyses. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S13.

The performance of the M_G and M_IB tree models when applied to 100 data sets with 500 loci (each with 100 linked characters) simulated on species trees randomly drawn from the M_IB tree distribution. For each simulation, the mutation-scaled effective population size (N_eμ) was drawn from a gamma distribution (shape = 20, mean = 0.001) and shared across all the branches of the tree; this distribution was used as the prior in analyses. Plots created using the PGFPlotsX (Version 1.2.10, Commit 1adde3d0; Carlsson and Papp, 2021) backend of the Plots (Version 1.5.7, Commit f80ce6a2; Breloff, 2021) package in Julia (Version 1.5.4; Bezanson et al., 2017).

Figure S14.

The maximum a posteriori (MAP) topology from Figure 6A for Cyrtodactylus shown without branch lengths to make it clearer which clades are involved. Shared divergences indicated by dashed lines, with labels shown along the top that correspond to rows in Table S3, where the divergences are summarized. Created using ggplot2 (v3.3.5; Wickham, 2016), ggtree (v3.1.0; Yu et al., 2017), treeio (v1.17.0; Wang et al., 2019), cowplot (v1.1.1; Wilke, 2020), and ggrepel (v0.9.1; Slowikowski, 2020). Link to nexus-formatted annotated MAP tree: Cyrtodactylus.

Figure S15.

The maximum a posteriori (MAP) topology from Figure 6C for Gekko shown without branch lengths to make it clearer which clades are involved. Shared divergences indicated by dashed lines, with labels shown along the top that correspond to rows in Table S3, where the divergences are summarized. Created using ggplot2 (v3.3.5; Wickham, 2016), ggtree (v3.1.0; Yu et al., 2017), treeio (v1.17.0; Wang et al., 2019), cowplot (v1.1.1; Wilke, 2020), and ggrepel (v0.9.1; Slowikowski, 2020). Link to nexus-formatted annotated MAP tree: Gekko.

Figure S16.

Examples of trees (all with 8 independent, bifurcating divergences) rejected (top) and retained (bottom) when a minimum threshold of 0.001 substitutions per site between divergence times is applied to trees randomly sampled from the prior distribution of the M_IB model. Trees plotted using Gram (Version 4.0.0, Commit 02286362; Foster, 2018) and the P4 phylogenetic toolkit (Version 1.4, Commit d9c8d1b1; Foster, 2004).

2 Tables

Table S1. The data for all Cyrtodactylus samples included in our phylogenetic analyses are included in a tab-delimited text file available from the project repository and archived on Zenodo (Oaks and Wood, Jr., 2021): https://raw.githubusercontent.com/phyletica/gekgo/master/phycoeval-msg-assemblies/ipyrad-assemblies/sample-data/Cyrt_localities.tsv.

Table S2. The data for all Gekko samples included in our phylogenetic analyses are included in a tab-delimited text file available from the project repository and archived on Zenodo (Oaks and Wood, Jr., 2021): https://raw.githubusercontent.com/phyletica/gekgo/master/phycoeval-msg-assemblies/ipyrad-assemblies/sample-data/Gekko_localities.tsv.

Table S3.

Summary of shared divergences in the maximum a posteriori (MAP) phylogeny estimated under the generalized tree model for Cyrtodactylus (Figure 6A) and Gekko (Figure 6C). See Figures S14 & S15 for the shared divergence labels.

Table S4.

Convergence statistics we calculated with sumphycoeval from MCMC samples collected with phycoeval. We ran each MCMC chain for 15,000 generations, sampling every 10 generations.

Table S5.

Settings used for assembling RADseq loci for Cyrtodactylus and Gekko.

3 The generalized tree model

Let T represent a rooted, potentially multifurcating tree topology with N tips and n(t) internal nodes t = t₁, t₂, …, t_n(t), where n(t) can range from 1 (the “comb” tree) to N – 1 (fully bifurcating, independent divergences). Each internal node t is assigned to one divergence time τ, which it may share with other internal nodes in the tree. We will use τ = τ₁, …, τ_n(τ) to represent the n(τ) divergence times, where n(τ) can also range from 1 to N – 1, every τ has at least one node assigned to it, and every node maps to a divergence time more recent than that of its parent (Figure S17). Note that the number of divergence times is constrained by the number of internal nodes; n(τ) ≤ n(t). For convenience, we will index each τ from youngest to oldest. We assume the tree is ultrametric; all tips are at time zero, which we will denote as τ₀.

Figure S17.

An illustration of the generalized tree model implemented in ecoevolity. The prior distributions of the divergence times are shown to the right, and “splittable” divergence times are indicated with an asterisk to the left. Figure created using Gram (Version 4.0.0, Commit 02286362; Foster, 2018) and the P4 phylogenetic toolkit (Version 1.4, Commit d9c8d1b1; Foster, 2004).

We assume all possible topologies (T) are equally probable; see Figure S1 for an example of the sample space of topologies under the generalized tree model. We assume the age of the root node follows a parametric distribution (e.g., a gamma distribution), and each of the other divergence times is beta-distributed between the present (τ₀) and the age of the youngest parent of any node mapped to the divergence time. For example, in Figure S17, the parents of the nodes mapped to τ₁ are t₃ (parent of t₁) and t₆ (parent of t₅), which are mapped to τ₃ and τ₄, respectively, the younger of which is τ₃. Thus, divergence time τ₁ in Figure S17 follows a beta distribution that is scaled to the interval between 0 and τ₃, resulting in the prior probability density

$$f(\tau_1 \mid \tau_3, \alpha_\tau, \beta_\tau) = \frac{\tau_1^{\alpha_\tau - 1} (\tau_3 - \tau_1)^{\beta_\tau - 1}}{B(\alpha_\tau, \beta_\tau)\, \tau_3^{\alpha_\tau + \beta_\tau - 1}},$$

where α_τ and β_τ are the two positive shape parameters of the beta prior probability distribution, and B(α_τ, β_τ) is the beta function that serves as a normalizing constant. If we use y(τ_i) to denote the divergence time of the youngest parent of any node mapped to the non-root divergence time, τ_i, we can generalize this beta prior probability density as

$$f(\tau_i \mid y(\tau_i), \alpha_\tau, \beta_\tau) = \frac{\tau_i^{\alpha_\tau - 1} (y(\tau_i) - \tau_i)^{\beta_\tau - 1}}{B(\alpha_\tau, \beta_\tau)\, y(\tau_i)^{\alpha_\tau + \beta_\tau - 1}}. \tag{4}$$

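For additional flexibility, we allow a distribution to be placed on the alpha parameter of the beta distributions on the non-root divergence times (α_τ). However, we constrain β_τ = 1, so that B(α_τ, 1) = 1/α_τ, simplifying the prior probability density of each non-root divergence time to

$$f(\tau_i \mid y(\tau_i), \alpha_\tau) = \frac{\alpha_\tau\, \tau_i^{\alpha_\tau - 1}}{y(\tau_i)^{\alpha_\tau}}.$$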

For all of our analyses presented in this paper, we constrained α_τ = 1, which further simplifies the prior probability of each non-root divergence time to the uniform density

$$f(\tau_i \mid y(\tau_i)) = \frac{1}{y(\tau_i)}.$$

For simplicity below, we denote the probability density of a divergence time as f(τ_i | y(τ_i)), but our work generalizes to any proper probability distributions on the divergence times, including the full beta distribution (Equation 4).
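As a rough numerical check of the divergence-time prior described above, the following sketch evaluates a beta density rescaled to the interval between the present and the youngest parent age. The function name `scaled_beta_density` and its arguments are hypothetical and are not taken from ecoevolity; the formula is the standard density of a Beta(α, β) variable multiplied by the interval length y.

```python
import math


def scaled_beta_density(tau, y, alpha=1.0, beta=1.0):
    """Density at `tau` of a Beta(alpha, beta) variable rescaled to (0, y).

    `y` plays the role of the age of the youngest parent, y(tau_i).
    This is an illustrative helper, not code from the paper.
    """
    if not 0.0 < tau < y:
        return 0.0
    # log of the beta function B(alpha, beta)
    log_b = math.lgamma(alpha) + math.lgamma(beta) - math.lgamma(alpha + beta)
    log_density = ((alpha - 1.0) * math.log(tau)
                   + (beta - 1.0) * math.log(y - tau)
                   - log_b
                   - (alpha + beta - 1.0) * math.log(y))
    return math.exp(log_density)
```

With `beta=1` the value should agree with α τ^(α–1) / y^α, and with `alpha=beta=1` it should agree with the uniform density 1/y.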

4 Approximating the posterior of the generalized tree model

We use Markov chain Monte Carlo (MCMC), specifically Metropolis-Hastings (MH) algorithms (Metropolis et al., 1953; Hastings, 1970), to sample from the generalized tree distribution. An MH algorithm works by stochastically proposing changes to parameters of a model, and using the following rule to determine the probability of accepting these moves:

$$\min\left(1,\ \text{likelihood ratio} \times \text{prior ratio} \times \text{Hastings ratio}\right). \tag{7}$$

The product of the first two terms (likelihood and prior ratios) gives us the posterior density of the state of the model being proposed divided by the posterior density of the current state of the model. The Hastings ratio corrects for asymmetry in the proposal distribution by dividing the probability density of proposing the move that would reverse the move being proposed by the probability density of the move being proposed.
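The acceptance rule described above can be sketched generically as follows. This is a minimal illustration that works on the log scale to avoid numerical underflow; the function name `mh_accept` and its signature are hypothetical and not part of phycoeval.

```python
import math
import random


def mh_accept(log_likelihood_ratio, log_prior_ratio, log_hastings_ratio, rng=random):
    """Return True if a proposed move is accepted under the MH rule.

    The acceptance probability is min(1, likelihood ratio * prior ratio *
    Hastings ratio); all three ratios are supplied as logarithms.
    """
    log_accept = log_likelihood_ratio + log_prior_ratio + log_hastings_ratio
    if log_accept >= 0.0:
        return True  # acceptance probability is 1
    # accept with probability exp(log_accept) < 1
    return rng.random() < math.exp(log_accept)
```

When sampling from the prior (as in the validation analyses described later), the log likelihood ratio would simply be set to zero.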

Below we describe MH moves for sampling from the generalized tree distribution, and derive the prior and Hastings ratio for each. These moves can be coupled with any likelihood function to calculate the likelihood ratio and sample from the posterior distribution of generalized trees using MH (Equation 7). In Sections 4.1–4.4, we describe a pair of reversible-jump moves, “split-time” and “merge-times,” for moving between trees with different numbers of divergence times. These are the core moves that allow the full space of the generalized tree distribution to be traversed and sampled. In Section 4.5, we describe moves that propose changes to the topology, but do not change the number of divergence-time parameters, nor their values. In Section 4.6, we describe a move that updates the values of divergence times without changing the topology (though we do describe an extension of this move that does update the topology).

4.1 Split-time move

To generalize the space of trees, we introduce two reversible-jump moves: “split-time” and “merge-times.” When a reversible-jump move is to be attempted, the merge-times or split-time move is chosen with probability 0.5, except for two special cases:

If the current state of the chain is the most general tree model (n(τ) = N – 1), then the merge-times move is chosen with probability 1.
If the current state of the chain is the “comb” tree (n(τ) = 1), then the split-time move is chosen with probability 1.

The basic idea of the split-time move is to randomly divide the nodes mapped to a “splittable” divergence time into two nonempty sets, and assign one of the sets to a new, more recent divergence time. A divergence time is considered splittable if it has (1) more than one node mapped to it, or (2) a single multifurcating node mapped to it. For example, in Figure S17, divergence times τ₁, τ₂, and τ₄ are splittable. The first step is to randomly choose a divergence time, τ_i, from among the splittable divergence times. After dividing the nodes of τ_i into two sets, we need to randomly select a time more recent than τ_i to which to assign one of the sets.
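The choice between the two reversible-jump moves and the definition of a splittable divergence time can be sketched as below. The data layout (a list, ordered from youngest to oldest divergence time, of the child counts of the nodes mapped to each time) and both function names are assumptions made for illustration, and the example tree is hypothetical rather than the tree in Figure S17.

```python
def split_move_probability(n_times, n_tips):
    """Probability of attempting split-time rather than merge-times.

    Assumes n_tips >= 3, so that at least one of the two moves is possible.
    """
    if n_times == 1:            # comb tree: only a split is possible
        return 1.0
    if n_times == n_tips - 1:   # most general tree: only a merge is possible
        return 0.0
    return 0.5


def splittable_times(child_counts_by_time):
    """Indices of divergence times that can be split.

    A time is splittable if more than one node maps to it, or if its
    single node is multifurcating (more than two children).
    """
    return [i for i, counts in enumerate(child_counts_by_time)
            if len(counts) > 1 or counts[0] > 2]


# Hypothetical tree: two bifurcating nodes share the youngest time, the
# next time has one bifurcating node, and so on.
example = [[2, 2], [2], [2, 2], [4]]
print(splittable_times(example))
```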

4.1.1 Drawing the new divergence time

To get the new, proposed divergence time, we randomly draw a new divergence time between τ_i–1 and τ_i from a proposal distribution,

$$\tau' \sim g(\tau' \mid \tau_i, \tau_{i-1}),$$

where g(τ′ | τ_i, τ_i–1) is the conditional probability density of proposing the new time τ′ given the current values of τ_i and τ_i–1. In our implementation, we use a beta probability distribution scaled and shifted to the interval between τ_i–1 and τ_i so that the probability density of the new time is

$$g(\tau' \mid \tau_i, \tau_{i-1}) = \frac{(\tau' - \tau_{i-1})^{\alpha - 1} (\tau_i - \tau')^{\beta - 1}}{B(\alpha, \beta)\, (\tau_i - \tau_{i-1})^{\alpha + \beta - 1}},$$

where α and β are the two positive shape parameters of the beta distribution, and B(α, β) is the beta function. For generality and simplicity, we will use g_τ to denote this probability density of the proposed divergence time below.
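A minimal sketch of drawing the new time and evaluating its proposal density (g_τ) follows. It assumes a beta distribution scaled and shifted to the interval between the two neighboring divergence times, as described above; the function names and the example shape parameters are illustrative choices, not values reported in the paper.

```python
import math
import random


def draw_new_time(tau_younger, tau_older, alpha, beta, rng=random):
    """Draw a time between tau_younger and tau_older from a scaled, shifted beta."""
    x = rng.betavariate(alpha, beta)  # standard beta variate on [0, 1]
    return tau_younger + x * (tau_older - tau_younger)


def proposal_density(new_time, tau_younger, tau_older, alpha, beta):
    """Density g_tau of `new_time` under the scaled, shifted beta proposal."""
    width = tau_older - tau_younger
    x = (new_time - tau_younger) / width
    if not 0.0 < x < 1.0:
        return 0.0
    log_b = math.lgamma(alpha) + math.lgamma(beta) - math.lgamma(alpha + beta)
    log_standard = (alpha - 1.0) * math.log(x) + (beta - 1.0) * math.log(1.0 - x) - log_b
    # dividing by the interval width accounts for the change of scale
    return math.exp(log_standard) / width
```

With `alpha = beta = 1` the proposal is uniform on the interval and the density is one over the interval width.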

4.1.2 Prior ratio

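The prior ratio for the split-time move is

$$\frac{f(T', \tau')}{f(T, \tau)}, \tag{10}$$

where f(T′, τ′) is the prior probability of the proposed tree topology and divergence times. In our implementation, we assume that all possible tree topologies (across n(τ) = 1, 2, …, N – 1) are equally probable a priori. We also assume the divergence time of the root (τ_n(τ)) is gamma-distributed, and each of the other divergence times is beta-distributed between the present (τ₀) and the divergence time of the youngest parent of a node mapped to τ_i, which we denote as y(τ_i) (Figure S17; Equation 4). Given these assumptions, the prior ratio becomes

$$\frac{f(T', \tau')}{f(T, \tau)} = \frac{f(\tau' \mid y(\tau'))\, f(\tau_{i-1} \mid y(\tau_{i-1})')}{f(\tau_{i-1} \mid y(\tau_{i-1}))},$$

where τ′ on the right-hand side is the newly proposed divergence time, and y(τ_i–1)′ is the divergence time of the youngest parent of the nodes mapped to τ_i–1 after the proposed split.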

If i = 1 (i.e., the divergence time selected to split was the most recent divergence), then τ_i–1 is the present, and so f(τ_i–1 | y(τ_i–1)′) = f(τ_i–1 | y(τ_i–1)) = 1. Also, if none of the nodes assigned to τ_i–1 has a parent assigned to the newly proposed divergence time (i.e., y(τ_i–1)′ ≠ τ′), then y(τ_i–1)′ = y(τ_i–1); e.g., in Figure S17, if τ₂ is split, the prior probability density of τ₁ is not affected, because the youngest parent of a node mapped to τ₁ is mapped to τ₃. In both of these special cases, the prior ratio further simplifies to

$$\frac{f(T', \tau')}{f(T, \tau)} = f(\tau' \mid y(\tau')).$$

4.1.3 Hastings ratio

The probability of proposing a split-time move involves several components. First, we have to choose to split rather than merge. We will account for the probability of this toward the end of this section. Next, we randomly choose a splittable divergence time τ_i with probability 1/n_s(τ), where n_s(τ) is the number of splittable divergence times. As described in Section 4.1.1 above, we randomly choose a new divergence time τ′ more recent than τ_i with probability density g_τ.

When we divide τ_i into two sets of nodes, if any polytomies get broken up, new branches will get added to the tree. Under certain models, each of these new branches will need values randomly drawn for parameters. For example, if using a “relaxed-clock” model, each new branch will need a substitution rate. Or, if using a multi-species coalescent model where each branch has its own effective population size, a value for this will need to be drawn. Note, this does not involve divergence-time parameters, because all nodes split from τ_i will be assigned to τ′. We will use g_z to represent the product of all the probability densities of the proposed values for the new branches. If no polytomies get broken up, or new branches created from broken polytomies do not require parameter values, then g_z = 1.

We will deal with how the nodes assigned to τ_i are divided into two sets below. For now, we will use Ξ to represent the probability of the proposed division of τ_i. The probability density of the proposed split move is then

$$g(\Theta' \mid \Theta) = \frac{1}{n_s(\tau)}\, g_\tau\, g_z\, \Xi, \tag{13}$$

where Θ and Θ′ represent the full state of the model before and after the proposed split move, respectively. The move that would exactly reverse this split move would simply entail randomly selecting the proposed divergence time from all divergence times except the root, which would then be deterministically merged with the next older divergence time. The probability of this reverse move is

$$g(\Theta \mid \Theta') = \frac{1}{n(\tau)' - 1} = \frac{1}{n(\tau)},$$

where n(τ) and n(τ)′ are the numbers of divergence times before and after the proposed split move, respectively.

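The Hastings ratio for the split move is then

$$\gamma_S \times \frac{1 / n(\tau)}{\frac{1}{n_s(\tau)}\, g_\tau\, g_z\, \Xi} = \gamma_S\, \frac{n_s(\tau)}{n(\tau)\, g_\tau\, g_z\, \Xi},$$

where γ_S represents the probability of choosing to merge in the reverse move divided by the probability of choosing to split in the forward move, which is

$$\gamma_S = \begin{cases} 1/2 & \text{if } n(\tau) = 1 \text{ and } n(\tau)' < N - 1, \\ 2 & \text{if } n(\tau) > 1 \text{ and } n(\tau)' = N - 1, \\ 1 & \text{otherwise.} \end{cases}$$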

When we are working with a tree with more than three tips, the first case occurs whenever the current tree is the comb tree (n(τ) = 1), and the second case occurs whenever the proposed tree has no shared divergences nor multifurcations (n(τ)′ = N – 1). However, for full generality we need to include the second condition in both of these cases to account for the situation where we are working with a tree with only three tips.
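The bookkeeping for the split-time Hastings ratio can be sketched as follows, under the assumptions stated in the text: the forward move picks one of n_s(τ) splittable times and draws the new time, any new branch parameters, and the node division with densities g_τ, g_z, and Ξ, while the reverse move picks the new time among the non-root divergence times of the proposed tree. The function names are hypothetical, and the case structure of `gamma_split` is a reading of the move-choice probabilities given in Section 4.1, not code from phycoeval.

```python
def gamma_split(n_times, n_tips):
    """P(choose merge in reverse move) / P(choose split in forward move)."""
    n_times_proposed = n_times + 1
    if n_times == 1 and n_times_proposed < n_tips - 1:
        return 0.5   # split was forced (prob 1); reverse merge has prob 1/2
    if n_times > 1 and n_times_proposed == n_tips - 1:
        return 2.0   # split had prob 1/2; reverse merge is forced (prob 1)
    return 1.0       # both 1/2, or both forced (three-tip tree)


def split_hastings_ratio(n_times, n_splittable, n_tips, g_tau, xi, g_z=1.0):
    """Hastings ratio of a proposed split-time move."""
    reverse = 1.0 / n_times  # pick the new time among n_times' - 1 = n_times
    forward = (1.0 / n_splittable) * g_tau * g_z * xi
    return gamma_split(n_times, n_tips) * reverse / forward
```

For a three-tip tree, `gamma_split(1, 3)` returns 1 because both the forward split and the reverse merge are chosen with probability 1.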

4.2 Merge-times move

In the “merge-times” move, we randomly choose τ_x from one of the n(τ) – 1 non-root divergence times. Then, we merge τ_x with the next older divergence time, τ_x+1. This will create shared divergence times among nodes and/or multifurcating nodes. We will use to refer to the newly merged divergence time proposed by the move.

4.2.1 Prior ratio

Generally, the prior ratio for the merge-times move is the same as Equation 10. Assuming (1) all topologies are equally probable, (2) the divergence time of the root (τ_n(τ)) is gamma-distributed, and (3) each of the other divergence times is beta-distributed between the present (τ₀) and the age of the youngest parent to the nodes mapped to the divergence time (Figure S17; Equation 4), the prior ratio becomes

If τ_x+1 is the root of the tree, then , and this probability density is given by the gamma prior distribution on the divergence time of the root (Figure S17).

4.2.2 Hastings ratio

The probability of the forward merge-times move is simply

$$g(\Theta' \mid \Theta) = \frac{1}{n(\tau) - 1} = \frac{1}{n(\tau)'},$$

where n(τ) and n(τ)′ are the numbers of divergence times before and after the proposed merge move, respectively.

Borrowing from Equation 13, the probability density of the split move that would exactly reverse the proposed merge move is

$$g(\Theta \mid \Theta') = \frac{1}{n_s(\tau)'}\, g_\tau\, g_z\, \Xi,$$

where n_s(τ)′ is the number of splittable divergence times after the proposed merge-times move.

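The Hastings ratio for the merge move is then

$$\gamma_M \times \frac{\frac{1}{n_s(\tau)'}\, g_\tau\, g_z\, \Xi}{1 / (n(\tau) - 1)} = \gamma_M\, \frac{(n(\tau) - 1)\, g_\tau\, g_z\, \Xi}{n_s(\tau)'},$$

where γ_M represents the probability of choosing to split in the reverse move divided by the probability of choosing to merge in the forward move, which is

$$\gamma_M = \begin{cases} 1/2 & \text{if } n(\tau) = N - 1 \text{ and } n(\tau)' > 1, \\ 2 & \text{if } n(\tau) < N - 1 \text{ and } n(\tau)' = 1, \\ 1 & \text{otherwise.} \end{cases}$$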
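The merge-times Hastings ratio mirrors the split-time case, and can be sketched in the same way. As before, the function names are hypothetical and the case structure of `gamma_merge` is inferred from the move-choice probabilities described in Section 4.1; `g_tau`, `g_z`, and `xi` are the densities of the split move that would reverse the merge.

```python
def gamma_merge(n_times, n_tips):
    """P(choose split in reverse move) / P(choose merge in forward move)."""
    n_times_proposed = n_times - 1
    if n_times == n_tips - 1 and n_times_proposed > 1:
        return 0.5   # merge was forced (prob 1); reverse split has prob 1/2
    if n_times < n_tips - 1 and n_times_proposed == 1:
        return 2.0   # merge had prob 1/2; reverse split is forced (prob 1)
    return 1.0


def merge_hastings_ratio(n_times, n_splittable_proposed, n_tips, g_tau, xi, g_z=1.0):
    """Hastings ratio of a proposed merge-times move."""
    forward = 1.0 / (n_times - 1)  # pick one of the non-root divergence times
    reverse = (1.0 / n_splittable_proposed) * g_tau * g_z * xi
    return gamma_merge(n_times, n_tips) * reverse / forward
```

A useful sanity check is that the Hastings ratio of a merge is the reciprocal of the Hastings ratio of the split that undoes it.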

4.3 Expanding Ξ

Up to this point, we have not dealt with how, during the split-time move, we divide the nodes mapped to τ_i into two sets, one of which gets assigned to the new divergence time drawn between τ_i–1 and τ_i. This has to be done with care to ensure that every possible configuration of two divergence times derived from the nodes assigned to τ_i can be proposed, such that it properly balances the reverse merge-times move. As above, we use Ξ to represent the probability of the proposed division of τ_i’s nodes.

In the next two sections, we show how this is done for two special cases. The first special case illustrates how we first choose which nodes currently mapped to τ_i will get moved to the new divergence time. The second special case shows how we handle any multifurcating nodes that have been chosen to be moved to the new divergence time. In the third section, we build on these special cases to show a general solution for Ξ.

4.3.1 The case of all bifurcating nodes mapped to τ_i

We will use n(t ↦ τ_i) to represent the number of nodes mapped to τ_i. If all n(t ↦ τ_i) nodes mapped to τ_i are bifurcating, we randomly divide these nodes into two non-empty sets and then randomly choose one of the two sets of nodes to move to the new divergence time. For example, this would be the case if τ₂ is chosen to split from the tree shown in Figure S17.

The number of ways the n(t ↦ τ_i) nodes can be divided into two non-empty subsets is given by the Stirling number of the second kind, which we denote as S₂(n(t ↦ τ_i), 2). We uniformly choose among these, such that the probability of randomly selecting any set partition of the n(t ↦ τ_i) nodes mapped to τ_i is 1/S₂(n(t ↦ τ_i), 2). After partitioning the nodes into two sets, there is a 1/2 probability of choosing one set to move to the new, more recent divergence time. Thus, when all of the nodes mapped to τ_i are bifurcating, the probability of each possible splitting of τ_i is

$$\Xi = \frac{1}{2\, S_2(n(t \mapsto \tau_i), 2)}.$$
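For the all-bifurcating case, the counting argument above can be checked by brute force. The sketch below uses the closed form S₂(n, 2) = 2^(n–1) – 1 and enumerates every ordered division of the nodes into a set that stays and a non-empty set that moves; the function names are illustrative.

```python
from itertools import combinations


def stirling2_two_blocks(n):
    """Stirling number of the second kind S2(n, 2) = 2**(n - 1) - 1."""
    return 2 ** (n - 1) - 1 if n >= 1 else 0


def enumerate_splits(nodes):
    """All (stay, move) divisions of `nodes` into two non-empty sets."""
    nodes = list(nodes)
    splits = []
    for k in range(1, len(nodes)):          # size of the set that moves
        for move in combinations(nodes, k):
            stay = tuple(x for x in nodes if x not in move)
            splits.append((stay, move))
    return splits


nodes = ["a", "b", "c"]
splits = enumerate_splits(nodes)
# Each split should be proposed with probability 1 / (2 * S2(n, 2)).
assert len(splits) == 2 * stirling2_two_blocks(len(nodes))
```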

4.3.2 The case of a single polytomy mapped to τ_i

Next, let’s consider another special case where the number of nodes mapped to τ_i is one (i.e., a single polytomy). For example, this would be the case if τ₄ is chosen to split from the tree shown in Figure S17. In this case, we randomly resolve the polytomy by randomly (uniformly) choosing a set partition of the descending branches into non-empty subsets. Any subsets with only one branch remain attached to the original polytomy node, while each subset with multiple branches gets split off to form a new node (clade) that descends from the original polytomy node. These new nodes are assigned to the new, more recent divergence time τ′. The number of ways to partition the descending branches of the polytomy is thus B_b – 2, where B_b is the Bell number (Bell, 1934), i.e., the number of possible set partitions of the b branches descending from the polytomy. We have to subtract 2 from B_b, because we do not allow the two “extreme” set partitions with one or b subsets. The former would move the whole polytomy to the new divergence time, and the latter would leave the polytomy as is. Neither of these scenarios adds a dimension (divergence time) to the model. We avoid these two scenarios using rejection so that the remaining partitions of the b descending branches are chosen uniformly. Thus, when only a single polytomy is mapped to τ_i, the probability of each possible splitting of τ_i is

$$\Xi = \frac{1}{B_b - 2}.$$
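The single-polytomy case can also be checked by enumeration. The sketch below computes Bell numbers with the Bell triangle and lists the set partitions of the descending branches, keeping only those with more than one and fewer than b subsets. Filtering an enumerated list is used here only for illustration; it yields the same uniform distribution over the remaining partitions as the rejection step described above. All names are hypothetical.

```python
def bell_number(n):
    """Bell number B_n computed with the Bell triangle."""
    row = [1]
    for _ in range(n):
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        row = new_row
    return row[0]


def set_partitions(items):
    """Yield every set partition of `items` as a list of blocks."""
    items = list(items)
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        # add `first` to each existing block in turn ...
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        # ... or give it a block of its own
        yield [[first]] + partition


def valid_resolutions(branches):
    """Partitions allowed when splitting a lone polytomy (1 < blocks < b)."""
    branches = list(branches)
    return [p for p in set_partitions(branches) if 1 < len(p) < len(branches)]


# A polytomy with four descending branches: B_4 - 2 valid resolutions.
assert len(valid_resolutions("ABCD")) == bell_number(4) - 2
```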

4.3.3 The case when multiple nodes, including at least one polytomy, are mapped to τ_i

When multiple nodes are mapped to τ_i, and at least one is a polytomy, we need to do some more accounting to ensure that we can reach every possible arrangement of two divergence times that can be merged to form the current configuration of nodes mapped to τ_i. Similar to the case with all bifurcating nodes, we will first divide the n(t ↦ τ_i) nodes mapped to τ_i into two subsets and randomly choose one of these subsets to move to the proposed, more recent divergence time, τ′. For each multifurcating node that ends up in the subset to be moved to τ′ (if any), we need to either break up the polytomy, as we did in the single-polytomy case above, or move the entire polytomy to τ′.

Unlike in the case of only bifurcating nodes mapped to τ_i, when we partition the n(t ↦ τ_i) nodes mapped to τ_i into two sets, we must allow for the case where all n(t ↦ τ_i) nodes end up in the set to move to τ′. This is because, if any of the polytomy nodes get broken up, they will leave at least one node at τ_i, and the dimension of the model will change (i.e., the number of divergence times will increase by one). So, we have to allow an empty subset when we randomly partition the n(t ↦ τ_i) nodes into two subsets. However, we cannot allow the empty subset to be chosen to move to τ′. There are S₂(n(t ↦ τ_i), 2) + 1 ways to partition the n(t ↦ τ_i) nodes mapped to τ_i into two subsets if we allow the set partition with one empty subset. For each of these, there are two ways to choose the subset to move to τ′, and of all of these, there is one scenario we will reject: if the empty set gets selected to move to τ′. Thus, there are 2[S₂(n(t ↦ τ_i), 2) + 1] – 1 = 2S₂(n(t ↦ τ_i), 2) + 1 ways to choose a subset of the nodes assigned to τ_i for moving to the new divergence time, and the probability of each is

$$\frac{1}{2\, S_2(n(t \mapsto \tau_i), 2) + 1}.$$

For each polytomy mapped to τ_i that ends up in the set of nodes to move to τ′ (if any), we randomly choose one of the B_b possible set partitions of the b branches descending from the polytomy. However, we will reject the set partition with b subsets (i.e., all branches end up in their own subset). We reject this, because no subclades get broken off from the polytomy to move to τ′, and this scenario is already taken into account by the polytomy node not ending up in the set of nodes to move to τ′ in the first place. However, we need to allow the scenario where all b branches descending from a polytomy get assigned to a single set, which results in the entire polytomy node getting moved to τ′, as long as at least one node remains assigned to τ_i (we will handle this in a bit). Thus, for each polytomy in the set of nodes to be moved to τ′, there are B_b – 1 ways to move it. Using n_p(t ⇒ τ′) to represent the number of polytomies in the subset of nodes to be moved to τ′, and b_j to represent the number of branches descending from the jth of these polytomies, the total number of ways these polytomies can be moved to τ′ is

$$\prod_{j=1}^{n_p(t \Rightarrow \tau')} \left(B_{b_j} - 1\right),$$

and the probability of each is equal to

$$\Phi = \frac{1}{\prod_{j=1}^{n_p(t \Rightarrow \tau')} \left(B_{b_j} - 1\right)}.$$

If no polytomies end up in the subset of nodes to move to τ′, then Φ = 1.

However, if all n(t ↦ τ_i) nodes mapped to τ_i end up in the set of nodes to be moved to τ′, we need to reject the case where none of the polytomy nodes gets broken up (i.e., for every polytomy, all the descending branches get partitioned into a single set), because no nodes would remain assigned to τ_i, and the move would simplify to changing the value of τ_i. Thus, if all n(t ↦ τ_i) nodes mapped to τ_i end up in the set of nodes to move to τ′, the total number of ways all n_p(t ⇒ τ′) polytomies can be moved to τ′ is

$$\left[\prod_{j=1}^{n_p(t \Rightarrow \tau')} \left(B_{b_j} - 1\right)\right] - 1,$$

where b_j is the number of branches descending from the jth polytomy, and the probability of each is equal to

$$\Phi = \frac{1}{\left[\prod_{j=1}^{n_p(t \Rightarrow \tau')} \left(B_{b_j} - 1\right)\right] - 1}.$$

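Given all of this, the probability of choosing a subset of nodes from τ_i to move to the new divergence time across all possible cases is

$$\Xi = \begin{cases} \dfrac{1}{2\, S_2(n(t \mapsto \tau_i), 2)} & \text{if all nodes mapped to } \tau_i \text{ are bifurcating,} \\[2ex] \dfrac{1}{2\, S_2(n(t \mapsto \tau_i), 2) + 1} \times \dfrac{1}{\left[\prod_{j} \left(B_{b_j} - 1\right)\right] - 1} & \text{if all nodes mapped to } \tau_i \text{ are selected to move to } \tau', \\[2ex] \dfrac{1}{2\, S_2(n(t \mapsto \tau_i), 2) + 1} \times \dfrac{1}{\prod_{j} \left(B_{b_j} - 1\right)} & \text{otherwise,} \end{cases}$$

where each product is over the polytomies in the subset of nodes to be moved to τ′, b_j is the number of branches descending from the jth of these polytomies, and an empty product is equal to 1.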

Notice, the case of n(t ↦ τ_i) = 1 (i.e., only a single polytomy node mapped to τ_i) is simply a special case of the second condition above, where all the nodes assigned to τ_i end up in the set to move to the new divergence time, including at least one polytomy.
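Putting the three cases together, the probability Ξ of a particular proposed division can be sketched as below. This is an illustrative reading of the counting arguments in Sections 4.3.1 to 4.3.3, not the phycoeval implementation; the function names and the representation of the nodes (a list of child counts plus the set of indices selected to move) are assumptions.

```python
def bell_number(n):
    """Bell number B_n computed with the Bell triangle."""
    row = [1]
    for _ in range(n):
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        row = new_row
    return row[0]


def xi(child_counts, moved):
    """Probability of one specific division of the nodes mapped to tau_i.

    child_counts: number of children of each node mapped to tau_i.
    moved: indices of the nodes selected to move to the new time.
    """
    n = len(child_counts)
    if not moved:
        raise ValueError("at least one node must be selected to move")
    s2 = 2 ** (n - 1) - 1  # Stirling number S2(n, 2)
    if all(c == 2 for c in child_counts):
        if n < 2:
            raise ValueError("a single bifurcating node is not splittable")
        return 1.0 / (2 * s2)
    # At least one polytomy is mapped to tau_i.
    subset_probability = 1.0 / (2 * s2 + 1)
    ways = 1
    for i in moved:
        if child_counts[i] > 2:
            ways *= bell_number(child_counts[i]) - 1
    if len(moved) == n:
        ways -= 1  # reject moving every node without breaking any polytomy
    return subset_probability / ways


# A lone polytomy with b = 4 branches reduces to 1 / (B_4 - 2).
assert abs(xi([4], {0}) - 1.0 / (bell_number(4) - 2)) < 1e-12
```

For one bifurcating node and one trichotomy mapped to the same time, the probabilities of all possible outcomes (one outcome moving only the bifurcating node, four moving only the trichotomy, and three moving both) sum to one.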

4.4 Validation of Split-time and Merge-times moves

To validate the split-time/merge-times moves, we used them to sample from the prior distribution of trees with 5, 6, and 7 leaves. If working correctly, we should sample all n(T) tree topologies with an equal frequency of 1/n(T). If we collect MCMC samples from the prior distribution, the number of times a topology is sampled should be approximately distributed as Binomial(n, 1/n(T)); i.e., binomially distributed where the number of “trials” (n) is equal to the number of samples, and the probability of sampling each topology is 1/n(T). We found a close match between the number of times each tree was sampled by our reversible-jump MCMC chain and the expected number, and failed to reject the expected binomial distribution using χ² goodness-of-fit tests (Figure S18; p = 0.742, 0.464, and 0.172 for the tests with 5-, 6-, and 7-leaved trees, respectively).
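The comparison of observed and expected topology counts can be sketched with a plain χ² statistic. The function below is a generic illustration, not the script used for Figure S18; it assumes the counts of every possible topology (including zeros) are supplied, and it returns the statistic and degrees of freedom rather than a p-value so that it depends only on the standard library.

```python
def chi_square_uniform(counts):
    """Chi-square statistic for counts expected to be uniform across categories.

    counts: number of MCMC samples in which each topology was observed.
    Returns (statistic, degrees_of_freedom).
    """
    n_samples = sum(counts)
    n_categories = len(counts)
    expected = n_samples / n_categories  # same expectation for every topology
    statistic = sum((c - expected) ** 2 / expected for c in counts)
    return statistic, n_categories - 1
```

If SciPy is available, a p-value could then be obtained from the upper tail of the χ² distribution, e.g., with `scipy.stats.chi2.sf(statistic, degrees_of_freedom)`.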

Figure S18.

Comparing the expected to the observed number of times each topology is sampled by an MCMC chain using our split-time and merge-times moves. Under our generalized tree distribution, how often each topology is sampled should follow a Binomial(n, 1/n(T)) distribution, where n is the total number of MCMC samples and n(T) is the number of possible topologies.

4.5 Nested-neighbor-node-swap move

The split-time and merge-times moves can sample all of the space of the generalized tree distribution (assuming the age of the root node is fixed). However, we implemented additional topology moves that do not jump between tree models with different numbers of divergence times.

The goal of this move is to change the topology without changing the number or timing of divergences. We start by randomly picking a non-root divergence time, τ_i, with probability 1/(n(τ) – 1). Next, we find the divergence time, τ_j, that contains the node that is the youngest parent of nodes mapped to τ_i. We then randomly pick one of the nodes mapped to τ_j that has children mapped to τ_i, which we will call t_a. Each child of t_a that is mapped to τ_i will randomly contribute one of its children to a “swap pool” of nodes. If t_a has children that are not mapped to τ_i, we randomly pick one of these children and add it to the swap pool if it is younger than τ_i. If the selected child of t_a is older than τ_i, we randomly sample one of its children and continue to do so until we have chosen a descendant node that is younger than τ_i, which we then add to the swap pool. Lastly, we randomly pick two nodes from the swap pool and we swap their parents.

After the proposed move, the structure of the tree rootward of the swapped nodes is the same. Because of this, the move that would reverse the proposed move would be equally probable; it would involve (1) choosing the same non-root divergence time τ_i, (2) choosing the same t_a, (3) choosing the children that swapped parents in the forward move to enter the swap pool, and (4) picking the nodes that swapped parents in the forward move from the pool and swapping their parents back. In #3, all the parent nodes involved have the same number of children as before the proposed forward move, and so the probability of the reverse move will be equal. As a result, the Hastings ratio for the move is 1.

For example, for the tree in Figure S17, if we randomly selected τ₁, the divergence containing the youngest parent of the nodes mapped to τ₁ is τ₃. Divergence time τ₃ only has one node that is a parent of nodes assigned to τ₁, which is t₃. Node t₃ only has one child mapped to τ₁ (t₁), which will randomly contribute one of its children, Leaf H or I, to the swap pool. Node t₃ also has a child that is not mapped to τ₁: t₂, which is mapped to τ₂. Node t₂ is considered for the swap pool, but it is too old (it is older than τ₁ and thus could not become a child of t₁). So, we randomly consider one of the children of t₂, Leaf F or G, for the swap pool, either of which is young enough to be added to the swap pool. Next, we randomly choose two nodes from the swap pool, which has exactly two nodes in this case, a child of t₁ (Leaf H or I) and a child of t₂ (Leaf F or G), and these nodes swap parents. If we assume that Leaves G and H were swapped, it is clear that the reverse move that would swap them back is equally probable.
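The basic version of the move can be sketched on a small tree as follows. The tree representation (a dictionary of child lists and a dictionary of node heights), the function name, and the four-leaf example are all hypothetical; the example only mimics the part of Figure S17 described above (t₃ as parent of t₁ and t₂) and is not the full tree. Heights are compared with exact equality, which assumes nodes sharing a divergence time store the identical value.

```python
import random


def nested_neighbor_node_swap(children, height, rng=random):
    """Swap the parents of two nodes in place; returns the swapped pair.

    children: maps each internal node to a list of its children.
    height: maps every node (leaves included) to its time; leaves are 0.
    """
    parent = {c: p for p, kids in children.items() for c in kids}
    root = next(n for n in children if n not in parent)
    # 1. pick a non-root divergence time tau_i
    tau_i = rng.choice(sorted({height[n] for n in children if n != root}))
    nodes_i = [n for n in children if height[n] == tau_i]
    # 2. time of the youngest parent of the nodes mapped to tau_i
    tau_j = min(height[parent[n]] for n in nodes_i)
    # 3. pick a node at tau_j that has children mapped to tau_i
    t_a = rng.choice([n for n in children if height[n] == tau_j
                      and any(height[c] == tau_i for c in children[n])])
    # 4. build the swap pool
    pool = [rng.choice(children[c]) for c in children[t_a] if height[c] == tau_i]
    others = [c for c in children[t_a] if height[c] != tau_i]
    if others:
        node = rng.choice(others)
        while height[node] > tau_i:   # too old: descend to a random child
            node = rng.choice(children[node])
        pool.append(node)
    # 5. swap the parents of two nodes from the pool
    a, b = rng.sample(pool, 2)
    pa, pb = parent[a], parent[b]
    children[pa][children[pa].index(a)] = b
    children[pb][children[pb].index(b)] = a
    return a, b


children = {"t3": ["t1", "t2"], "t1": ["H", "I"], "t2": ["F", "G"]}
height = {"t3": 3.0, "t2": 2.0, "t1": 1.0, "F": 0.0, "G": 0.0, "H": 0.0, "I": 0.0}
swapped = nested_neighbor_node_swap(children, height, random.Random(7))
print(swapped, children)
```

Because the swapped nodes are younger than τ_i and their new parents are at τ_i or older, every parent remains older than its children and no divergence time changes.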

We also implemented variations of this move that make larger changes to the tree topology. For example, we can perform the swap for all of the nodes mapped to τ_j that have children mapped to τ_i (instead of randomly choosing one of them). Another option is to randomly permute the parents of all of the nodes in the swap pool, rather than swap the parents of just two of the nodes. By chance, when doing this permutation of the nodes in the swap pool, it is possible to end up with the same topology we started with. To avoid proposing the same state, we iteratively permute the parents of the nodes in the swap pool until we have a new topology (the parents of at least some of the nodes in the swap pool have changed). Just like with the swap move, this permutation move can be performed on one randomly selected node of τ_j that has children mapped to τ_i, or to all of them.

4.5.1 Validation of move

To validate this move, we used it to sample from a uniform distribution over the topologies of a 6-leaved bifurcating tree. There are 945 topologies for a rooted, bifurcating tree with 6 leaves (Felsenstein, 1978). If the move is working correctly, the number of times we sample each of them should follow a Binomial(n, 1/945) distribution, where n is the total number of MCMC samples. From an MCMC sample of 100,000 trees, we found a close match to this expected distribution, and were unable to reject it using a χ² goodness-of-fit test (Figure S19; p = 0.51).
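The number of rooted, bifurcating topologies used in this validation follows the double-factorial formula (2N – 3)!! for N labeled leaves, which can be checked with a short sketch. The function name is illustrative; the expected count per topology simply divides the number of MCMC samples by the number of topologies.

```python
def n_rooted_bifurcating_topologies(n_leaves):
    """Number of rooted, bifurcating, labeled topologies: (2N - 3)!!."""
    count = 1
    for k in range(3, 2 * n_leaves - 2, 2):  # 3 * 5 * ... * (2N - 3)
        count *= k
    return count


n_topologies = n_rooted_bifurcating_topologies(6)
assert n_topologies == 945

# Expected number of times each topology is sampled in 100,000 draws
# if the move samples the topologies uniformly.
expected_per_topology = 100_000 / n_topologies
```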

Figure S19.

Comparing the expected to the observed number of times each topology of a rooted, 6-leaved, bifurcating tree is sampled by our nested-neighbor-node-swap move. Our MCMC sample of 100,000 trees closely matched the expected distribution.

4.6 Divergence time slide bump move

To begin this move, we randomly pick one of the divergence times, τ_i. Next, we draw a uniform deviate, u ~ Uniform(−λ, λ), where λ is a tuning parameter that can be adjusted to improve the acceptance rate of the proposal. Then, we get a new divergence time value as τ_i e^u. For the rest of this section, we will re-index our randomly selected divergence time, τ_i, as τ₁. We then use τ₁, τ₂, …, τ_n to represent the selected time, τ₁, and all the divergence times between τ₁ and τ₁e^u that contain nodes ancestral or descendant to the nodes mapped to τ₁. Note that incrementing indices count younger or older divergence times, depending on whether τ₁e^u < τ₁ or τ₁e^u > τ₁, respectively.

The simplest case is that we do not have any intervening divergence times, and so we only have τ₁. This will happen when τ₁e^u is older than the oldest node that is a child of the nodes mapped to τ₁ and younger than the youngest node that is a parent of the nodes mapped to τ₁. In that case, we propose a new time, τ₁′, to which to slide τ₁ as

$$\tau_1' = \tau_1 e^{u}.$$

To reverse this move (slide τ₁′ back to τ₁) would be

$$\tau_1 = \tau_1' e^{u'}. \tag{32}$$

To solve for the uniform deviate that would exactly reverse the move (u′), we take the log of Equation 32 and solve for u′,

$$u' = \log \tau_1 - \log \tau_1' = -u.$$

To get the Hastings ratio for this move, we use the formula of Green (1995), which is the ratio of the probability of drawing the random deviate that would reverse the proposed move to the probability of drawing the random deviate of the proposed move, multiplied by the absolute value of the determinant of a Jacobian matrix. Because the forward and reverse random deviates are uniform, the ratio of their probability densities is 1, and the Hastings ratio reduces to just the Jacobian term,

$$\left| \det \frac{\partial(\tau_1', u')}{\partial(\tau_1, u)} \right| = \left| \det \begin{pmatrix} e^{u} & \tau_1 e^{u} \\ 0 & -1 \end{pmatrix} \right| = e^{u}.$$

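In the next simplest case, there is one intervening divergence time, τ₂. In this case, τ₁ will slide to τ₂ and “bump” it to the new time τ₁e^u. More formally, writing primes for the proposed values, the move will be

$$\tau_1' = \tau_2, \qquad \tau_2' = \tau_1 e^{u}.$$

Again, the uniform deviate that would exactly reverse this move would be u′ = −u, and the reverse move would be

$$\tau_2'' = \tau_1' = \tau_2, \qquad \tau_1'' = \tau_2' e^{u'} = \tau_1.$$

Again, the Hastings ratio reduces to just the Jacobian term,

$$\left| \det \frac{\partial(\tau_1', \tau_2', u')}{\partial(\tau_1, \tau_2, u)} \right| = \left| \det \begin{pmatrix} 0 & 1 & 0 \\ e^{u} & 0 & \tau_1 e^{u} \\ 0 & 0 & -1 \end{pmatrix} \right| = e^{u}.$$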

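To generalize this to an arbitrary number of intervening divergence times that will be bumped, we have

$$\tau_1' = \tau_2, \quad \tau_2' = \tau_3, \quad \ldots, \quad \tau_{n-1}' = \tau_n, \quad \tau_n' = \tau_1 e^{u}.$$

Again, the uniform deviate that would exactly reverse this move would be u′ = −u, and the reverse move would be

$$\tau_n'' = \tau_{n-1}', \quad \ldots, \quad \tau_2'' = \tau_1', \quad \tau_1'' = \tau_n' e^{u'} = \tau_1.$$

The Hastings ratio reduces to just the Jacobian term,

$$\left| \det \frac{\partial(\tau_1', \ldots, \tau_n', u')}{\partial(\tau_1, \ldots, \tau_n, u)} \right| = e^{u}.$$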

We avoid values of τ₁e^u that are less than zero by rejecting the proposed move. There is no upper limit for the move, because the root of the tree can be moved to an arbitrarily old divergence time. However, in our implementation, the prior on the divergence time of the root is different from the prior on the other divergence times, and can be much more informative. In such cases, we might be able to improve mixing and tuning of the move by excluding the root divergence time from the move. We do this by selecting only non-root divergence times, and rejecting any proposed moves where τ₁e^u is older than the root.
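The slide-bump proposal described in this section can be sketched as follows. This is a simplified illustration, not the authors' implementation. It assumes every divergence time lies on a single ancestor-descendant chain, so that any time between τ₁ and τ₁e^u gets bumped (the move described above only bumps times whose nodes are ancestral or descendant to the nodes mapped to τ₁). The function name `slide_bump` and its `root_time` argument are hypothetical, and the returned log Hastings ratio of u is the standard result for a multiplicative proposal τ₁e^u with a symmetric uniform deviate.

```python
import math
import random


def slide_bump(times, index, u, root_time=None):
    """Propose new divergence times by sliding times[index] to times[index] * e^u.

    Returns (new_times, log_hastings_ratio), or None if the move is rejected
    because the proposed time would be as old as or older than `root_time`.
    Assumes all times sit on one ancestor-descendant chain (a simplification).
    """
    start = times[index]
    target = start * math.exp(u)
    if root_time is not None and target >= root_time:
        return None
    low, high = min(start, target), max(start, target)
    # Intervening times, ordered from nearest to farthest from the start.
    bumped = sorted(
        (i for i, t in enumerate(times) if i != index and low < t < high),
        key=lambda i: abs(times[i] - start),
    )
    chain = [index] + bumped
    new_times = list(times)
    # Each time in the chain takes the old value of the next one ...
    for here, nxt in zip(chain, chain[1:]):
        new_times[here] = times[nxt]
    # ... and the last time in the chain lands on the proposed value.
    new_times[chain[-1]] = target
    return new_times, u


rng = random.Random(3)
lam = 0.5  # tuning parameter: half-width of the uniform window
times = [1.0, 2.0, 4.0]
u = rng.uniform(-lam, lam)
proposal = slide_bump(times, rng.randrange(len(times)), u)
```

Applying the function to the proposed state with the last bumped time selected and the deviate −u restores the original times, which is the reversibility that the Hastings ratio accounts for.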

4.6.1 An extension to this move

We can easily extend this move to also propose new topologies. Whenever we have a “bump” that involves a node and its children, we can propose a node swapping or permuting move described above (see the nested-neighbor-node-swap move and its extensions). Because we are sliding the nodes to take the position of the nodes they bump, the swap or permute moves are simplified a bit. We do not have to worry about τ_j contributing a child that is older than its potential new parents, so we never need to randomly choose descendants until we find a node that is younger than τ_i. We implemented this move, but jointly proposing changes to continuous divergence time parameters and changing the topology might lead to poor acceptance rates (Yang, 2014), so using separate moves to update divergence times and the topology is likely a better strategy.

4.6.2 Validation of this move

We used this move to sample from the prior distribution to ensure that the distribution of sampled divergence times matched the gamma-distributed prior we placed on the root age and the beta priors we placed on all other divergence times. To validate the extension of this move that also incorporates node swapping when nodes “bump,” we used it to sample from a uniform distribution over the topologies of a 6-leaved bifurcating tree. If the move is working correctly, the number of times we sample each of the 945 topologies should follow a Binomial(n, 1/945) distribution, where n is the total number of MCMC samples. We found a close match between our samples and this expected distribution, and could not reject it using a χ² goodness-of-fit test (Figure S20; p = 0.165).

Figure S20.

Comparing the expected to the observed number of times each topology of a rooted, 6-leaved, bifurcating tree is sampled by the node-swapping extension to our divergence-time-slide-bump move. Our MCMC sample of 1 million trees closely matched the expected distribution.

5 Acknowledgments

We thank Mark Holder for helpful advice with Hastings ratios and modeling the distribution on divergence times. This work was supported by funding provided to JRO from the National Science Foundation (NSF grant number DEB 1656004). The computational work was made possible by the Auburn University (AU) Hopper and Easley Clusters supported by the AU Office of Information Technology and a grant of high-performance computing resources and technical support from the Alabama Supercomputer Authority. Our gecko sampling was amassed with NSF support for fieldwork (EF-0334952, DEB 073199 and 0743491 to RMB; and 0804115 to CDS) and Fulbright grants to CDS. This paper is contribution number 949 of the Auburn University Museum of Natural History.

Footnotes

Improvements to writing and figures. Adding additional analyses to further explore the improved Markov chain Monte Carlo behavior of the new generalized tree model. Adding plots summarizing support for incorrectly merged divergence times for data simulated on trees drawn from the generalized tree model (previous version only showed results for data simulated on strictly bifurcating trees with independent divergences).
https://doi.org/10.5281/zenodo.5162056
https://doi.org/10.5281/zenodo.5162085
https://github.com/phyletica/phycoeval-experiments

References

Auton, A., G. R. Abecasis, D. M. Altshuler, R. M. Durbin, G. R. Abecasis, D. R. Bentley, A. Chakravarti, A. G. Clark, P. Donnelly, E. E. Eichler, P. Flicek, S. B. Gabriel, R. A. Gibbs, E. D. Green, M. E. Hurles, B. M. Knoppers, J. O. Korbel, E. S. Lander, C. Lee, H. Lehrach, E. R. Mardis, G. T. Marth, G. A. McVean, D. A. Nickerson, J. P. Schmidt, S. T. Sherry, J. Wang, R. K. Wilson, R. A. Gibbs, E. Boerwinkle, H. Doddapaneni, Y. Han, V. Korchina, C. Kovar, S. Lee, D. Muzny, J. G. Reid, Y. Zhu, J. Wang, Y. Chang, Q. Feng, X. Fang, X. Guo, M. Jian, H. Jiang, X. Jin, T. Lan, G. Li, J. Li, Y. Li, S. Liu, X. Liu, Y. Lu, X. Ma, M. Tang, B. Wang, G. Wang, H. Wu, R. Wu, X. Xu, Y. Yin, D. Zhang, W. Zhang, J. Zhao, M. Zhao, X. Zheng, E. S. Lander, D. M. Altshuler, S. B. Gabriel, N. Gupta, N. Gharani, L. H. Toji, N. P. Gerry, A. M. Resch, P. Flicek, J. Barker, L. Clarke, L. Gil, S. E. Hunt, G. Kelman, E. Kulesha, R. Leinonen, W. M. McLaren, R. Radhakrishnan, A. Roa, D. Smirnov, R. E. Smith, I. Streeter, A. Thormann, I. Toneva, B. Vaughan, X. Zheng-Bradley, D. R. Bentley, R. Grocock, S. Humphray, T. James, Z. Kingsbury, H. Lehrach, R. Sudbrak, M. W. Albrecht, V. S. Amstislavskiy, T. A. Borodina, M. Lienhard, F. Mertes, M. Sultan, B. Timmermann, M.-L. Yaspo, E. R. Mardis, R. K. Wilson, L. Fulton, R. Fulton, S. T. Sherry, V. Ananiev, Z. Belaia, D. Beloslyudtsev, N. Bouk, C. Chen, D. Church, R. Cohen, C. Cook, J. Garner, T. Hefferon, M. Kimelman, C. Liu, J. Lopez, P. Meric, C. O’Sullivan, Y. Ostapchuk, L. Phan, S. Ponomarov, V. Schneider, E. Shekhtman, K. Sirotkin, D. Slotta, H. Zhang, G. A. McVean, R. M. Durbin, S. Balasubramaniam, J. Burton, P. Danecek, T. M. Keane, A. Kolb-Kokocinski, S. McCarthy, J. Stalker, M. Quail, J. P. Schmidt, C. J. Davies, J. Gollub, T. Webster, B. Wong, Y. Zhan, A. Auton, C. L. Campbell, Y. Kong, A. Marcketta, R. A. Gibbs, F. Yu, L. Antunes, M. Bainbridge, D. Muzny, A. Sabo, Z. Huang, J. Wang, L. J. M. Coin, L. Fang, X. Guo, X. Jin, G. Li, Q. Li, Y. 
Li, Z. Li, H. Lin, B. Liu, R. Luo, H. Shao, Y. Xie, C. Ye, C. Yu, F. Zhang, H. Zheng, H. Zhu, C. Alkan, E. Dal, F. Kahveci, G. T. Marth, E. P. Garrison, D. Kural, W.-P. Lee, W. Fung Leong, M. Stromberg, A. N. Ward, J. Wu, M. Zhang, M. J. Daly, M. A. DePristo, R. E. Handsaker, D. M. Altshuler, E. Banks, G. Bhatia, G. del Angel, S. B. Gabriel, G. Genovese, N. Gupta, H. Li, S. Kashin, E. S. Lander, S. A. McCarroll, J. C. Nemesh, R. E. Poplin, S. C. Yoon, J. Lihm, V. Makarov, A. G. Clark, S. Gottipati, A. Keinan, J. L. Rodriguez-Flores, J. O. Korbel, T. Rausch, M. H. Fritz, A. M. Stütz, P. Flicek, K. Beal, L. Clarke, A. Datta, J. Herrero, W. M. McLaren, G. R. S. Ritchie, R. E. Smith, D. Zerbino, X. Zheng-Bradley, P. C. Sabeti, I. Shlyakhter, S. F. Schaffner, J. Vitti, D. N. Cooper, E. V. Ball, P. D. Stenson, D. R. Bentley, B. Barnes, M. Bauer, R. Keira Cheetham, A. Cox, M. Eberle, S. Humphray, S. Kahn, L. Murray, J. Peden, R. Shaw, E. E. Kenny, M. A. Batzer, M. K. Konkel, J. A. Walker, D. G. MacArthur, M. Lek, R. Sudbrak, V. S. Amstislavskiy, R. Herwig, E. R. Mardis, L. Ding, D. C. Koboldt, D. Larson, K. Ye, and S. Gravel. 2015. A global reference for human genetic variation. Nature 526:68–74.
Barber, B. R. and J. Klicka. 2010. Two pulses of diversification across the Isthmus of Tehuantepec in a montane Mexican bird fauna. Proceedings Of The Royal Society B-Biological Sciences 277:2675–2681.
Bell, E. T. 1934. Exponential numbers. American Mathematical Monthly 41:411–419.
Bezanson, J., A. Edelman, S. Karpinski, and V. B. Shah. 2017. Julia: A fresh approach to numerical computing. SIAM review 59:65–98.
Blackburn, D. C., D. P. Bickford, A. C. Diesmos, D. T. Iskandar, and R. M. Brown. 2010. An ancient origin for the enigmatic flat-headed frogs (Bombinatoridae: Barbourula) from the Islands of Southeast Asia. PLoS ONE 5:10.
Breloff, T. 2021. Plots: Powerful convenience for Julia visualizations and data analysis. GitHub and archived on Zenodo.
Brooks, S. P. and A. Gelman. 1998. General methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics 7:434–455.
Brown, R. M. 2016. Biogeography of land vertebrates. Pages 211–220 in Encyclopedia of Evolutionary Biology (R. M. Kilman, ed.) vol. 1, 1st ed. Academic Press, Oxford, UK.
Brown, R. M. and A. C. Diesmos. 2009. Philippines, biology. Pages 723–732 in Encyclopedia of Islands (R. Gillespie and D. Clague, eds.). University of California Press, Berkeley.
Brown, R. M. and S. I. Guttman. 2002. Phylogenetic systematics of the Rana signata complex of Philippine and Bornean stream frogs: reconsideration of Huxley’s modification of Wallace’s line at the Oriental-Australian faunal zone interface. Biological Journal of the Linnean Society 76:393–461.
Brown, R. M. and C. D. Siler. 2014. Spotted stream frog diversification at the Australasian faunal zone interface, mainland versus island comparisons, and a test of the Philippine ‘dual-umbilicus’ hypothesis. Journal of Biogeography 41:182–195.
Brown, R. M., C. D. Siler, C. H. Oliveros, J. A. Esselstyn, A. C. Diesmos, P. A. Hosner, C. W. Linkem, A. J. Barley, J. R. Oaks, M. B. Sanguila, L. J. Welton, R. G. Moyle, A. T. Peterson, and A. C. Alcala. 2013. Evolutionary processes of diversification in a model island archipelago. Annual Review of Ecology, Evolution, and Systematics 44:411–435.
Brown, R. M., C. D. Siler, S. J. Richards, A. C. Diesmos, and D. C. Cannatella. 2015. Multilocus phylogeny and a new classification for Southeast Asian and Melanesian forest frogs (family Ceratobatrachidae). Zoological Journal of the Linnean Society 174:130–168.
Brown, R. M., Y.-C. Su, B. Barger, C. D. Siler, M. B. Sanguila, A. C. Diesmos, and D. C. Blackburn. 2016. Phylogeny of the island archipelago frog genus Sanguirana: Another endemic Philippine radiation that diversified ’out-of-Palawan’. Molecular Phylogenetics and Evolution 94:531–536.
Bryant, D., R. Bouckaert, J. Felsenstein, N. A. Rosenberg, and A. Roychoudhury. 2012. Inferring species trees directly from biallelic genetic markers: Bypassing gene trees in a full coalescent analysis. Molecular Biology and Evolution 29:1917–1932.
Carlsson, K. and T. K. Papp. 2021. PGFPlotsX: a Julia package to generate publication quality figures using the LaTeX library PGFPlots. GitHub.
Catibog-Sinha, C. S. and L. R. Heaney. 2006. Philippine Biodiversity: Principles and Practice. Haribon Foundation, Quezon City, Philippines.
Chan, K. O. and R. M. Brown. 2017. Did true frogs ’dispersify’? Biology Letters 13:20170299.
Chifman, J. and L. Kubatko. 2014. Quartet inference from SNP data under the coalescent model. Bioinformatics 30:3317–3324.
Choquet, M., I. Smolina, A. K. S. Dhanasiri, L. Blanco-Bercial, M. Kopp, A. Jueterbock, A. Y. M. Sundaram, and G. Hoarau. 2019. Towards population genomics in non-model species with large genomes: a case study of the marine zooplankton Calanus finmarchicus. Royal Society Open Science 6:180608.
Clark, J. W. and P. C. J. Donoghue. 2017. Constraining the timing of whole genome duplication in plant evolutionary history. Proceedings of the Royal Society B: Biological Sciences 284:20170912.
Daza, J. M., T. A. Castoe, and C. L. Parkinson. 2010. Using regional comparative phylogeographic data from snake lineages to infer historical processes in Middle America. Ecography 33:343–354.
Diamond, J. M. and M. E. Gilpin. 1983. Biogeographic umbilici and the origin of the Philippine avifauna. Oikos 41:307–321.
Dickerson, R. E. 1928. Distribution of life in the Philippines. Philippine Bureau of Science, Manila, Philippines.
Doyle, J. J. and A. N. Egan. 2010. Dating the origins of polyploidy events. New Phytologist 186:73–85.
Drummond, A. J., S. Y. W. Ho, M. J. Phillips, and A. Rambaut. 2006. Relaxed phylogenetics and dating with confidence. PLoS Biology 4:e88.
Drummond, A. J. and M. A. Suchard. 2010. Bayesian random local clocks, or one rate to rule them all. BMC Biology 8:114.
Eaton, D. A. R. and I. Overcast. 2020. ipyrad: Interactive assembly and analysis of RADseq datasets. Bioinformatics 36:2592–2594.
Evans, B., R. Brown, J. Mcguire, J. Supriatna, N. Andayani, A. Diesmos, D. Iskandar, D. Melnick, and D. Cannatella. 2003. Phylogenetics of fanged frogs: Testing biogeographical hypotheses at the interface of the Asian and Australian faunal zones. Systematic Biology 52:794–819.
Felsenstein, J. 1978. The number of evolutionary trees. Systematic Biology 27:27–33.
Foster, P. G. 2004. Modeling compositional heterogeneity. Systematic Biology 53:485–495.
Foster, P. G. 2018. Gram version 4.0.0. http://gram.nhm.ac.uk/.
Gavryushkina, A., T. A. Heath, D. T. Ksepka, T. Stadler, D. Welch, and A. J. Drummond. 2016. Bayesian total-evidence dating reveals the recent crown radiation of penguins. Systematic Biology 66:57–73.
Gearty, W. 2021. deeptime: Plotting Tools for Anyone Working in Deep Time. R package version 0.0.6.
Gong, L. and J. M. Flegal. 2016. A practical sequential stopping rule for high-dimensional Markov chain Monte Carlo. Journal of Computational and Graphical Statistics 25:684–700.
Green, P. J. 1995. Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82:711–732.
Grismer, L. L., N. A. Poyarkov, E. S. H. Quah, J. L. Grismer, and P. L. Wood, Jr. 2022. The biogeography of bent-toed geckos, Cyrtodactylus (Squamata: Gekkonidae). PeerJ 10:e13153.
Haq, B. U., J. Hardenbol, and P. R. Vail. 1987. Chronology of fluctuating sea levels since the Triassic. Science 235:1156–1167.
Hastings, W. K. 1970. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57:97–109.
Heaney, L. R. 1985. Zoogeographic evidence for middle and late pleistocene land bridges to the philippine islands. Mod Quatern Res SE Asia 9:127–144.
Heaney, L. R., D. S. Balete, E. A. Rickart, P. A. Alviola, M. R. M. Duya, M. V. Duya, M. J. Veluz, L. VandeVrede, and S. J. Steppan. 2011. Chapter 1: Seven new species and a new subgenus of forest mice (Rodentia: Muridae: Apomys) from Luzon Island. Fieldiana Life and Earth Sciences 2:1–60.
Heaney, L. R. and J. C. Regalado, Jr. 1998. Vanishing treasures of the Philippine rain forest. Field Museum, Chicago, Illinois.
Heaney, L. R., J. S. Walsh, and A. T. Peterson. 2005. The roles of geological history and colonization abilities in genetic differentiation between mammalian populations in the Philippine Archipelago. Journal of Biogeography 32:229–247.
Heath, T. A., M. T. Holder, and J. P. Huelsenbeck. 2011. A Dirichlet process prior for estimating lineage-specific substitution rates. Molecular Biology and Evolution 29:939–955.
Heath, T. A., J. P. Huelsenbeck, and T. Stadler. 2014. The fossilized birth-death process: A coherent model of fossil calibration for divergence time estimation. Proceedings of the National Academy of Sciences 111:E2957–E2966.
Heled, J. and A. J. Drummond. 2010. Bayesian inference of species trees from multilocus data. Molecular Biology and Evolution 27:570–580.
Hickerson, M. J., E. A. Stahl, and H. A. Lessios. 2006. Test for simultaneous divergence using approximate Bayesian computation. Evolution 60:2435–2453.
Hoelzer, G. A. and D. J. Meinick. 1994. Patterns of speciation and limits to phylogenetic resolution. Trends in Ecology & Evolution 9:104–107.
Hohenlohe, P. A., S. Bassham, P. D. Etter, N. Stiffler, E. A. Johnson, and W. A. Cresko. 2010. Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLOS Genetics 6:1–23.
Hosner, P. A., A. S. Nyári, and R. G. Moyle. 2013. Water barriers and intra-island isolation contribute to diversification in the insular Aethopyga sunbirds (Aves: Nectariniidae). Journal of Biogeography 40:1094–1106.
Huang, W., N. Takebayashi, Y. Qi, and M. J. Hickerson. 2011. MTML-msBayes: Approximate Bayesian comparative phylogeographic inference from multiple taxa and multiple loci with rate heterogeneity. BMC Bioinformatics 12:1.
Huxley, T. H. 1868. On the classification and the distribution of the Alectoromorphae and Heteromorphae. Proceedings of the Zoological Society of London 6:249–319.
Inger, R. F. 1954. Systematics and zoogeography of Philippine Amphibia. Fieldiana 33:182–531.
Jansa, S. A., F. K. Barker, and L. R. Heaney. 2006. The pattern and timing of diversification of Philippine endemic rodents: Evidence from mitochondrial and nuclear gene sequences. Systematic Biology 55:73–88.
Jiao, Y., N. J. Wickett, S. Ayyampalayam, A. S. Chanderbali, L. Landherr, P. E. Ralph, L. P. Tomsho, Y. Hu, H. Liang, P. S. Soltis, D. E. Soltis, S. W. Clifton, S. E. Schlarbaum, S. C. Schuster, H. Ma, J. Leebens-Mack, and C. W. dePamphilis. 2011. Ancestral polyploidy in seed plants and angiosperms. Nature 473:97–100.
Jukes, T. H. and C. R. Cantor. 1969. Evolution of protein molecules. chap. 24, Pages 21–132 in Mammalian Protein Metabolism (H. N. Munro, ed.) vol. III. Academic Press, New York.
Justiniano, R., J. J. Schenk, D. S. Balete, E. A. Rickart, J. A. Esselstyn, L. R. Heaney, and S. J. Steppan. 2015. Testing diversification models of endemic Philippine forest mice (Apomys) with nuclear phylogenies across elevational gradients reveals repeated colonization of isolated mountain ranges. Journal of Biogeography 42:51–64.
Kingman, J. F. C. 1982a. The coalescent. Stochastic processes and their applications 13:235–248.
Kingman, J. F. C. 1982b. On the genealogy of large populations. Journal of Applied Probability 19:27–43.
Kishino, H., J. L. Thorne, and W. J. Bruno. 2001. Performance of a divergence time estimation method under a probabilistic model of rate evolution. Molecular Biology and Evolution 18:352–361.
Klinkenberg, D., J. A. Backer, X. Didelot, C. Colijn, and J. Wallinga. 2017. Simultaneous inference of phylogenetic and transmission trees in infectious disease outbreaks. PLOS Computational Biology 13:1–32.
Kuhner, M. K. and J. Felsenstein. 1994. A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates (erratum in mol. biol. evol. 1995; 12, 525). Molecular Biology and Evolution 11:459–468.
Lakner, C., P. van der Mark, J. P. Huelsenbeck, B. Larget, and F. Ronquist. 2008. Efficiency of Markov chain Monte Carlo tree proposals in Bayesian phylogenetics. Systematic Biology 57:86–103.
Leaché, A. D., S. C. Crews, and M. J. Hickerson. 2007. Two waves of diversification in mammals and reptiles of Baja California revealed by hierarchical Bayesian analysis. Biology Letters 3:646–650.
Lewis, P. O., M. T. Holder, and K. E. Holsinger. 2005. Polytomies and Bayesian phylogenetic inference. Systematic Biology 54:241–253.
Li, Z., G. P. Tiley, S. R. Galuska, C. R. Reardon, T. I. Kidder, R. J. Rundell, and M. S. Barker. 2018. Multiple large-scale gene and genome duplications during the evolution of hexapods. Proceedings of the National Academy of Sciences 115:4713–4718.
Linkem, C. W., A. C. Diesmos, and R. M. Brown. 2011. Molecular systematics of the Philippine forest skinks (Squamata: Scincidae: Sphenomorphus): testing morphological hypotheses of interspecific relationships. Zoological Journal of the Linnean Society 163:1217–1243.
Linkem, C. W., K. M. Hesed, A. C. Diesmos, and R. M. Brown. 2010. Species boundaries and cryptic lineage diversity in a Philippine forest skink complex (Reptilia; Squamata; Scincidae: Lygosominae). Molecular Phylogenetics and Evolution 56:572–585.
Liu, L. and D. K. Pearl. 2007. Species trees from gene trees: Reconstructing Bayesian posterior distributions of a species phylogeny using estimatated gene tree distributions. Systematic Biology 56:504–514.
Lomolino, M. V., B. R. Riddle, and J. H. Brown. 2016. Biogeography. 5th ed. Sinauer Associates, Sunderland, Massachusetts, USA.
Mann, H. B. and D. R. Whitney. 1947. On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics 18:50–60.
Metropolis, N., A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, and E. Teller. 1953. Equation of state calculations by fast computing machines. The Journal of Chemical Physics 21:1087–1092.
Miller, K. G., M. A. Kominz, J. V. Browning, J. D. Wright, G. S. Mountain, M. E. Katz, P. J. Sugarman, B. S. Cramer, N. Christie-Blick, and S. F. Pekar. 2005. The Phanerozoic record of global sea-level change. Science 310:1293–1298.
Nielsen, R. and J. Wakeley. 2001. Distinguishing migration from isolation: A Markov chain Monte Carlo approach. Genetics 158:885–896.
Oaks, J. R. 2014. An improved approximate-bayesian model-choice method for estimating shared evolutionary history. BMC Evolutionary Biology 14:150.
Oaks, J. R. 2019. Full Bayesian comparative phylogeography from genomic data. Systematic Biology 68:371–395.
Oaks, J. R. 2021. Analyses exploring the behavior of generalized bayesian phylogenetics: Version 1.0.0. GitHub and archived on Zenodo; https://doi.org/10.5281/zenodo.5162056.
Oaks, J. R., N. L’Bahy, and K. A. Cobb. 2020. Insights from a general, full-likelihood Bayesian approach to inferring shared evolutionary events from genomic data: Inferring shared demographic events is challenging. Evolution 74:2184–2206.
Oaks, J. R., C. D. Siler, and R. M. Brown. 2019. The comparative biogeography of Philippine geckos challenges predictions from a paradigm of climate-driven vicariant diversification across an island archipelago. Evolution 73:1151–1167.
Oaks, J. R., J. Sukumaran, J. A. Esselstyn, C. W. Linkem, C. D. Siler, M. T. Holder, and R. M. Brown. 2013. Evidence for climate-driven diversification? a caution for interpreting ABC inferences of simultaneous historical events. Evolution 67:991–1010.
Oaks, J. R. and P. L. Wood, Jr. 2021. Open-science notebook for the comparative phylogenetics of philippine gekkonids: Version 2. GitHub and archived on Zenodo; https://doi.org/10.5281/zenodo.5162085.
Plouviez, S., T. M. Shank, B. Faure, C. Daguin-Thiebaut, F. Viard, F. H. Lallier, and D. Jollivet. 2009. Comparative phylogeography among hydrothermal vent species along the East Pacific Rise reveals vicariant processes and population expansion in the South. Molecular Ecology 18:3903–3917.
Pybus, O. G. and A. Rambaut. 2009. Evolutionary analysis of the dynamics of viral infectious disease. Nature Reviews Genetics 10:540–550.
Rannala, B. and Z. Yang. 2003. Bayes estimation of species divergence times and ancestral population sizes using DNA sequences from multiple loci. Genetics 164:1645–1656.
Roberts, T. E. 2006. Multiple levels of allopatric divergence in the endemic Philippine fruit bat Haplonycteris fischeri (Pteropodidae). Biological Journal of the Linnean Society 88:329–349.
Robinson, D. F. and L. R. Foulds. 1979. Comparison of weighted labelled trees. Pages 119–126 in Combinatorial Mathematics VI (A. F. Horadam and W. D. Wallis, eds.). Springer Berlin Heidelberg, Berlin, Heidelberg.
Rohling, E. J., M. Fenton, F. J. Jorissen, P. Bertrand, G. Ganssen, and J. P. Caulet. 1998. Magnitudes of sea-level lowstands of the past 500,000 years. Nature 394:162–165.
Siddall, M., E. J. Rohling, A. Almogi-Labin, C. Hemleben, D. Meischner, I. Schmelzer, and D. A. Smeed. 2003. Sea-level fluctuations during the last glacial cycle. Nature 423:853–858.
Siler, C. D., A. C. Diesmos, A. C. Alcala, and R. M. Brown. 2011. Phylogeny of Philippine slender skinks (Scincidae: Brachymeles) reveals underestimated species diversity, complex biogeographical relationships, and cryptic patterns of lineage diversification. Molecular Phylogenetics and Evolution 59:53–65.
Siler, C. D., J. R. Oaks, K. Cobb, O. Hidetoshi, and R. M. Brown. 2014. Critically endangered island endemic or peripheral population of a widespread species? conservation genetics of Kikuchi’s gecko and the global challenge of protecting peripheral oceanic island endemic vertebrates. Diversity and Distributions 20:756–772.
Siler, C. D., J. R. Oaks, J. A. Esselstyn, A. C. Diesmos, and R. M. Brown. 2010. Phylogeny and biogeography of Philippine bent-toed geckos (Gekkonidae: Cyrtodactylus) contradict a prevailing model of Pleistocene diversification. Molecular Phylogenetics and Evolution 55:699–710.
Siler, C. D., J. R. Oaks, L. J. Welton, C. W. Linkem, J. Swab, A. C. Diesmos, and R. M. Brown. 2012. Did geckos ride the Palawan raft to the Philippines? Journal of Biogeography 39:1217–1234.
Slowikowski, K. 2020. ggrepel: Automatically Position Non-Overlapping Text Labels with ggplot2. R package ggrepel version 0.9.1.
Spratt, R. M. and L. E. Lisiecki. 2016. A Late Pleistocene sea level stack. Climate of the Past 12:1079–1092.
Stadler, T. 2010. Sampling-through-time in birth–death trees. Journal of Theoretical Biology 267:396–404.
Stadler, T., A. Gavryushkina, R. C. Warnock, A. J. Drummond, and T. A. Heath. 2018. The fossilized birth-death model for the analysis of stratigraphic range data under different speciation modes. Journal of Theoretical Biology 447:41–55.
Suchard, M. A., R. E. Weiss, and J. S. Sinsheimer. 2001. Bayesian selection of continuous-time Markov chain evolutionary models. Molecular Biology And Evolution 18:1001–1013.
Suzuki, Y., G. V. Glazko, and M. Nei. 2002. Overcredibility of molecular phylogenies obtained by bayesian phylogenetics. Proceedings of the National Academy of Sciences 99:16138–16143.
Voje, K. L., C. Hemp, Ø. Flagstad, G.-P. Saetre, and N. C. Stenseth. 2009. Climatic change as an engine for speciation in flightless Orthoptera species inhabiting African mountains. Molecular Ecology 18:93–108.
Wallace, A. R. 1869. The Malay Archipelago: The Land of the Orang-utan, and the Bird of Paradise. Macmillan and Co., London.
Wang, L.-G., T. T.-Y. Lam, S. Xu, Z. Dai, L. Zhou, T. Feng, P. Guo, C. W. Dunn, B. R. Jones, T. Bradley, H. Zhu, Y. Guan, Y. Jiang, and G. Yu. 2019. Treeio: An R package for phylogenetic tree input and output with richly annotated and associated data. Molecular Biology and Evolution 37:599–603.
Wickham, H. 2016. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York.
Wilcoxon, F. 1945. Individual comparisons by ranking methods. Biometrics Bulletin 1:80–83.
Wilke, C. O. 2020. cowplot: Streamlined Plot Theme and Plot Annotations for ggplot2. R package version 1.1.1.
Wood, Jr., P. L., X. Guo, S. L. Travers, Y.-C. Su, K. V. Olson, A. M. Bauer, L. L. Grismer, C. D. Siler, R. G. Moyle, M. J. Andersen, and R. M. Brown. 2020. Parachute geckos free fall into synonymy: Gekko phylogeny, and a new subgeneric classification, inferred from thousands of ultraconserved elements. Molecular Phylogenetics and Evolution 146:106731.
Yang, Z. 1994. Statistical properties of the maximum likelihood method of phylogenetic estimation and comparison with distance matrix methods. Systematic Biology 43:329–342.
Yang, Z. 2014. Molecular Evolution: A Statistical Approach. Oxford University Press, Oxford, United Kingdom.
Yang, Z., N. Goldman, and A. Friday. 1995. Maximum likelihood trees from DNA sequences: A peculiar statistical estimation problem. Systematic Biology 44:384–399.
Ypma, R. J. F., W. M. van Ballegooijen, and J. Wallinga. 2013. Relating phylogenetic trees to transmission trees of infectious disease outbreaks. Genetics 195:1055–1062.
Yu, G., D. K. Smith, H. Zhu, Y. Guan, and T. T.-Y. Lam. 2017. ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution 8:28–36.
Yumul, G., C. Dimalanta, V. Maglambayan, and E. Marquez. 2008. Tectonic setting of a composite terrane: A review of the Philippine island arc system. Geosciences Journal 12:7–17.

Posted April 29, 2022.

Subject Area

Evolutionary Biology

[2] ↵
Barber, B. R. and J. Klicka. 2010. Two pulses of diversification across the Isthmus of Tehuantepec in a montane Mexican bird fauna. Proceedings Of The Royal Society B-Biological Sciences 277:2675–2681.
OpenUrl CrossRef PubMed

[3] ↵
Bell, E. T. 1934. Exponential numbers. American Mathematical Monthly 41:411–419.
OpenUrl CrossRef

[4] ↵
Bezanson, J., A. Edelman, S. Karpinski, and V. B. Shah. 2017. Julia: A fresh approach to numerical computing. SIAM review 59:65–98.
OpenUrl CrossRef

[5] ↵
Blackburn, D. C., D. P. Bickford, A. C. Diesmos, D. T. Iskandar, and R. M. Brown. 2010. An ancient origin for the enigmatic flat-headed frogs (Bombinatoridae: Barbourula) from the Islands of Southeast Asia. PLoS ONE 5:10.
OpenUrl CrossRef

[6] ↵
Breloff, T. 2021. Plots: Powerful convenience for Julia visualizations and data analysis. GitHub and archived on Zenodo.

[7] ↵
Brooks, S. P. and A. Gelman. 1998. General methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics 7:434–455.
OpenUrl

[8] ↵
R. M. Kilman
Brown, R. M. 2016. Biogeography of land vertebrates. Pages 211–220 in Encyclopdia of Evolutionary Biology ( R. M. Kilman, ed.) vol. 1 1st ed. Academic Press, Oxford, UK.
OpenUrl

[9] R. M. Kilman

[10] ↵
R. Gillespie and
D. Clague
Brown, R. M. and A. C. Diesmos. 2009. Philippines, biology. Pages 723–732 in Encyclopdia of Islands ( R. Gillespie and D. Clague, eds.). University of California Press, Berkeley.

[11] R. Gillespie and

[12] D. Clague

[13] ↵
Brown, R. M. and S. I. Guttman. 2002. Phylogenetic systematics of the Rana signata complex of Philippine and Bornean stream frogs: reconsideration of Huxley’s modification of Wallace’s line at the Oriental-Australian faunal zone interface. Biological Journal of the Linnean Society 76:393–461.
OpenUrl CrossRef Web of Science

[14] ↵
Brown, R. M. and C. D. Siler. 2014. Spotted stream frog diversification at the Australasian faunal zone interface, mainland versus island comparisons, and a test of the Philippine ‘dual-umbilicus’ hypothesis. Journal of Biogeography 41:182–195.
OpenUrl CrossRef

[15] ↵
Brown, R. M., C. D. Siler, C. H. Oliveros, J. A. Esselstyn, A. C. Diesmos, P. A. Hosner, C. W. Linkem, A. J. Barley, J. R. Oaks, M. B. Sanguila, L. J. Welton, R. G. Moyle, A. T. Peterson, and A. C. Alcala. 2013. Evolutionary processes of diversification in a model island archipelago. Annual Review of Ecology, Evolution, and Systematics 44:411–435.
OpenUrl

[16] ↵
Brown, R. M., C. D. Siler, S. J. Richards, A. C. Diesmos, and D. C. Cannatella. 2015. Multilocus phylogeny and a new classification for Southeast Asian and Melanesian forest frogs (family Ceratobatrachidae). Zoological Journal of the Linnean Society 174:130–168.
OpenUrl

[17] ↵
Brown, R. M., Y.-C. Su, B. Barger, C. D. Siler, M. B. Sanguila, A. C. Diesmos, and D. C. Blackburn. 2016. Phylogeny of the island archipelago frog genus Sanguirana: Another endemic Philippine radiation that diversified ’out-of-Palawan’. Molecular Phylogenetics and Evolution 94:531–536.
OpenUrl

[18] ↵
Bryant, D., R. Bouckaert, J. Felsenstein, N. A. Rosenberg, and A. Roychoudhury. 2012. Inferring species trees directly from biallelic genetic markers: Bypassing gene trees in a full coalescent analysis. Molecular Biology and Evolution 29:1917–1932.
OpenUrl CrossRef PubMed Web of Science

[19] ↵
Carlsson, K. and T. K. Papp. 2021. PGFPlotsX: a Julia package to generate publication quality figures using the LaTeX library PGFPlots. GitHub.

[20] ↵
Catibog-Sinha, C. S. and L. R. Heaney. 2006. Philippine Biodiversity: Principles and Practice. Haribon Foundation, Quezon City, Philippines.

[21] ↵
Chan, K. O. and R. M. Brown. 2017. Did true frogs ’dispersify’? Biology Letters 13:20170299.
OpenUrl

[22] ↵
Chifman, J. and L. Kubatko. 2014. Quartet inference from SNP data under the coalescent model. Bioinformatics 30:3317–3324.
OpenUrl CrossRef PubMed Web of Science

[23] ↵
Choquet, M., I. Smolina, A. K. S. Dhanasiri, L. Blanco-Bercial, M. Kopp, A. Jueterbock, A. Y. M. Sundaram, and G. Hoarau. 2019. Towards population genomics in non-model species with large genomes: a case study of the marine zooplankton Calanus finmarchicus. Royal Society Open Science 6:180608.
OpenUrl CrossRef

[24] ↵
Clark, J. W. and P. C. J. Donoghue. 2017. Constraining the timing of whole genome duplication in plant evolutionary history. Proceedings of the Royal Society B: Biological Sciences 284:20170912.
OpenUrl CrossRef PubMed

[25] ↵
Daza, J. M., T. A. Castoe, and C. L. Parkinson. 2010. Using regional comparative phylogeographic data from snake lineages to infer historical processes in Middle America. Ecography 33:343–354.
OpenUrl Web of Science

[26] ↵
Diamond, J. M. and M. E. Gilpin. 1983. Biogeographic umbilici and the origin of the Philippine avifauna. Oikos 41:307–321.
OpenUrl CrossRef

[27] ↵
Dickerson, R. E. 1928. Distribution of life in the Philippines. Philippine Bureau of Science, Manila, Philippines.

[28] ↵
Doyle, J. J. and A. N. Egan. 2010. Dating the origins of polyploidy events. New Phytologist 186:73–85.
OpenUrl CrossRef PubMed Web of Science

[29] ↵
Drummond, A. J., S. Y. W. Ho, M. J. Phillips, and A. Rambaut. 2006. Relaxed phylogenetics and dating with confidence. PLoS Biology 4:e88.
OpenUrl CrossRef PubMed

[30] ↵
Drummond, A. J. and M. A. Suchard. 2010. Bayesian random local clocks, or one rate to rule them all. BMC Biology 8:114.
OpenUrl

[31] ↵
Eaton, D. A. R. and I. Overcast. 2020. ipyrad: Interactive assembly and analysis of RADseq datasets. Bioinformatics 36:2592–2594.
OpenUrl

[32] ↵
Evans, B., R. Brown, J. Mcguire, J. Supriatna, N. Andayani, A. Diesmos, D. Iskandar, D. Melnick, and D. Cannatella. 2003. Phylogenetics of fanged frogs: Testing biogeographical hypotheses at the interface of the Asian and Australian faunal zones. Systematic Biology 52:794–819.
OpenUrl CrossRef PubMed Web of Science

[33] ↵
Felsenstein, J. 1978. The number of evolutionary trees. Systematic Biology 27:27–33.
OpenUrl CrossRef GeoRef

[34] ↵
Foster, P. G. 2004. Modeling compositional heterogeneity. Systematic Biology 53:485–495.
OpenUrl CrossRef PubMed Web of Science

[35] ↵
Foster, P. G. 2018. Gram version 4.0.0. http://gram.nhm.ac.uk/.

[36] ↵
Gavryushkina, A., T. A. Heath, D. T. Ksepka, T. Stadler, D. Welch, and A. J. Drummond. 2016. Bayesian total-evidence dating reveals the recent crown radiation of penguins. Systematic Biology 66:57–73.
OpenUrl

[37] ↵
Gearty, W. 2021. deeptime: Plotting Tools for Anyone Working in Deep Time. R package version 0.0.6.

[38] ↵
Gong, L. and J. M. Flegal. 2016. A practical sequential stopping rule for high-dimensional Markov chain Monte Carlo. Journal of Computational and Graphical Statistics 25:684–700.
OpenUrl

[39] ↵
Green, P. J. 1995. Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82:711–732.
OpenUrl CrossRef Web of Science

[40] ↵
Grismer, L. L., N. A. Poyarkov, E. S. H. Quah, J. L. Grismer, and P. L. Wood, Jr.. 2022. The biogeography of bent-toed geckos, Cyrtodactylus (Squamata: Gekkonidae). PeerJ 10:e13153.
OpenUrl

[41] ↵
Haq, B. U., J. Hardenbol, and P. R. Vail. 1987. Chronology of fluctuating sea levels since the Triassic. Science 235:1156–1167.
OpenUrl Abstract/FREE Full Text

[42] ↵
Hastings, W. K. 1970. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57:97–109.
OpenUrl CrossRef Web of Science

[43] ↵
Heaney, L. R. 1985. Zoogeographic evidence for middle and late pleistocene land bridges to the philippine islands. Mod Quatern Res SE Asia 9:127–144.
OpenUrl

[44] ↵
Heaney, L. R., D. S. Balete, E. A. Rickart, P. A. Alviola, M. R. M. Duya, M. V. Duya, M. J. Veluz, L. VandeVrede, and S. J. Steppan. 2011. Chapter 1: Seven new species and a new subgenus of forest mice (Rodentia: Muridae: Apomys) from Luzon Island. Fieldiana Life and Earth Sciences 2:1–60.
OpenUrl

[45] ↵
Heaney, L. R. and J. C. Regalado, Jr.. 1998. Vanishing treasures of the Philippine rain forest. Field Museum, Chicago, Illinois.

[46] ↵
Heaney, L. R., J. S. Walsh, and A. T. Peterson. 2005. The roles of geological history and colonization abilities in genetic differentiation between mammalian populations in the Philippine Archipelago. Journal of Biogeography 32:229–247.
OpenUrl

[47] ↵
Heath, T. A., M. T. Holder, and J. P. Huelsenbeck. 2011. A Dirichlet process prior for estimating lineage-specific substitution rates. Molecular Biology and Evolution 29:939–955.
OpenUrl

[48] ↵
Heath, T. A., J. P. Huelsenbeck, and T. Stadler. 2014. The fossilized birth-death process: A coherent model of fossil calibration for divergence time estimation. Proceedings of the National Academy of Sciences 111:E2957–E2966.
OpenUrl Abstract/FREE Full Text

[49] ↵
Heled, J. and A. J. Drummond. 2010. Bayesian inference of species trees from multilocus data. Molecular Biology and Evolution 27:570–580.
OpenUrl CrossRef PubMed Web of Science

[50] ↵
Hickerson, M. J., E. A. Stahl, and H. A. Lessios. 2006. Test for simultaneous divergence using approximate Bayesian computation. Evolution 60:2435–2453.
OpenUrl CrossRef PubMed Web of Science

[51] ↵
Hoelzer, G. A. and D. J. Meinick. 1994. Patterns of speciation and limits to phylogenetic resolution. Trends in Ecology & Evolution 9:104–107.
OpenUrl

[52] ↵
Hohenlohe, P. A., S. Bassham, P. D. Etter, N. Stiffler, E. A. Johnson, and W. A. Cresko. 2010. Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLOS Genetics 6:1–23.
OpenUrl

[53] ↵
Hosner, P. A., A. S. Nyári, and R. G. Moyle. 2013. Water barriers and intra-island isolation contribute to diversification in the insular Aethopyga sunbirds (Aves: Nectariniidae). Journal of Biogeography 40:1094–1106.
OpenUrl

[54] ↵
Huang, W., N. Takebayashi, Y. Qi, and M. J. Hickerson. 2011. MTML-msBayes: Approximate Bayesian comparative phylogeographic inference from multiple taxa and multiple loci with rate heterogeneity. BMC Bioinformatics 12:1.
OpenUrl CrossRef PubMed

[55] ↵
Huxley, T. H. 1868. On the classification and the distribution of the Alectoromorphae and Heteromorphae. Proceedings of the Zoological Society of London 6:249–319.
OpenUrl

[56] ↵
Inger, R. F. 1954. Systematics and zoogeography of Philippine Amphibia. Fieldiana 33:182–531.
OpenUrl

[57] ↵
Jansa, S. A., F. K. Barker, and L. R. Heaney. 2006. The pattern and timing of diversification of Philippine endemic rodents: Evidence from mitochondrial and nuclear gene sequences. Systematic Biology 55:73–88.
OpenUrl CrossRef PubMed Web of Science

[58] ↵
Jiao, Y., N. J. Wickett, S. Ayyampalayam, A. S. Chanderbali, L. Landherr, P. E. Ralph, L. P. Tomsho, Y. Hu, H. Liang, P. S. Soltis, D. E. Soltis, S. W. Clifton, S. E. Schlarbaum, S. C. Schuster, H. Ma, J. Leebens-Mack, and C. W. dePamphilis. 2011. Ancestral polyploidy in seed plants and angiosperms. Nature 473:97–100.
OpenUrl CrossRef PubMed Web of Science

[59] ↵
H. N. Munro
Jukes, T. H. and C. R. Cantor. 1969. Evolution of protein molecules. chap. 24, Pages 21–132 in Mammalian Protein Metabolism ( H. N. Munro, ed.) vol. III. Academic Press, New York.
OpenUrl

[60] H. N. Munro

[61] ↵
Justiniano, R., J. J. Schenk, D. S. Balete, E. A. Rickart, J. A. Esselstyn, L. R. Heaney, and S. J. Steppan. 2015. Testing diversification models of endemic Philippine forest mice (Apomys) with nuclear phylogenies across elevational gradients reveals repeated colonization of isolated mountain ranges. Journal of Biogeography 42:51–64.
OpenUrl GeoRef

[62] ↵
Kingman, J. F. C. 1982a. The coalescent. Stochastic processes and their applications 13:235–248.
OpenUrl CrossRef

[63] ↵
Kingman, J. F. C. 1982b. On the genealogy of large populations. Journal of Applied Probability 19:27–43.
OpenUrl CrossRef PubMed

[64] ↵
Kishino, H., J. L. Thorne, and W. J. Bruno. 2001. Performance of a divergence time estimation method under a probabilistic model of rate evolution. Molecular Biology and Evolution 18:352–361.
OpenUrl CrossRef PubMed Web of Science

[65] ↵
Klinkenberg, D., J. A. Backer, X. Didelot, C. Colijn, and J. Wallinga. 2017. Simultaneous inference of phylogenetic and transmission trees in infectious disease outbreaks. PLOS Computational Biology 13:1–32.
OpenUrl CrossRef

[66] ↵
Kuhner, M. K. and J. Felsenstein. 1994. A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates (erratum in mol. biol. evol. 1995; 12, 525). Molecular Biology and Evolution 11:459–468.
OpenUrl CrossRef PubMed Web of Science

[67] ↵
Lakner, C., P. van der Mark, J. P. Huelsenbeck, B. Larget, and F. Ronquist. 2008. Efficiency of Markov chain Monte Carlo tree proposals in Bayesian phylogenetics. Systematic Biology 57:86–103.
OpenUrl CrossRef PubMed Web of Science

[68] ↵
Leaché, A. D., S. C. Crews, and M. J. Hickerson. 2007. Two waves of diversification in mammals and reptiles of Baja California revealed by hierarchical Bayesian analysis. Biology Letters 3:646–650.
OpenUrl

[69] ↵
Lewis, P. O., M. T. Holder, and K. E. Holsinger. 2005. Polytomies and Bayesian phylogenetic inference. Systematic Biology 54:241–253.
OpenUrl CrossRef PubMed Web of Science

[70] ↵
Li, Z., G. P. Tiley, S. R. Galuska, C. R. Reardon, T. I. Kidder, R. J. Rundell, and M. S. Barker. 2018. Multiple large-scale gene and genome duplications during the evolution of hexapods. Proceedings of the National Academy of Sciences 115:4713–4718.
OpenUrl Abstract/FREE Full Text

[71] ↵
Linkem, C. W., A. C. Diesmos, and R. M. Brown. 2011. Molecular systematics of the Philippine forest skinks (Squamata: Scincidae: Sphenomorphus): testing morphological hypotheses of interspecific relationships. Zoological Journal of the Linnean Society 163:1217–1243.
OpenUrl

[72] ↵
Linkem, C. W., K. M. Hesed, A. C. Diesmos, and R. M. Brown. 2010. Species boundaries and cryptic lineage diversity in a Philippine forest skink complex (Reptilia; Squamata; Scincidae: Lygosominae). Molecular Phylogenetics and Evolution 56:572–585.
OpenUrl

[73] ↵
Liu, L. and D. K. Pearl. 2007. Species trees from gene trees: Reconstructing Bayesian posterior distributions of a species phylogeny using estimatated gene tree distributions. Systematic Biology 56:504–514.
OpenUrl CrossRef PubMed Web of Science

[74] ↵
Lomolino, M. V., B. R. Riddle, and J. H. Brown. 2016. Biogeography. 5th ed. Sinauer Associates, Sunderland, Massachusetts, USA.

[75] ↵
Mann, H. B. and D. R. Whitney. 1947. On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics 18:50–60.
OpenUrl

[76] ↵
Metropolis, N., A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, and E. Teller. 1953. Equation of state calculations by fast computing machines. The Journal of Chemical Physics 21:1087–1092.
OpenUrl CrossRef Web of Science

[77] ↵
Miller, K. G., M. A. Kominz, J. V. Browning, J. D. Wright, G. S. Mountain, M. E. Katz, P. J. Sugarman, B. S. Cramer, N. Christie-Blick, and S. F. Pekar. 2005. The Phanerozoic record of global sea-level change. Science 310:1293–1298.
OpenUrl Abstract/FREE Full Text

[78] ↵
Nielsen, R. and J. Wakeley. 2001. Distinguishing migration from isolation: A Markov chain Monte Carlo approach. Genetics 158:885–896.
OpenUrl Abstract/FREE Full Text

[79] ↵
Oaks, J. R. 2014. An improved approximate-bayesian model-choice method for estimating shared evolutionary history. BMC Evolutionary Biology 14:150.
OpenUrl

[80] ↵
Oaks, J. R. 2019. Full Bayesian comparative phylogeography from genomic data. Systematic Biology 68:371–395.
OpenUrl

[81] ↵
Oaks, J. R. 2021. Analyses exploring the behavior of generalized bayesian phylogenetics: Version 1.0.0. GitHub and archived on Zenodo; https://doi.org/10.5281/zenodo.5162056.

[82] ↵
Oaks, J. R., N. L’Bahy, and K. A. Cobb. 2020. Insights from a general, full-likelihood Bayesian approach to inferring shared evolutionary events from genomic data: Inferring shared demographic events is challenging. Evolution 74:2184–2206.
OpenUrl CrossRef

[83] ↵
Oaks, J. R., C. D. Siler, and R. M. Brown. 2019. The comparative biogeography of Philippine geckos challenges predictions from a paradigm of climate-driven vicariant diversification across an island archipelago. Evolution 73:1151–1167.
OpenUrl

[84] ↵
Oaks, J. R., J. Sukumaran, J. A. Esselstyn, C. W. Linkem, C. D. Siler, M. T. Holder, and R. M. Brown. 2013. Evidence for climate-driven diversification? a caution for interpreting ABC inferences of simultaneous historical events. Evolution 67:991–1010.
OpenUrl CrossRef PubMed

[85] ↵
Oaks, J. R. and P. L. Wood, Jr.. 2021. Open-science notebook for the comparative phylogenetics of philippine gekkonids: Version 2. GitHub and archived on Zenodo; https://doi.org/10.5281/zenodo.5162085.

[86] ↵
Plouviez, S., T. M. Shank, B. Faure, C. Daguin-Thiebaut, F. Viard, F. H. Lallier, and D. Jollivet. 2009. Comparative phylogeography among hydrothermal vent species along the East Pacific Rise reveals vicariant processes and population expansion in the South. Molecular Ecology 18:3903–3917.
OpenUrl CrossRef PubMed

[87] ↵
Pybus, O. G. and A. Rambaut. 2009. Evolutionary analysis of the dynamics of viral infectious disease. Nature Reviews Genetics 10:540–550.
OpenUrl CrossRef PubMed Web of Science

[88] ↵
Rannala, B. and Z. Yang. 2003. Bayes estimation of species divergence times and ancestral population sizes using DNA sequences from multiple loci. Genetics 164:1645–1656.
OpenUrl Abstract/FREE Full Text

[89] ↵
Roberts, T. E. 2006. Multiple levels of allopatric divergence in the endemic Philippine fruit bat Haplonycteris fischeri (Pteropodidae). Biological Journal of the Linnean Society 88:329–349.
OpenUrl CrossRef Web of Science

[90] ↵
A. F. Horadam and
W. D. Wallis
Robinson, D. F. and L. R. Foulds. 1979. Comparison of weighted labelled trees. Pages 119–126 in Combinatorial Mathematics VI ( A. F. Horadam and W. D. Wallis, eds.) Springer Berlin Heidelberg, Berlin, Heidelberg.

[91] A. F. Horadam and

[92] W. D. Wallis

[93] ↵
Rohling, E. J., M. Fenton, F. J. Jorissen, P. Bertrand, G. Ganssen, and J. P. Caulet. 1998. Magnitudes of sea-level lowstands of the past 500,000 years. Nature 394:162–165.
OpenUrl CrossRef GeoRef Web of Science

[94] ↵
Siddall, M., E. J. Rohling, A. Almogi-Labin, C. Hemleben, D. Meischner, I. Schmelzer, and D. A. Smeed. 2003. Sea-level fluctuations during the last glacial cycle. Nature 423:853–858.
OpenUrl CrossRef GeoRef PubMed Web of Science

[95] ↵
Siler, C. D., A. C. Diesmos, A. C. Alcala, and R. M. Brown. 2011. Phylogeny of Philippine slender skinks (Scincidae: Brachymeles) reveals underestimated species diversity, complex biogeographical relationships, and cryptic patterns of lineage diversification. Molecular Phylogenetics and Evolution 59:53–65.
OpenUrl CrossRef PubMed

[96] Siler, C. D., J. R. Oaks, K. Cobb, O. Hidetoshi, and R. M. Brown. 2014. Critically endangered island endemic or peripheral population of a widespread species? conservation genetics of Kikuchi’s gecko and the global challenge of protecting peripheral oceanic island endemic vertebrates. Diversity and Distributions 20:756–772.
OpenUrl

[97] ↵
Siler, C. D., J. R. Oaks, J. A. Esselstyn, A. C. Diesmos, and R. M. Brown. 2010. Phylogeny and biogeography of Philippine bent-toed geckos (Gekkonidae: Cyrtodactylus) contradict a prevailing model of Pleistocene diversification. Molecular Phylogenetics and Evolution 55:699–710.
OpenUrl CrossRef PubMed

[98] ↵
Siler, C. D., J. R. Oaks, L. J. Welton, C. W. Linkem, J. Swab, A. C. Diesmos, and R. M. Brown. 2012. Did geckos ride the Palawan raft to the Philippines? Journal of Biogeography 39:1217–1234.
OpenUrl

[99] ↵
Slowikowski, K. 2020. ggrepel: Automatically Position Non-Overlapping Text Labels with ggplot2. R package ggrepel version 0.9.1.

[100] ↵
Spratt, R. M. and L. E. Lisiecki. 2016. A Late Pleistocene sea level stack. Climate of the Past 12:1079–1092.
OpenUrl

[101] ↵
Stadler, T. 2010. Sampling-through-time in birth–death trees. Journal of Theoretical Biology 267:396–404.
OpenUrl CrossRef PubMed Web of Science

[102] ↵
Stadler, T., A. Gavryushkina, R. C. Warnock, A. J. Drummond, and T. A. Heath. 2018. The fossilized birth-death model for the analysis of stratigraphic range data under different speciation modes. Journal of Theoretical Biology 447:41–55.
OpenUrl CrossRef

[103] ↵
Suchard, M. A., R. E. Weiss, and J. S. Sinsheimer. 2001. Bayesian selection of continuoustime Markov chain evolutionary models. Molecular Biology And Evolution 18:1001–1013.
OpenUrl CrossRef PubMed Web of Science

[104] ↵
Suzuki, Y., G. V. Glazko, and M. Nei. 2002. Overcredibility of molecular phylogenies obtained by bayesian phylogenetics. Proceedings of the National Academy of Sciences 99:16138–16143.
OpenUrl Abstract/FREE Full Text

[105] ↵
Voje, K. L., C. Hemp, Ø. Flagstad, G.-P. Saetre, and N. C. Stenseth. 2009. Climatic change as an engine for speciation in flightless Orthoptera species inhabiting African mountains. Molecular Ecology 18:93–108.
OpenUrl PubMed

[106] ↵
Wallace, A. R. 1869. The Malay Archipelago: The Land of the Orang-utan, and the Bird of Paradise. Macmillan and Co., London.

[107] ↵
Wang, L.-G., T. T.-Y. Lam, S. Xu, Z. Dai, L. Zhou, T. Feng, P. Guo, C. W. Dunn, B. R. Jones, T. Bradley, H. Zhu, Y. Guan, Y. Jiang, and G. Yu. 2019. Treeio: An R package for phylogenetic tree input and output with richly annotated and associated data. Molecular Biology and Evolution 37:599–603.
OpenUrl

[108] ↵
Wickham, H. 2016. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York.

[109] ↵
Wilcoxon, F. 1945. Individual comparisons by ranking methods. Biometrics Bulletin 1:80–83.
OpenUrl CrossRef Web of Science

[110] ↵
Wilke, C. O. 2020. cowplot: Streamlined Plot Theme and Plot Annotations for ggplot2. R package version 1.1.1.

[111] ↵
Wood, Jr., P. L., X. Guo, S. L. Travers, Y.-C. Su, K. V. Olson, A. M. Bauer, L. L. Grismer, C. D. Siler, R. G. Moyle, M. J. Andersen, and R. M. Brown. 2020. Parachute geckos free fall into synonymy: Gekko phylogeny, and a new subgeneric classification, inferred from thousands of ultraconserved elements. Molecular Phylogenetics and Evolution 146:106731.
OpenUrl

[112] ↵
Yang, Z. 1994. Statistical properties of the maximum likelihood method of phylogenetic estimation and comparison with distance matrix methods. Systematic Biology 43:329–342.
OpenUrl CrossRef Web of Science

[113] ↵
Yang, Z. 2014. Molecular Evolution: A Statistical Approach. Oxford University Press, Oxford, United Kingdom.

[114] ↵
Yang, Z., N. Goldman, and A. Friday. 1995. Maximum likelihood trees from DNA sequences: A peculiar statistical estimation problem. Systematic Biology 44:384–399.
OpenUrl CrossRef Web of Science

[115] ↵
Ypma, R. J. F., W. M. van Ballegooijen, and J. Wallinga. 2013. Relating phylogenetic trees to transmission trees of infectious disease outbreaks. Genetics 195:1055–1062.
OpenUrl Abstract/FREE Full Text

[116] ↵
Yu, G., D. K. Smith, H. Zhu, Y. Guan, and T. T.-Y. Lam. 2017. ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution 8:28–36.
OpenUrl

[117] ↵
Yumul, G., C. Dimalanta, V. Maglambayan, and E. Marquez. 2008. Tectonic setting of a composite terrane: A review of the Philippine island arc system. Geosciences Journal 12:7–17.
OpenUrl
