RevGadgets: an R Package for visualizing Bayesian phylogenetic analyses from RevBayes

Carrie M. Tribble; William A. Freyman; Michael J. Landis; Jun Ying Lim; Joëlle Barido-Sottani; Bjørn Tore Kopperud; Sebastian Höhna; Michael R. May

doi:10.1101/2021.05.10.443470

Summary

Statistical phylogenetic methods are the foundation for a wide range of evolutionary and epidemiological studies. However, as these methods grow increasingly complex, users often encounter significant challenges with summarizing, visualizing, and communicating their key results.
We present RevGadgets, an R package for creating publication-quality figures from the results of a large variety of phylogenetic analyses performed in RevBayes (and other phylogenetic software packages).
We demonstrate how to use RevGadgets through a set of vignettes that cover the most common use cases that researchers will encounter.
RevGadgets is an open-source, extensible package that will continue to evolve in parallel with RevBayes, helping researchers to make sense of and communicate the results of a diverse array of analyses.

Introduction

Beyond being a graphical representation of the Tree of Life, phylogenetic trees provide a rigorous basis for a wide range of evolutionary and epidemiological inferences. Phylogenetic methods allow researchers to understand how molecular and morphological traits evolve (Nei, 1987; Yang, 2014; Felsenstein, 1985; Harvey and Pagel, 1991), how lineages disperse over geographic space (Ronquist and Sanmartín, 2011), and how lineages diversify over time (Morlon, 2014), among other evolutionary phenomena. Additionally, phylogenetic methods can be used to inform conservation decisions (Faith, 1992) and are powerful epidemiological tools (Volz et al., 2013; Baele et al., 2017).

Phylogenetic methods are increasingly based on explicit probabilistic models with parameters that describe underlying evolutionary processes. As datasets grow and evolutionary hypotheses become more nuanced, these models necessarily become more complex. RevBayes (Höhna et al., 2016) is a Bayesian phylogenetic inference program that was developed to accommodate this increasing complexity and allows users to explore a vast space of phylogenetic models. Models in RevBayes are specified as probabilistic graphical models (Höhna et al., 2014), which are graphical representations of the underlying dependencies among parameters (and their corresponding prior distributions), similar to individual Legos being used to build a complex city. Using this graphical modeling framework, users can design customized models and tailor analyses to their particular datasets and research questions. However, this flexibility comes at a cost: because of the nearly infinite variety of possible models (and model combinations) that users can explore in RevBayes, the results of these analyses are often challenging to summarize and visualize using standard software. This is a significant limitation for RevBayes users because, in addition to being the primary method for reporting results of phylogenetic analyses, graphical summaries are a valuable tool for making sense of scientific results (Tufte, 2001), and for diagnosing modeling and analytical problems (Kerman et al., 2008).

Historically, RevBayes users have had to process and plot their results using ad hoc scripts written for each analysis, which imposed a significant barrier to entry for users not familiar with the structure of RevBayes output or comfortable with developing their own graphical summaries. To address these challenges, we developed RevGadgets. RevGadgets is an R package (R Core Team, 2020) that adds to the diverse ecosystem of phylogenetic visualization tools, such as ape (Paradis and Schliep, 2019), Tracer (Rambaut et al., 2018), phytools (Revell, 2012), ggtree (Yu et al., 2017), FigTree (Rambaut, 2014), and IcyTree (Vaughan, 2017), among many others, but is specialized for output produced by RevBayes. RevGadgets serves as a bridge between RevBayes analyses and existing tools for phylogenetic data processing and plotting in R, especially the ggtree package suite, which includes the ggtree, tidytree, and treeio packages (Wang et al., 2020; Yu et al., 2017). RevGadgets provides tools for plotting summary trees (including summaries of parameters for each branch), ancestral-state estimates, and posterior distributions of parameters for a variety of models. Using the general framework of ggplot2, the tidyverse, and associated packages (Wickham, 2011; Wickham et al., 2019), plotting functions return plot objects with default, but customizable, aesthetics. Here, we present five vignettes demonstrating how to use RevGadgets to summarize results for a variety of phylogenetic analyses.

Phylogenies

Phylogenies are central to all analyses in RevBayes, so accurate and information-rich visualizations of evolutionary trees are critical. In this case study, we demonstrate the tree-plotting functionality of RevGadgets, with methods to visualize phylogenies and their associated posterior probabilities, divergence-time estimates, and branch-specific parameter estimates.

RevGadgets provides paired functions for (1) reading in and processing data, and (2) summarizing and visualizing results. For phylogenies, the function readTrees() loads trees (either individual trees, or sets of trees) in either Newick or NEXUS (Maddison et al., 1997) formats, then processes associated branch or node annotations, and finally stores the tree(s) as treedata object(s) (as defined by treeio; Wang et al., 2020). Users can then visualize the treedata object using either plotTree() or plotFBDTree(), as we demonstrate below.

RevGadgets can plot both unrooted and rooted trees, and creates plots that are compatible with plotting options from ggtree. Additionally, RevGadgets provides extensive functionality for plotting trees with non-contemporaneous tips, such as those produced by total-evidence analyses under the fossilized birth-death (FBD) process (Heath et al., 2014; Zhang et al., 2016). The fossilized birth-death process (and the related serially sampled birth-death process; Stadler, 2010) produces sampled ancestors (samples that are directly ancestral to another sampled taxon and thus are not represented as tips in the tree), and the ages of the samples are often subject to uncertainty (e.g., because of imperfect knowledge about the age of the strata from which the samples were collected). As a consequence, conventional tree plotting tools are unsuitable for plotting FBD trees. We demonstrate how to use RevGadgets to plot the results of an FBD analysis of living and extinct bears (Figure 1; data from Abella et al., 2012 and Heath et al., 2014). We include age bars colored by the posterior probability of the corresponding node, a geological time scale and labeled epochs from the package deeptime (Gearty, 2021), and fossils estimated to be direct ancestors of other samples (i.e., sampled ancestors).

Figure 1: Plotting a time-calibrated phylogeny of extinct and extant taxa.

Top) RevGadgets code for reading in and plotting a time-calibrated phylogeny of extant and extinct bears. We use the theme function from ggplot2 to add the posterior-probability legend. Bottom) The maximum sampled-ancestor clade-credibility (MSACC) tree for the bears. Sampled ancestors are indicated by numbers along the branches (legend, top left). Bars represent the 95% credible interval of the age of the node, tip, or sampled ancestor in millions of years (geological timescale, x-axis); the color of the bar corresponds to the posterior probability (legend, middle left) that a clade exists, the posterior probability that a fossil is a sampled ancestor, or the posterior probability that a tip is not a sampled ancestor. (Data from Abella et al., 2012; Heath et al., 2014.)

In addition to visualizing trees themselves, RevGadgets allows researchers to visualize branch-specific parameters, for example rates of evolution or diversification for each branch in the phylogeny. In Figure 2, we demonstrate how to use plotTree() to visualize the estimated optimal body size as it varies across the cetacean phylogeny under a relaxed Ornstein-Uhlenbeck process (Butler and King, 2004; Uyeda and Harmon, 2014; data from Steeman et al., 2009; Slater et al., 2010). Under this model, a quantitative character evolves towards an adaptive optimum that changes along the branches of the tree, and thus the optimum associated with each branch is a focal inference.

Figure 2: Plotting branch-specific parameter values across a phylogeny.

Top) RevGadgets code for reading in and plotting the cetacean phylogeny that has been annotated with branch-specific adaptive optima (θ) inferred under a relaxed Ornstein-Uhlenbeck model. Bottom) The cetacean phylogeny with branches colored according to the posterior-mean estimate of the inferred branch-specific optimum body size, θ (legend, top left). (Phylogeny from Steeman et al., 2009; body size data in units of natural log-transformed meters from Slater et al., 2010.)

The plotTree() function can also visualize unrooted or circular phylogenies, and users may add text annotations to denote posterior probabilities or other quantities. Users can apply ggtree functions to modify the RevGadgets plot, e.g., to highlight certain clades with geom_hilight() or to add phylopics (http://phylopic.org/) using geom_phylopic(). Together, these functions provide user-friendly and customizable tree-plotting functionality for a variety of core research questions in evolutionary biology.

Posterior Estimates of Numerical Parameters

RevGadgets provides several tools to visualize posterior distributions of numerical parameters. The output produced by most RevBayes analyses is a (typically tab-delimited) text file where rows correspond to samples from sequential iterations of an MCMC analysis, and columns correspond to parameters in the model. Most information of interest to researchers—e.g., most probable parameter values (maximum a posteriori, or MAP, estimates), 95% credible intervals (CIs), or full posterior distributions—requires processing this raw MCMC output. Here, we demonstrate methods for processing and visualizing MCMC output for both quantitative and qualitative parameters.

We illustrate the core functions for reading, summarizing and visualizing posterior distributions of specific parameters with an example analysis of chromosome number evolution (Figure 3; data from Freyman and Höhna, 2018). We use readTrace() to read in parameters sampled during one or more MCMC analyses. We then use summarizeTrace() to calculate the posterior mean and 95% credible interval for the focal parameters. Finally, we plot the marginal posterior distributions of the focal parameters using plotTrace().
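To make the summary step concrete, the following minimal Python sketch reads a tab-delimited trace and computes a posterior mean and a 95% interval for each parameter. It is only an illustration of the computation, not the RevGadgets implementation: the helper names (read_trace, quantile, summarize), the toy parameter names (gamma, delta), the burn-in handling, and the use of an equal-tailed quantile interval are all assumptions made for this example.

```python
import csv
import io
import statistics


def read_trace(handle, burnin=0.1):
    """Read a tab-delimited MCMC log into {column: [samples]}, dropping burn-in rows."""
    reader = csv.DictReader(handle, delimiter="\t")
    columns = {name: [] for name in reader.fieldnames}
    for row in reader:
        for name in columns:
            columns[name].append(float(row[name]))
    n_samples = len(next(iter(columns.values())))
    first_kept = int(n_samples * burnin)  # discard the first `burnin` fraction
    return {name: values[first_kept:] for name, values in columns.items()}


def quantile(sorted_values, q):
    """Linear-interpolation quantile of an already sorted list."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    weight = position - lower
    return sorted_values[lower] * (1 - weight) + sorted_values[upper] * weight


def summarize(samples):
    """Posterior mean and equal-tailed 95% interval (an assumed interval type)."""
    ordered = sorted(samples)
    return {
        "mean": statistics.fmean(ordered),
        "ci_lower": quantile(ordered, 0.025),
        "ci_upper": quantile(ordered, 0.975),
    }


# Toy trace with hypothetical parameters: rows are MCMC samples, columns are parameters.
toy_log = "Iteration\tgamma\tdelta\n" + "".join(
    f"{i}\t{i / 100}\t{i / 50}\n" for i in range(101)
)
trace = read_trace(io.StringIO(toy_log), burnin=0.0)
summary = {name: summarize(trace[name]) for name in ("gamma", "delta")}
print(summary)
```

The same row-per-sample, column-per-parameter layout is what makes such trace files straightforward to pass to other MCMC tools.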

Figure 3: Plotting posterior distributions of numerical parameter values.

Top) RevGadgets code for reading in and plotting the posterior distributions of rates of chromosome evolution in Aristolochia. Bottom) Marginal posterior distributions of the two rate parameters. Shaded regions represent the 95% credible interval of each posterior distribution. (Data from Freyman and Höhna, 2018.)

Plots of the posterior distributions of parameter values are key to a thorough understanding of the results of any Bayesian analysis. These tools encourage users to explore their results thoroughly rather than relying on single summary statistics. These summaries and plots may also be useful as tools for science communication and education on statistical phylogenetics, as they can easily be used to demonstrate differences in parameter estimates that result from changes to basic phylogenetic models. Additionally, the output of readTrace() may be passed to R packages specializing in MCMC diagnosis, e.g., convenience (Fabreti and Höhna, 2021) or coda (Plummer et al., 2006). These functions are compatible with any delimited text file of MCMC samples, and can be used with the output of most Bayesian phylogenetic programs.

Ancestral-State Estimates

In addition to making inferences about the underlying process of evolution, researchers may be interested in studying how particular characters evolved across the branches of the phylogeny. Ancestral-state estimation is a method for inferring that history.

RevGadgets offers two different types of summaries for ancestral-state estimates: 1) maximum a posteriori (MAP) estimates, i.e., the states with the highest posterior probability at each node; and 2) pie charts that represent each state in proportion to its probability at each node. Ancestral-state estimates may also be represented as text annotations rather than colored symbols. Additionally, RevGadgets can summarize and visualize ancestral-state estimates at internal nodes and at the “shoulders”, i.e., at the beginning of each branch. Plotting the states at internal nodes is appropriate for standard evolutionary models of anagenetic (within-lineage) change. However, models of evolution that include a cladogenetic component (e.g., models of biogeographic or chromosome-number evolution; Ree and Smith, 2008; Goldberg and Igić, 2012; Freyman and Höhna, 2018) also allow states to change at speciation events. In this case, researchers may also want to plot the shoulder states, which represent the ancestral-state estimates for each daughter lineage immediately following the speciation event.
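The two summaries can be illustrated with a short Python sketch. We assume, purely for illustration, that each node has a list of states sampled across MCMC iterations; the node names, the toy samples, and the helper functions are hypothetical, and this is not how RevBayes or RevGadgets computes or stores these estimates.

```python
from collections import Counter


def state_probabilities(sampled_states):
    """Posterior probability of each state, estimated as its sampling frequency."""
    counts = Counter(sampled_states)
    total = len(sampled_states)
    return {state: count / total for state, count in counts.items()}


def map_state(probabilities):
    """Return the state with the highest posterior probability and that probability."""
    state = max(probabilities, key=probabilities.get)
    return state, probabilities[state]


# Hypothetical sampled states for two internal nodes.
node_samples = {
    "node_1": ["A", "A", "B", "A"],
    "node_2": ["B", "B", "B", "B", "A"],
}

# Pie-chart summary: one slice per state, sized by its posterior probability.
pies = {node: state_probabilities(states) for node, states in node_samples.items()}

# MAP summary: a single state per node, with its posterior probability.
map_estimates = {node: map_state(probs) for node, probs in pies.items()}
print(pies)
print(map_estimates)
```

The MAP summary discards the probabilities of the non-MAP states, whereas the pie summary retains the full distribution at each node.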

We demonstrate how to plot ancestral-state estimates of placenta type across the mammal phylogeny under an asymmetric model of character evolution (Figure 4; data from Elliot and Crespi, 2006). First, we use processAncStates() to read in and parse the phylogeny and ancestral-state estimates inferred using RevBayes. Second, we use plotAncStatesMAP() to color each node symbol according to the state with the highest posterior probability, and make the radius of the symbol proportional to that state’s posterior probability. Because of the size of the phylogeny, we choose to plot the estimates on a circular tree by changing the tree_layout parameter.

Figure 4: Plotting maximum a posteriori (MAP) estimates of ancestral states on a circular phylogeny.

Top) MAP estimates of ancestral placental states across the phylogeny of mammals. Each node is colored by the MAP state (legend, bottom right); the size of each symbol is proportional to the posterior probability of the MAP state (legend, top right). Bottom) RevGadgets code for reading in and plotting the MAP estimates for ancestral placental states across the mammal phylogeny. (Data from Elliot and Crespi, 2006.)

Next, we demonstrate plotting estimates of ancestral ranges of the Hawaiian silversword alliance that were generated by a Dispersal-Extinction-Cladogenesis (DEC) model (Figure 5; data from Landis et al., 2018). Since the DEC model features a cladogenetic component, we include shoulder-state estimates. Because of the large number of states in this analysis (15 possible ranges and one “other” category), more pre-processing is necessary. As before, we pass the appropriate state names to processAncStates(); however, in this case we plot pie charts representing the probability of each state using plotAncStatesPie(), and plot states at shoulders using cladogenetic = TRUE.

Figure 5: Plotting posterior distributions of ancestral states under a cladogenetic model.

Top) Posterior estimates of the ancestral biogeographic states of the Hawaiian silverswords under a DEC model. The size of each pie slice is proportional to the posterior probability of a given state (legend, top left) for a particular lineage. Pies at nodes represent the state of the ancestral lineage immediately before speciation; pies at “shoulders” represent the states of each daughter lineage immediately following the speciation event. Bottom) RevGadgets code for reading in and plotting the posterior estimates for ancestral geographic range across the phylogeny of Hawaiian silverswords. (Data from Landis et al., 2018.)

Beyond the above examples, these versatile plotting tools can visualize any discrete ancestral-state estimates reconstructed by RevBayes, including the results of chromosome count estimations (Freyman and Höhna, 2018) and discrete state-dependent speciation and extinction (SSE) models (Freyman and Höhna, 2019; Zenil-Ferguson et al., 2019).

Diversification Rates

The processes of speciation and extinction (i.e., lineage diversification) are of great interest to evolutionary biologists (Morlon, 2014). Rates of speciation and extinction may be modeled as constant over time and among branches (as in a constant-rate birth-death process; Kendall et al., 1948; Nee et al., 1994), or allowed to vary over time (Stadler, 2011; May et al., 2016), across branches of a phylogeny (Rabosky, 2014; Höhna et al., 2019), or based on the character states of the evolving lineages (Maddison et al., 2007; Freyman and Höhna, 2019). For example, rates that vary across branches of the phylogeny can be visualized using plotTree() to color the branches by their inferred rate. State-dependent diversification models provide estimates of the speciation and extinction rates associated with each character state, and may also be used to estimate ancestral states. The estimated rates may be visualized with plotTrace() or with the dedicated processing and plotting functions for diversification rates: processSSE(), plotMuSSE(), and plotHiSSE(). The ancestral-state estimates may be visualized with plotAncStatesMAP() or plotAncStatesPie().

We demonstrate how to plot the results of a time-varying model, the episodic birth-death process (Stadler, 2011; Höhna, 2015), applied to a primate phylogeny (Figure 6; Springer et al., 2012). The episodic birth-death analysis in RevBayes produces separate trace files for each type of rate. We read these output files using processDivRates() and plot the resulting parameter estimates over time using plotDivRates().

Figure 6: Plotting posterior distributions of diversification rates over time.

Top) RevGadgets code for reading in and plotting the posterior estimates of diversification rates over time inferred from the primate phylogeny. Bottom) Posterior distributions of speciation and extinction rates over time, as well as the net diversification rate (speciation minus extinction) and the relative extinction rate (extinction divided by speciation). Dark lines correspond to the posterior-mean estimate of each parameter for each time interval, and shaded regions correspond to the 95% credible interval. (Data from Springer et al., 2012.)
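The derived quantities in this caption can be sketched in a few lines of Python. The sketch assumes that speciation and extinction samples are stored with one row per MCMC sample and one column per time interval; the toy values and the helper names (per_sample, interval_means) are hypothetical, and this is not the RevGadgets implementation.

```python
import statistics


def per_sample(speciation, extinction, combine):
    """Apply `combine` to matching speciation/extinction entries, sample by sample."""
    return [
        [combine(s, e) for s, e in zip(s_row, e_row)]
        for s_row, e_row in zip(speciation, extinction)
    ]


def interval_means(samples):
    """Posterior mean for each time interval (i.e., for each column)."""
    return [statistics.fmean(column) for column in zip(*samples)]


# Toy posterior samples: 2 MCMC samples (rows) x 2 time intervals (columns).
speciation = [[0.30, 0.20], [0.50, 0.40]]
extinction = [[0.10, 0.10], [0.30, 0.20]]

# Net diversification = speciation - extinction; relative extinction = extinction / speciation.
net_diversification = per_sample(speciation, extinction, lambda s, e: s - e)
relative_extinction = per_sample(speciation, extinction, lambda s, e: e / s)

net_means = interval_means(net_diversification)
relative_means = interval_means(relative_extinction)
print(net_means, relative_means)
```

Computing the derived rates within each sample before summarizing matters for the ratio: the mean of extinction divided by speciation is generally not equal to the mean extinction rate divided by the mean speciation rate.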

Together with the aforementioned functions for plotting diversification parameter estimates, plotDivRates() allows users to visualize the outputs of nearly all diversification analyses available in RevBayes. Stochastic character mapping of diversification estimates, in which the timing and location of diversification rate shifts are painted along the branches of the tree, will be added in the future (Freyman and Höhna, 2019; Höhna et al., 2019).

Model Adequacy

In addition to visualizing the results of phylogenetic inferences with a specific model, RevGadgets provides tools for exploring the adequacy of the model (i.e., whether the model provides an adequate description of the data-generating process; Bollback, 2002; Gelman et al., 2013; Brown, 2014; Höhna et al., 2018). Posterior-predictive analysis tests whether a fitted model simulates (predicts) data that are similar to the observed data. This process is distinct from model testing, in which one model is chosen from a set of possible models, as the best model of the set may still provide an inadequate description of the underlying process.

First, users analyze their data with the model of interest and then use the inferred posterior distribution to simulate a number of new data sets. The user then selects test statistics that describe important features of the data (e.g., the number of invariant sites in a nucleotide alignment) and calculates these statistics for both the observed data and the simulated data. If the statistic from the empirical data is reasonably included within the distribution of statistics from simulated datasets (posterior-predictive p-value > 0.05), the model is considered an adequate description of the process that produced the tested data feature.
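The comparison between observed and simulated statistics can be sketched as follows. The sketch computes the fraction of simulated statistics at or below, and at or above, the observed value, and then forms a two-sided p-value by doubling the smaller tail. The doubling convention, the helper names, and the toy counts of invariant sites are assumptions made for illustration; they are not necessarily what RevBayes or RevGadgets reports.

```python
def tail_fractions(simulated, observed):
    """Fractions of simulated statistics that are <= and >= the observed statistic."""
    n = len(simulated)
    lower = sum(value <= observed for value in simulated) / n
    upper = sum(value >= observed for value in simulated) / n
    return lower, upper


def two_sided_p_value(simulated, observed):
    """Two-sided posterior-predictive p-value (assumed convention: twice the smaller tail)."""
    lower, upper = tail_fractions(simulated, observed)
    return min(1.0, 2 * min(lower, upper))


# Hypothetical numbers of invariant sites in ten simulated alignments.
simulated_invariant_sites = [410, 415, 420, 422, 425, 430, 431, 435, 440, 450]

p_central = two_sided_p_value(simulated_invariant_sites, 425)  # observed value near the middle
p_extreme = two_sided_p_value(simulated_invariant_sites, 452)  # observed value beyond all simulations
print(p_central, p_extreme)
```

An observed statistic that lies outside the range of every simulated statistic yields a p-value of zero under this convention, which would indicate that the model fails to reproduce that feature of the data.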

Here, we demonstrate the workflow for a posterior-predictive analysis to test model adequacy of the Jukes-Cantor model for nucleotide sequence evolution (Jukes et al., 1969) in a single gene across a sample of 23 primates (Figure 7; data from Springer et al., 2012). First, we perform an analysis in RevBayes under a Jukes-Cantor model of nucleotide sequence data. Second, we use RevBayes to simulate datasets under the posterior distributions estimated in the first step. Third, we use RevBayes to calculate statistics from the simulated and empirical datasets. These statistics should describe aspects of the data that we hope capture a meaningful component of model performance. Finally, we use RevGadgets to plot those statistics and compute posterior-predictive p-values.

Figure 7: Plotting simulated posterior-predictive distributions to assess model adequacy.

Top) RevGadgets code for reading in and plotting the distributions of summary statistics generated using posterior-predictive simulation. Bottom) Posterior-predictive distributions (black curves) of four statistics simulated under the Jukes-Cantor model fit to primate cytb, compared to the same statistics computed on the observed data (dashed vertical lines). The posterior-predictive p-value (upper right of each panel) is the fraction of simulated statistics that are as or more extreme than the observed statistic. If the observed statistic falls in or beyond the orange region, we deem the model inadequate at the 5% significance level; if the observed statistic falls in the blue region, the model is marginally adequate at the 10% significance level. In this case, the Jukes-Cantor model provides an inadequate description of the true generating process according to every summary statistic. (Data from Springer et al., 2012.)

Despite being computationally inexpensive compared to Bayesian model comparison methods (i.e., Bayes factor calculation), posterior-predictive approaches remain relatively uncommon in empirical phylogenetic studies. As genome-scale datasets and increasingly complex statistical methods become more accessible to researchers, posterior-predictive simulation will be critical to testing how well our models describe the underlying generative processes. This component of RevGadgets functionality and the associated clear workflows for performing and interpreting posterior-predictive tests will hopefully increase the adoption of this important tool.

Conclusions

RevBayes is a flexible platform for performing Bayesian phylogenetic evolutionary inferences. Because of the almost endless possibilities for building unique combinations of models in RevBayes, these analyses are often challenging to visualize using standard plotting software. We have developed an R package, RevGadgets, to produce publication-quality visualizations of phylogenetic analyses performed in RevBayes. The case studies described above illustrate some of the core functionality available in RevGadgets and demonstrate how to produce plots of the most commonly performed RevBayes analyses. RevBayes is open-source software that is actively maintained and developed. RevGadgets is likewise open source and will continue to provide new plotting tools to meet new visualization challenges as they arise. RevGadgets and any future updates will be available on GitHub at https://github.com/cmt2/RevGadgets. Additionally, we provide thorough documentation for all functionality in the package and maintain numerous tutorials demonstrating how to use RevGadgets on the RevBayes website at https://RevBayes.github.io/tutorials/. Together, the modular modeling tools from RevBayes and the visualization gadgets in RevGadgets will help researchers make sense of and communicate the results of a diverse array of sophisticated phylogenetic analyses.

Author Contributions

CMT and MRM designed the R package. All authors contributed code and examples. CMT and MRM drafted the manuscript. All authors revised and approved the final version of the manuscript.

Data Availability

RevGadgets and all example datasets are freely available on GitHub at https://github.com/cmt2/RevGadgets.

Acknowledgements

We would like to acknowledge Carl J. Rothfels, Benjamin K. Blackman, David D. Ackerly, and Chelsea D. Specht for feedback on initial stages of the manuscript. Ixchel González Ramírez, Jenna T. B. Ekwealor, Isaac Lichter Marck, and members of the Rothfels Lab at UC Berkeley provided valuable feedback on usability and legibility of figures and code.

This research was supported by the Deutsche Forschungsgemeinschaft (DFG) Emmy Noether-Program HO 6201/1-1 awarded to SH.

References

Abella, J., Alba, D. M., Robles, J. M., Valenciano, A., Rotgers, C., Carmona, R., Montoya, P., and Morales, J. (2012). Kretzoiarctos gen. nov., the oldest member of the giant panda clade. PLoS ONE, 7(11).
Baele, G., Suchard, M. A., Rambaut, A., and Lemey, P. (2017). Emerging concepts of data integration in pathogen phylodynamics. Systematic Biology, 66(1):e47–e65.
Bollback, J. P. (2002). Bayesian model adequacy and choice in phylogenetics. Molecular Biology and Evolution, 19(7):1171–1180.
Brown, J. M. (2014). Predictive approaches to assessing the fit of evolutionary models. Systematic Biology, 63(3):289–292.
Butler, M. A. and King, A. A. (2004). Phylogenetic comparative analysis: a modeling approach for adaptive evolution. The American Naturalist, 164(6):683–695.
Elliot, M. G. and Crespi, B. J. (2006). Placental invasiveness mediates the evolution of hybrid inviability in mammals. The American Naturalist, 168(1):114–120.
Fabreti, L. G. and Höhna, S. (2021). Convergence assessment for Bayesian phylogenetic analysis using MCMC simulation.
Faith, D. P. (1992). Conservation evaluation and phylogenetic diversity. Biological Conservation, 61(1):1–10.
Felsenstein, J. (1985). Phylogenies and the comparative method. The American Naturalist, 125(1):1–15.
Freyman, W. A. and Höhna, S. (2018). Cladogenetic and anagenetic models of chromosome number evolution: a Bayesian model averaging approach. Systematic Biology, 67(2):195–215.
Freyman, W. A. and Höhna, S. (2019). Stochastic character mapping of state-dependent diversification reveals the tempo of evolutionary decline in self-compatible Onagraceae lineages. Systematic Biology, 68(3):505–519.
Gearty, W. (2021). deeptime: Plotting Tools for Anyone Working in Deep Time. R package version 0.0.5.3.
Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., and Rubin, D. B. (2013). Bayesian Data Analysis. CRC Press.
Goldberg, E. E. and Igić, B. (2012). Tempo and mode in plant breeding system evolution. Evolution: International Journal of Organic Evolution, 66(12):3701–3709.
Harvey, P. H. and Pagel, M. D. (1991). The Comparative Method in Evolutionary Biology, volume 239. Oxford University Press.
Heath, T. A., Huelsenbeck, J. P., and Stadler, T. (2014). The fossilized birth–death process for coherent calibration of divergence-time estimates. Proceedings of the National Academy of Sciences, 111(29):E2957–E2966.
Höhna, S. (2015). The time-dependent reconstructed evolutionary process with a key-role for mass-extinction events. Journal of Theoretical Biology, 380:321–331.
Höhna, S., Coghill, L. M., Mount, G. G., Thomson, R. C., and Brown, J. M. (2018). P3: Phylogenetic posterior prediction in RevBayes. Molecular Biology and Evolution, 35(4):1028–1034.
Höhna, S., Freyman, W. A., Nolen, Z., Huelsenbeck, J., May, M. R., and Moore, B. R. (2019). A Bayesian approach for estimating branch-specific speciation and extinction rates. bioRxiv, 555805.
Höhna, S., Heath, T. A., Boussau, B., Landis, M. J., Ronquist, F., and Huelsenbeck, J. P. (2014). Probabilistic graphical model representation in phylogenetics. Systematic Biology, 63(5):753–771.
Höhna, S., Landis, M. J., Heath, T. A., Boussau, B., Lartillot, N., Moore, B. R., Huelsenbeck, J. P., and Ronquist, F. (2016). RevBayes: Bayesian phylogenetic inference using graphical models and an interactive model-specification language. Systematic Biology, 65(4):726–736.
Jukes, T. H., Cantor, C. R., et al. (1969). Evolution of protein molecules. Mammalian Protein Metabolism, 3(21):132.
Kendall, D. G. et al. (1948). On the generalized “birth-and-death” process. The Annals of Mathematical Statistics, 19(1):1–15.
Kerman, J., Gelman, A., Zheng, T., and Ding, Y. (2008). Visualization in Bayesian data analysis. In Handbook of Data Visualization, pages 709–724. Springer.
Landis, M. J., Freyman, W. A., and Baldwin, B. G. (2018). Retracing the Hawaiian silversword radiation despite phylogenetic, biogeographic, and paleogeographic uncertainty. Evolution, 72(11):2343–2359.
Maddison, D. R., Swofford, D. L., and Maddison, W. P. (1997). NEXUS: an extensible file format for systematic information. Systematic Biology, 46(4):590–621.
Maddison, W. P., Midford, P. E., and Otto, S. P. (2007). Estimating a binary character’s effect on speciation and extinction. Systematic Biology, 56(5):701–710.
May, M. R., Höhna, S., and Moore, B. R. (2016). A Bayesian approach for detecting the impact of mass-extinction events on molecular phylogenies when rates of lineage diversification may vary. Methods in Ecology and Evolution, 7(8):947–959.
Morlon, H. (2014). Phylogenetic approaches for studying diversification. Ecology Letters, 17(4):508–525.
Nee, S., May, R. M., and Harvey, P. H. (1994). The reconstructed evolutionary process. Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences, 344(1309):305–311.
Nei, M. (1987). Molecular Evolutionary Genetics. Columbia University Press.
Paradis, E. and Schliep, K. (2019). ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics, 35:526–528.
Plummer, M., Best, N., Cowles, K., and Vines, K. (2006). CODA: convergence diagnosis and output analysis for MCMC. R News, 6(1):7–11.
R Core Team (2020). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.
Rabosky, D. L. (2014). Automatic detection of key innovations, rate shifts, and diversity-dependence on phylogenetic trees. PLoS ONE, 9(2).
Rambaut, A. (2014). FigTree 1.4.2 software. Institute of Evolutionary Biology, Univ. Edinburgh.
Rambaut, A., Drummond, A. J., Xie, D., Baele, G., and Suchard, M. A. (2018). Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Systematic Biology, 67(5):901–904.
Ree, R. H. and Smith, S. A. (2008). Maximum likelihood inference of geographic range evolution by dispersal, local extinction, and cladogenesis. Systematic Biology, 57(1):4–14.
Revell, L. J. (2012). phytools: an R package for phylogenetic comparative biology (and other things). Methods in Ecology and Evolution, 3(2):217–223.
Ronquist, F. and Sanmartín, I. (2011). Phylogenetic methods in biogeography. Annual Review of Ecology, Evolution, and Systematics, 42.
Slater, G. J., Price, S. A., Santini, F., and Alfaro, M. E. (2010). Diversity versus disparity and the radiation of modern cetaceans. Proceedings of the Royal Society B: Biological Sciences, 277(1697):3097–3104.
Springer, M. S., Meredith, R. W., Gatesy, J., Emerling, C. A., Park, J., Rabosky, D. L., Stadler, T., Steiner, C., Ryder, O. A., Janečka, J. E., et al. (2012). Macroevolutionary dynamics and historical biogeography of primate diversification inferred from a species supermatrix. PLoS ONE, 7(11).
Stadler, T. (2010). Sampling-through-time in birth–death trees. Journal of Theoretical Biology, 267(3):396–404.
Stadler, T. (2011). Mammalian phylogeny reveals recent diversification rate shifts. Proceedings of the National Academy of Sciences, 108(15):6187–6192.
Steeman, M. E., Hebsgaard, M. B., Fordyce, R. E., Ho, S. Y., Rabosky, D. L., Nielsen, R., Rahbek, C., Glenner, H., Sørensen, M. V., and Willerslev, E. (2009). Radiation of extant cetaceans driven by restructuring of the oceans. Systematic Biology, 58(6):573–585.
Tufte, E. (2001). The Visual Display of Quantitative Information.
Uyeda, J. C. and Harmon, L. J. (2014). A novel Bayesian method for inferring and interpreting the dynamics of adaptive landscapes from phylogenetic comparative data. Systematic Biology, 63(6):902–918.
Vaughan, T. G. (2017). IcyTree: rapid browser-based visualization for phylogenetic trees and networks. Bioinformatics, 33(15):2392–2394.
Volz, E. M., Koelle, K., and Bedford, T. (2013). Viral phylodynamics. PLoS Computational Biology, 9(3):e1002947.
Wang, L.-G., Lam, T. T.-Y., Xu, S., Dai, Z., Zhou, L., Feng, T., Guo, P., Dunn, C. W., Jones, B. R., Bradley, T., et al. (2020). treeio: an R package for phylogenetic tree input and output with richly annotated and associated data. Molecular Biology and Evolution, 37(2):599–603.
OpenUrl CrossRef PubMed
↵
Wickham, H. (2011). ggplot2. Wiley Interdisciplinary Reviews: Computational Statistics, 3(2):180–185.
OpenUrl
↵
Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L., François, R., Grolemund, G., Hayes, A., Henry, L., Hester, J., et al. (2019). Welcome to the Tidyverse. Journal of Open Source Software, 4(43):1686.
OpenUrl
↵
Yang, Z. (2014). Molecular Evolution: A Statistical Approach. Oxford University Press.
↵
Yu, G., Smith, D. K., Zhu, H., Guan, Y., and Lam, T. T.-Y. (2017). ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution, 8(1):28–36.
OpenUrl CrossRef
↵
Zenil-Ferguson, R., Burleigh, J. G., Freyman, W. A., Igić, B., Mayrose, I., and Goldberg, E. E. (2019). Interaction among ploidy, breeding system and lineage diversification. New Phytologist, 224(3):1252–1265.
OpenUrl
↵
Zhang, C., Stadler, T., Klopfstein, S., Heath, T. A., and Ronquist, F. (2016). Total-evidence dating under the fossilized birth-death process. Systematic Biology, 65(2):228–249.
OpenUrl CrossRef PubMed

Posted May 11, 2021.

Subject Area

Evolutionary Biology

[1] ↵
Abella, J., Alba, D. M., Robles, J. M., Valenciano, A., Rotgers, C., Carmona, R., Montoya, P., and Morales, J. (2012). Kretzoiarctos gen. nov., the oldest member of the giant panda clade. PLoS One, 7(11).

[2] ↵
Baele, G., Suchard, M. A., Rambaut, A., and Lemey, P. (2017). Emerging concepts of data integration in pathogen phylodynamics. Systematic Biology, 66(1):e47–e65.
OpenUrl CrossRef PubMed

[3] ↵
Bollback, J. P. (2002). Bayesian model adequacy and choice in phylogenetics. Molecular Biology and Evolution, 19(7):1171–1180.
OpenUrl CrossRef PubMed Web of Science

[4] ↵
Brown, J. M. (2014). Predictive approaches to assessing the fit of evolutionary models. Systematic biology, 63(3):289–292.
OpenUrl CrossRef PubMed

[5] ↵
Butler, M. A. and King, A. A. (2004). Phylogenetic comparative analysis: a modeling approach for adaptive evolution. The American Naturalist, 164(6):683–695.
OpenUrl CrossRef PubMed Web of Science

[6] ↵
Elliot, M. G. and Crespi, B. J. (2006). Placental invasiveness mediates the evolution of hybrid inviability in mammals. The American Naturalist, 168(1):114–120.
OpenUrl CrossRef PubMed Web of Science

[7] ↵
Fabreti, L. G. and Höhna, S. (2021). Convergence assessment for bayesian phylogenetic analysis using mcmc simulation.

[8] ↵
Faith, D. P. (1992). Conservation evaluation and phylogenetic diversity. Biological Conservation, 61(1):1–10.
OpenUrl CrossRef Web of Science

[9] ↵
Felsenstein, J. (1985). Phylogenies and the comparative method. The American Naturalist, 125(1):1–15.
OpenUrl CrossRef Web of Science

[10] ↵
Freyman, W. A. and Höhna, S. (2018). Cladogenetic and anagenetic models of chromosome number evolution: a Bayesian model averaging approach. Systematic biology, 67(2):195–215.
OpenUrl CrossRef PubMed

[11] ↵
Freyman, W. A. and Höhna, S. (2019). Stochastic character mapping of state-dependent diversification reveals the tempo of evolutionary decline in self-compatible Onagraceae lineages. Systematic Biology, 68(3):505–519.
OpenUrl

[12] ↵
Gearty, W. (2021). deeptime: Plotting Tools for Anyone Working in Deep Time. R package version 0.0.5.3.

[13] ↵
Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., and Rubin, D. B. (2013). Bayesian data analysis. CRC press.

[14] ↵
Goldberg, E. E. and Igić, B. (2012). Tempo and mode in plant breeding system evolution. Evolution: International Journal of Organic Evolution, 66(12):3701–3709.
OpenUrl CrossRef PubMed Web of Science

[15] ↵
Harvey, P. H. and Pagel, M. D. (1991). The Comparative Method in Evolutionary Biology, volume 239. Oxford University Press.

[16] ↵
Heath, T. A., Huelsenbeck, J. P., and Stadler, T. (2014). The fossilized birth–death process for coherent calibration of divergence-time estimates. Proceedings of the National Academy of Sciences, 111(29):E2957–E2966.
OpenUrl Abstract/FREE Full Text

[17] ↵
Höhna, S. (2015). The time-dependent reconstructed evolutionary process with a key-role for mass-extinction events. Journal of theoretical biology, 380:321–331.
OpenUrl CrossRef PubMed

[18] Höhna, S., Coghill, L. M., Mount, G. G., Thomson, R. C., and Brown, J. M. (2018). P3: Phylogenetic posterior prediction in RevBayes. Molecular biology and evolution, 35(4):1028–1034.
OpenUrl

[19] ↵
Höhna, S., Freyman, W. A., Nolen, Z., Huelsenbeck, J., May, M. R., and Moore, B. R. (2019). A Bayesian approach for estimating branch-specific speciation and extinction rates. bioRxiv, 555805.

[20] Höhna, S., Heath, T. A., Boussau, B., Landis, M. J., Ronquist, F., and Huelsenbeck, J. P. (2014). Probabilistic graphical model representation in phylogenetics. Systematic biology, 63(5):753–771.
OpenUrl CrossRef PubMed

[21] ↵
Höhna, S., Landis, M. J., Heath, T. A., Boussau, B., Lartillot, N., Moore, B. R., Huelsenbeck, J. P., and Ronquist, F. (2016). RevBayes: Bayesian phylogenetic inference using graphical models and an interactive model-specification language. Systematic Biology, 65(4):726–736.
OpenUrl CrossRef PubMed

[22] ↵
Jukes, T. H., Cantor, C. R., et al. (1969). Evolution of protein molecules. Mammalian protein metabolism, 3(21):132.
OpenUrl

[23] ↵
Kendall, D. G. et al. (1948). On the generalized “birth-and-death” process. The annals of mathematical statistics, 19(1):1–15.
OpenUrl

[24] ↵
Kerman, J., Gelman, A., Zheng, T., and Ding, Y. (2008). Visualization in bayesian data analysis. In Handbook of Data Visualization, pages 709–724. Springer.

[25] ↵
Landis, M. J., Freyman, W. A., and Baldwin, B. G. (2018). Retracing the Hawaiian silversword radiation despite phylogenetic, biogeographic, and paleogeographic uncertainty. Evolution, 72(11):2343–2359.
OpenUrl

[26] ↵
Maddison, D. R., Swofford, D. L., and Maddison, W. P. (1997). NEXUS: an extensible file format for systematic information. Systematic biology, 46(4):590–621.
OpenUrl CrossRef PubMed Web of Science

[27] ↵
Maddison, W. P., Midford, P. E., and Otto, S. P. (2007). Estimating a binary character’s effect on speciation and extinction. Systematic biology, 56(5):701–710.
OpenUrl CrossRef PubMed Web of Science

[28] ↵
May, M. R., Höhna, S., and Moore, B. R. (2016). A Bayesian approach for detecting the impact of mass-extinction events on molecular phylogenies when rates of lineage diversification may vary. Methods in Ecology and Evolution, 7(8):947–959.
OpenUrl

[29] ↵
Morlon, H. (2014). Phylogenetic approaches for studying diversification. Ecology letters, 17(4):508–525.
OpenUrl CrossRef PubMed

[30] ↵
Nee, S., May, R. M., and Harvey, P. H. (1994). The reconstructed evolutionary process. Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences, 344(1309):305–311.
OpenUrl GeoRef PubMed Web of Science

[31] ↵
Nei, M. (1987). Molecular Evolutionary Genetics. Columbia University Press.

[32] ↵
Paradis, E. and Schliep, K. (2019). ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics, 35:526–528.
OpenUrl CrossRef PubMed

[33] ↵
Plummer, M., Best, N., Cowles, K., and Vines, K. (2006). CODA: convergence diagnosis and output analysis for MCMC. R news, 6(1):7–11.
OpenUrl

[34] ↵
R Core Team (2020). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.

[35] ↵
Rabosky, D. L. (2014). Automatic detection of key innovations, rate shifts, and diversity-dependence on phylogenetic trees. PloS one, 9(2).

[36] ↵
Rambaut, A. (2014). FigTree 1.4. 2 software. Institute of Evolutionary Biology, Univ. Edinburgh.

[37] ↵
Rambaut, A., Drummond, A. J., Xie, D., Baele, G., and Suchard, M. A. (2018). Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Systematic biology, 67(5):901–904.
OpenUrl CrossRef PubMed

[38] ↵
Ree, R. H. and Smith, S. A. (2008). Maximum likelihood inference of geographic range evolution by dispersal, local extinction, and cladogenesis. Systematic biology, 57(1):4–14.
OpenUrl CrossRef GeoRef PubMed Web of Science

[39] ↵
Revell, L. J. (2012). phytools: an R package for phylogenetic comparative biology (and other things). Methods in ecology and evolution, 3(2):217–223.
OpenUrl

[40] ↵
Ronquist, F. and Sanmartín, I. (2011). Phylogenetic methods in biogeography. Annual Review of Ecology, Evolution, and Systematics, 42.

[41] ↵
Slater, G. J., Price, S. A., Santini, F., and Alfaro, M. E. (2010). Diversity versus disparity and the radiation of modern cetaceans. Proceedings of the Royal Society B: Biological Sciences, 277(1697):3097–3104.
OpenUrl CrossRef GeoRef PubMed

[42] ↵
Springer, M. S., Meredith, R. W., Gatesy, J., Emerling, C. A., Park, J., Ra-bosky, D. L., Stadler, T., Steiner, C., Ryder, O. A., Janečka, J. E., et al. (2012). Macroevolutionary dynamics and historical biogeography of primate diversification inferred from a species supermatrix. PloS one, 7(11).

[43] ↵
Stadler, T. (2010). Sampling-through-time in birth–death trees. Journal of Theoretical Biology, 267(3):396–404.
OpenUrl CrossRef PubMed Web of Science

[44] ↵
Stadler, T. (2011). Mammalian phylogeny reveals recent diversification rate shifts. Proceedings of the National Academy of Sciences, 108(15):6187–6192.
OpenUrl Abstract/FREE Full Text

[45] ↵
Steeman, M. E., Hebsgaard, M. B., Fordyce, R. E., Ho, S. Y., Rabosky, D. L., Nielsen, R., Rahbek, C., Glenner, H., Sørensen, M. V., and Willerslev, E. (2009). Radiation of extant cetaceans driven by restructuring of the oceans. Systematic Biology, 58(6):573–585.
OpenUrl CrossRef PubMed Web of Science

[46] ↵
Tufte, E. (2001). The visual display of quantitative information.

[47] ↵
Uyeda, J. C. and Harmon, L. J. (2014). A novel Bayesian method for in-ferring and interpreting the dynamics of adaptive landscapes from phylogenetic comparative data. Systematic biology, 63(6):902–918.
OpenUrl CrossRef PubMed

[48] ↵
Vaughan, T. G. (2017). IcyTree: rapid browser-based visualization for phylogenetic trees and networks. Bioinformatics, 33(15):2392–2394.
OpenUrl CrossRef

[49] ↵
Volz, E. M., Koelle, K., and Bedford, T. (2013). Viral phylodynamics. PLoS Computational Biology, 9(3):e1002947.
OpenUrl

[50] ↵
Wang, L.-G., Lam, T. T.-Y., Xu, S., Dai, Z., Zhou, L., Feng, T., Guo, P., Dunn, C. W., Jones, B. R., Bradley, T., et al. (2020). treeio: an R package for phylogenetic tree input and output with richly annotated and associated data. Molecular biology and evolution, 37(2):599–603.
OpenUrl CrossRef PubMed

[51] ↵
Wickham, H. (2011). ggplot2. Wiley Interdisciplinary Reviews: Computational Statistics, 3(2):180–185.
OpenUrl

[52] ↵
Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L., François, R., Grolemund, G., Hayes, A., Henry, L., Hester, J., et al. (2019). Welcome to the Tidyverse. Journal of Open Source Software, 4(43):1686.
OpenUrl

[53] ↵
Yang, Z. (2014). Molecular Evolution: A Statistical Approach. Oxford University Press.

[54] ↵
Yu, G., Smith, D. K., Zhu, H., Guan, Y., and Lam, T. T.-Y. (2017). ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution, 8(1):28–36.
OpenUrl CrossRef

[55] ↵
Zenil-Ferguson, R., Burleigh, J. G., Freyman, W. A., Igić, B., Mayrose, I., and Goldberg, E. E. (2019). Interaction among ploidy, breeding system and lineage diversification. New Phytologist, 224(3):1252–1265.
OpenUrl

[56] ↵
Zhang, C., Stadler, T., Klopfstein, S., Heath, T. A., and Ronquist, F. (2016). Total-evidence dating under the fossilized birth-death process. Systematic Biology, 65(2):228–249.
OpenUrl CrossRef PubMed