Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Hybrid-Lambda: simulation of multiple merger and Kingman gene genealogies in species networks and species trees

Sha Zhu, James H. Degnan, Sharyn J. Goldstien, Bjarki Eldon
doi: https://doi.org/10.1101/023465
Sha Zhu
1Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: joe.zhu@well.ox.ac.uk
James H. Degnan
2Department of Mathematics and Statistics, University of New Mexico, Albuquerque, New Mexico, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sharyn J. Goldstien
3Department of Biology, University of Canterbury, Christchurch, New Zealand
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bjarki Eldon
4Institut für Mathematik, Technische Universität Berlin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Background There has been increasing interest in coalescent models which admit multiple mergers of ancestral lineages; and to model hybridization and coalescence simultaneously.

Results Hybrid-Lambda is a software package that simulates gene genealogies under multiple merger and Kingman’s coalescent processes within species networks or species trees. Hybrid-Lambda allows different coalescent processes to be specified for different populations, and allows for time to be converted between generations and coalescent units, by specifying a population size for each population. In addition, Hybrid-Lambda can generate simulated datasets, assuming the infinitely many sites mutation model, and compute the FST statistic. As an illustration, we apply Hybrid-Lambda to infer the time of subdivision of certain marine invertebrates under different coalescent processes.

Conclusions Hybrid-Lambda makes it possible to investigate biogeographic concordance among high fecundity species exhibiting skewed offspring distribution.

Background

Species trees describe ancestral relations among species. Gene genealogies describe the random ancestral relations of alleles sampled within species. Species trees are often assumed to be bifurcating [6], and gene genealogies to follow the Kingman coalescent [23, 28] in allowing at most two lineages to coalesce at a time.

Recently, there has been increasing interest in coalescent models which admit multiple mergers of ancestral lineages [1, 2, 9, 12, 37, 39, 40] and to model hybridization and coalescence simultaneously [3, 25, 26, 29, 46]. For high fecundity species exhibiting sweepstake-like reproduction, such as oysters and other marine organisms [1, 4, 9, 11, 17, 18, 39], the Kingman coalescent may not be appropriate, as it is based on low offspring number population models (see recent reviews by Hedgecock and Pudovkin [19] and Tellier and Lemaire [42]). Thus, we consider Λ coalescents [8, 36, 37] derived from sweepstake-like reproduction models, and allow more than two lineages to coalesce at a time. We introduce the software Hybrid-Lambda for simulating gene trees under two models of Λ-coalescents within rooted species trees and rooted species networks. Our program differs from existing software which also allows multiple mergers, such as SIMCOAL 2.0 [30] — which allows multiple mergers in gene trees due to small population sizes under the Wright-Fisher model — in that we apply coalescent processes that are obtained from population models explicitly modelling skewed offspring distributions, as opposed to bottlenecks.

Species trees may also fail to be bifurcating due to either polytomies or hybridization events. The simulation of gene genealogies within a species network which admits hybridization is another application of Hybrid-Lambda. The package ms [24] can also simulate gene genealogies within species networks under Kingman’s coalescent. However the input of ms is difficult to automate when the network is sophisticated or generated from other software. Other simulation studies using species networks have either used a small number of network topologies coded individually (for example, in phylonet [43, 45, 46]) or have assumed that gene trees have evolved on species trees embedded within the species network [22, 29, 32]. Hybrid-Lambda will help to automate simulation studies of hybridization by allowing for a large number of species network topologies and allowing gene trees to evolve directly within the network. Hybrid-Lambda can simulate both Kingman and Λ-coalescent processes within species networks. A comparison of features of several software packages that output gene genealogies under coalescent models is given in Table 1.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1

Comparison of software programs simulating gene trees in species trees and networks. Migration refers to modeling post-speciation gene flow.

Implementation

The program input file for Hybrid-Lambda is a character string that describes relationships between species. Standard Newick format [34] is used for the input of species trees and the output of gene trees, whose interior nodes are not labelled. An extended Newick formatted string [5, 25] labels all internal nodes, and is used for the input of species networks (see Fig. 1).

Figure 1
  • Download figure
  • Open in new tab
Figure 1 Demonstration of a multiple merger genealogy within a specis network.

A multiple merger gene genealogy with topology (((a1,a2,a3),c1),(b1,c2,d1)), of which the coalescence events pointed to by arrows labelled “multiple merger” indicate coalescence of 3 lines, simulated in a species network with topology ((((B,C)s1)h1#H1,A)s2,(h1#H1,D)s3)r, where H1 is the probability that a lineage has its ancestry from its left parental population.

Parameters

Hybrid-Lambda can use multiple lineages sampled from each species and simulate Kingman or multiple merger (Λ)-coalescent processes within a given species network. In addition, separate coalescent processes can be specified on different branches of the species network. The coalescent is a continuous-time Markov process, in which times between coalescent events are independent exponential random variables with different rates. The rates are determined by a so-called coalescent parameter that can be input via command line, or a(n) (extended) Newick formatted string with specific coalescent parameters as branch lengths. By default, the Kingman coalescent is used, for which two of b active lineages coalesce at rate Embedded Image. One can choose between two different examples of a Λ-coalescent, whose parameters have clear biological interpretation. While we cannot hope to cover the huge class of Lambda-coalescents, our two examples are the ones that have been most studied in the literature [2, 7, 13]. If the coalescent parameter is between 0 and 1, then we use ψ for the coalescent parameter, and the rate λb,k at which k out of b (2 ≤ k ≤ b) active ancestral lineages merge is Embedded Image

[9]. If the coalescent parameter is between 1 and 2, then we use α for the coalescent parameter, and the rate of k-mergers (2 ≤ k ≤ b) is Embedded Image where B(·, ·) is the beta function [40].

Hybrid-Lambda assumes by default that the input network (tree) branch lengths are in coalescent units. However, this is not essential. Coalescent units can be converted through an alternative input file with numbers of generations as branch lengths, which are then divided by their corresponding effective population sizes. By default, effective population sizes on all branches are assumed to be equal and unchanged. Users can change this parameter using the command line, or using a(n) (extended) Newick formatted string to specify population sizes on all branches through another input file.

The simulation requires ultrametric species networks, i.e. equal lengths of all paths from tip to root. Hybrid-Lambda checks the distances in coalescent units between the root and all tip nodes and prints out warning messages if the ultrametric assumption is violated.

Results and Discussion

Hybrid-Lambda outputs simulated gene trees in three different files: one contains gene trees with branch lengths in coalescent units, another uses the number of generations as branch lengths, and the third uses the number of expected mutations as branch lengths.

Besides outputting gene tree files, Hybrid-Lambda also provides several functions for analysis purposes:

  • user-defined random seed for simulation,

  • output simulated data in 0/1 format assuming the infinitely many sites mutation model,

  • a frequency table of gene tree topologies,

  • a figure of the species network or tree (this function only works when LATEX or dot is installed) (Fig. 2),

  • the expected FST value for a split model between two populations,

  • when gene trees are simulated from two populations, the software Hybrid-Lambda can generate a table of relative frequencies of reciprocal monophyly, paraphyly, and polyphyly.

Figure 2
  • Download figure
  • Open in new tab
Figure 2 Demonstration of a network figure generated by Hybrid-Lambda.

The network is automatically generated by Hybrid-Lambda as dot and. pdf files from the extended newick string “(((((((6:.1,7:.1)s 6:.4,2:.5)s 1:1.1,3:1.6)s 2:3.3, 4:4.9)s 3:2)h 2#.5:1.41,5:8.31)s 4:0.1,(1:7.2,h 2#.5:.3)s 5:1.21)r;”

Simulation Example

We give a simulation example showing the impact of the particular coalescent model on estimating the divergence time for two populations. Results can be confirmed using analytic approximations to FST. This is shown in the Appendix along with example code for using Hybrid-Lambda for this example.

Eldon and Wakeley [10] showed that population subdivision can be observed in genetic data despite high migration between populations. One of the most widely used measures of population differentiation is the FST statistic. The relationship between FST and biogeography depends on the underlying coalescent process, which might be especially important for the interpretation of divergence and demographic history of many marine species. Here we used Hybrid-Lambda to simulate divergence between two populations based on different Λ-coalescents, as well as the standard Kingman coalescent. Mutations were simulated in Hybrid-Lambda under the infinite-sites model. The summary statistic FST was estimated for these data and was used to compare FST estimated from mtDNA from five species of marine invertebrates. These species were used in previous studies to test the hypothesis that contemporary oceanic conditions are creating subdivisions between the North Island and South Island reef populations of New Zealand [16, 35, 44]. These studies represent some of the earliest mitochondrial studies on the marine disjunction between the North and South Islands of New Zealand.

The FST statistic between North Island and South Island populations reported for these species ranges from approximately 0.07 to 0.8 (Fig. 3). Cellana ornata displays a very strong split, which was estimated to have occurred around 0.2–0.3 million years ago based on published estimates of divergence rates and reciprocal monophyly displayed in the data set. This result may be supported by our simulations using the Kingman coalescent. However, when multiple mergers and a higher fraction of replacement by a single parent is allowed to occur then our simulations support much younger splits between the populations ∼9,000 generations or ∼48,000 generations ago (Fig. 3). Similarly, the strong split observed for Coscinasterias muricata could be placed anywhere from ∼9000 to 45,000 generations ago depending on the degree to which multiple mergers are allowed to occur. While the range for Patiriella regularis, Cellana radians and C. flava is much smaller, it is still not clear cut as to whether divergence would be observed under different coalescent models. Here we used ψ = 0.01 and ψ = 0.23, and α = 1.5 and α = 1.9, with larger values of ψ and smaller values of α corresponding to higher probabilities of multiple mergers. Our choice of parameer values corresponds to the estimated values obtained for mtDNA of oysters and Atlantic cod. An estimate for Pacific oysters based on mitochondrial DNA for ψ was 0.075 [9]. The results for our choice of parameter values suggest that our conclusions about a much earlier split of the populations than previously estimated are robust with regard to parameter choice. A recent study of Atlantic cod [2] estimated ψ between 0.07 and 0.23 for nuclear genes and near 0.01 for mitochondrial genes. The same study estimated α to be 1.0 and 1.28 for nuclear genes and between 1.53 and 2.0 for mitochondrial genes.

Figure 3
  • Download figure
  • Open in new tab
Figure 3 Estimated FST from simulation.

The estimated FST from two populations simulated to have diverged over 0, 10, 20, and 50 thousand generations, as a function of the underlying coalescent process. Dashed lines show the relationship between the FST value estimated from mtDNA data and the estimated number of generations since divergence, for the different coalescent processes for the five marine invertebrate species, Cellana ornata, C. radians, C. flava (Goldstien et al. 2006), Coscinasterias muricata (Perrin et al. 2004), and Patiriella regularis (Waters and Roy 2005).

Figure 4
  • Download figure
  • Open in new tab
Figure 4 Comparison of estimated FST values from simulation and analytical predictions.

Values of FST as a function of the parameter α for 1 < α < 2 for different numbers of generations of separation for two populations. Simulations (dotted lines) are based on 1 individual from each of two populations separated by t generations with 103 replicates and α ∈ {1.1, 1.2, …, 1.9}. Analytical predictions (solid lines) of FST were calculated using (6).

Conclusions

The implications for using alternative coalescent models are far reaching. Many marine organisms reproduce through broadcast spawning of thousands to millions of gametes, and while the expected survival of these offspring is low, there is the potential for a small subset of the adults to have a greater contribution to the next generation than assumed by the Kingman coalescent. Hybrid-Lambda makes it possible to investigate the effect of high fecundity on biogeographic concordance among species that exhibit high fecundity and high offspring mortality, including in complex demagraphic scenarios that allow hybridization.

Availability and requirements

Hybrid-Lambda can be downloaded from http://hybridlambda.github.io/. The program is written in C++ (requires compilers that support C++11 standard to build), and released under the GNU General Public License (GPL) version 3 or later. Users can modify and make new distributions under the terms of this license. For full details of this license, visit http://www.gnu.org/licenses/. We have used travis continuous integration to test compiling the program on Linux and Mac OS. An API in R [38] is currently under development.

Competing interests

The authors declare that they have no competing interests.

Author’s contributions

SZ was responsible for the software development. JD and BE supervised the project. BE derived all the FST calculations in the appendix. SG provided the simulation results and time estimates in Figure 3. All the authors have contributed to the manuscript writing.

Acknowledgements

This work was supported by New Zealand Marsden Fund (SZ and JD), EPSRC grant EP/G052026/1 and DFG grant BL 1105/3-1 through the SPP Priority Programme 1590 “Probabilistic Structures in Evolution” (BE). This work was partly conducted while JD was a Sabbatical Fellow at the National Institute for Mathematical and Biological Synthesis, an Institute sponsored by the National Science Foundation, the U.S. Department of Homeland Security, and the U.S. Department of Agriculture through NSF Award #EF-0832858, with additional support from The University of Tennessee, Knoxville.

Appendix FST calculations

Here we show analytic calculations that can be used to obtain expressions for FST when mutation rates are low. The effect of α on FST for fixed generation times is shown in Figure 4.

Assume two populations A and B have been isolated until time τ in the past as measured from the present. Assume also that the same coalescent process is operating in populations A and B. Let Tw denote the time until coalescence for two lines when drawn from the same population, and Tb when drawn from different populations. Let λA denote the coalescence rate for two lines in population A, and λAB for the common ancestral population AB. For the Beta(2 - α, α)-coalescent, λA = 1, for the point-mass process λA = ψ2. One now obtains Embedded Image

Slatkin (1991) obtained the approximation, where μ is the per generation mutation rate, Embedded Image

Thus, using (3) gives Embedded Image

The result (5) seems to make sense, since Embedded Image and Embedded Image. By way of example, if all populations exhibit a Beta(2 - α, α)-coalescent, λA = λAB = 1, and Embedded Image

However, deciding the timeunit of τ now becomes important, since the timescale of a Beta(2 - α, α)-coalescent is proportional to Nα−1, 1 < α < 2 [40], where N is the population size. One can obtain a more accurate expression of the timescale given knowledge about the mean of the potential offspring distribution [see 40]. However, since the mean is unknown in most cases, we apply the approximation Nα−1. Assuming n ≥ 2 sequences from each population, the ‘observed’ FST Embedded Image was computed as Embedded Image where Hw is the average pairwise differences within populations, Embedded Image, and Hb is the average of n2 pairwise differences between populations.

The following command-line argument for Hybrid-Lambda simulates 1000 genealogies with 10 lineages sampled from each of two populations separated by one coalescent unit with mutation rate μ = 0.00001 using a β-coalescent with parameter α = 1.5:

hybrid-Lambda -spng ‘(A:10000,B:10000);’ -num 1000 -seed 45 -mu 0.00001 -S 10 10 \

-mm 1.5 -sim_num_mut -seg –fst

where

  • -spng ‘(A:10000,B:10000);’ denotes the population structure of a split model of one population splits to two at 10000 generations in the past.

  • -num 1000 simulates 1000 genealogies from this model.

  • -seed 45 initializes the random seed for the simulation.

  • -mu 0.00001 specifies the mutation rate of 0.00001 per generation.

  • -S 10 10 samples 10 individuals from each population.

  • -mm 1.5 specifies the Λ-coalescent parameter.

  • -sim num mut outputs simulated genealogies in Newick string, of which the number of mutations on internal branches are labelled.

  • -seg generates haplotype data set.

  • -fst computes FST of the generated haplotype data set.

One can use this example to generate the data for Figure 4 by setting the -S flag to -S 1 1.

References

  1. 1.↵
    Árnason, E. (2004). Mitochondrial cytochrome b variation in the high-fecundity Atlantic cod: trans-Atlantic clines and shallow gene genealogy, Genetics 166, 1871–1885.
    OpenUrlAbstract/FREE Full Text
  2. 2.↵
    Árnason, E. and Halldórsdóttir, K. (2015). Nucleotide variation and balancing selection at the Ckma gene in Atlantic cod: analysis with multiple merger coalescent models. PeerJ 3:e786 http://dx.doi.org/10.7717/peerj.786.
    OpenUrlCrossRefPubMed
  3. 3.↵
    Bartoszek, K., Jones, G., Oxelman, B., Sagitov, S. (2004). Time to a single hybridization event in a group of species with unknown ancestral history, J. Theor. Biol. 322, 1–6.
    OpenUrl
  4. 4.↵
    1. B Golding
    Beckenbach, A. T. Mitochondrial haplotype frequencies in oysters: neutral alternatives to selection models, Non-neutral Evolution, ed. B Golding, 188–198 (1994). Chapman & Hall, New York.
  5. 5.↵
    Cardona, G., Rossell, F., and Valiente, G. (2008). Extended Newick: it is time for a standard representation of phylogenetic networks. BMC Bioinformatics 9, 532.
    OpenUrlCrossRefPubMed
  6. 6.↵
    Degnan, J. H. and Salter, L. A. (2005). Gene tree distributions under the coalescent process. Evolution 59, 24–37.
    OpenUrlCrossRefPubMedWeb of Science
  7. 7.↵
    Delmas, J. F., Dhersin, J. S. and Siri-Jegousse, A. (2008). Asymptotic results on the length of coalescent trees. Ann Appl Prob 18, 997–1025.
    OpenUrl
  8. 8.↵
    Donnelly, P. and Kurtz, T. G. Particle representations for measure-valued population models, Ann. Probab. 27, 166–205 (1999).
    OpenUrlCrossRef
  9. 9.↵
    Eldon, B. and Wakeley, J. (2006). Coalesent processes when the distribution of offspring number among individuals is highly skewed. Genetics 172, 2621–2633.
    OpenUrlAbstract/FREE Full Text
  10. 10.↵
    Eldon, B. and Wakeley, J. (2009). Coalescence times and FST under a skewed offspring distribution among individuals in a population. Genetics 181, 615–629.
    OpenUrlAbstract/FREE Full Text
  11. 11.↵
    Eldon, B. (2011) Estimation of parameters in large offspring number models and ratios of coalescence times, Theor. Popul. Biol. 80, 16–28.
    OpenUrlCrossRefPubMed
  12. 12.↵
    Eldon, B. and Degnan, J. H. (2012). Multiple merger gene genealogies in two species: monophyly, paraphyly, and polyphyly for two examples of Lambda coalescents, Theor. Popul. Biol. 82 117–130.
    OpenUrlCrossRefPubMed
  13. 13.↵
    Eldon, B., Birkner, M., Blath, J. and Freund, F. (2015). Can the Site-Frequency Spectrum Distinguish Exponential Population Growth from Multiple-Merger Coalescents? Genetics 199, 841–856.
    OpenUrlAbstract/FREE Full Text
  14. 14.
    Ewing, G. and Hermisson, J. (2010). MSMS: a coalescent simulation program including recombination, demographic structure and selection at a single locus Bioinformatics 26, 2064–2065.
    OpenUrlCrossRefPubMedWeb of Science
  15. 15.
    Excoffier, L. and Foll, M. (2011). Fastsimcoal: a continuous-time coalescent simulator of genomic diversity under arbitrarily complex evolutionary scenarios Bioinformatics 27, 9.
    OpenUrlCrossRefPubMedWeb of Science
  16. 16.↵
    Goldstien, S. J., Schiel, D. R., and Gemmell, N. J. (2009). Comparative phylogeography of coastal limpets across a marine disjunction in New Zealand. Mol Ecol 15, 3259–3268.
    OpenUrl
  17. 17.↵
    1. A Beaumont
    Hedgecock, D. (1994). Does variance in reproductive success limit effective population sizes of marine organisms? Genetics and Evolution of Aquatic Organisms, ed. A Beaumont, 1222–1344 (1994). Chapman and Hall, London.
  18. 18.↵
    1. LG Abele
    Hedgecock, D., Tracey, M. and Nelson, K. (1982). Genetics, The Biology of Crustacea vol. 2, ed. LG Abele, 297–403 (1982). Academic Press, New York.
  19. 19.↵
    Hedgecock, D. and A. I. Pudovkin (2011). Sweepstakes reproductive success in highly fecund marine fish and shellfish: a review and commentary. Bull Mar Sci 87, 971–1002.
  20. 20.
    Heled, J. and Bryant, D. and Drummond, A. J. (2013). Simulating gene trees under the multispecies coalescent and time-dependent migration BMC Evolut. Biol. 13, 44.
    OpenUrl
  21. 21.
    Hellenthal, G. and Stephens, M. (2007) msHOT: modifying Hudson’s ms simulator to incorporate crossover and gene conversion hotspots Bioinformatics 23, 520–521.
    OpenUrlCrossRefPubMedWeb of Science
  22. 22.↵
    Holland, B. R., Benthin, S., Lockhart, P. J., Moulton, V., and Huber, K. T. (2008) Using supernetworks to distinguish hybridization from lineage-sorting BMC Evol. Biol. 8:202.
    OpenUrlCrossRefPubMed
  23. 23.↵
    Hudson, R. R. (1990). Gene genealogies and the coalescent process. Oxford Surveys Evolution Biology 7, 1–44.
    OpenUrl
  24. 24.↵
    Hudson, R. R. (2002). Generating samples under a Wright-Fisher neutral model. Bioinformatics 18, 337–338.
    OpenUrlCrossRefPubMedWeb of Science
  25. 25.↵
    Huson, D., Rupp, R. and Scornavacca, C. (2010). Phylogenetic Networks: Concepts, Algorithms and Applications. Phylogenetic Networks: Concepts, Algorithms and Applications. Cambridge University Press.
  26. 26.↵
    Jones, G., Sagitov, S., Oxelman, B. (2013) Statistical inference of allopolyploid species networks in the presence of incomplete lineage sorting, Syst. Biol. 62, 467–478.
    OpenUrlCrossRefPubMed
  27. 27.
    Laval, G. and Excoffier, L. (2004). SIMCOAL 2.0: a program to simulate genomic diversity over large recombining regions in a subdivided population with a complex history Bioinformatics 20, 2485–2487.
    OpenUrlCrossRefPubMedWeb of Science
  28. 28.↵
    Kingman, J. F. C. (1982) On the genealogy of large populations, J. App. Probab. 19A, 27–43.
    OpenUrl
  29. 29.↵
    Kubatko, L. S. (2009) Identifying hybridization events in the presence of coalescence via model selection Syst. Biol. 58, 478–488.
    OpenUrlCrossRefPubMed
  30. 30.↵
    Laval, G. and Excoffier, L. (2004) SIMCOAL 2.0: a program to simulate genomic diversity over large recombining regions in a subdivided population with a complex history. Bioinformatics 20, 2485–2487.
    OpenUrlCrossRefPubMedWeb of Science
  31. 31.
    Liang, L. and Zöllner, S. and Abecasis, G. R. (2007) GENOME: a rapid coalescent-based whole genome simulator Bioinformatics 23, 1565–1567.
    OpenUrlCrossRefPubMedWeb of Science
  32. 32.↵
    Meng, C. and Kubatko, L. S. (2009) Detecting hybrid speciation in the presence of incomplete lineage sorting using gene tree incongruence: A model Theor. Popul. Biol. 75, 35–45.
    OpenUrlCrossRefPubMedWeb of Science
  33. 33.
    Mailund, T. and Schierup, H. and Pedersen, C. N. S. and Mechlenborg, P. J. M. and Madsen, J. N. and Schauser, L. (2005) CoaSim A Flexible Environment for Simulating Genetic Data under Coalescent Models BMC Bioinformatics 6, 252.
    OpenUrlCrossRefPubMed
  34. 34.↵
    Olsen, G. (1990). Gary Olsen’s interpretation of the “Newick’s 8:45” tree format standard. http://evolution.genetics.washington.edu/phylip/newickdoc.html.
  35. 35.↵
    Perrin, C., Wing, S. R., and Roy, M. S. (2004) Effects of hydrographic barriers on population genetic structure of the sea star Coscinasterias muricata (Echinodermata, Asteroidea) in the New Zealand fiords Mol. Ecol. 13, 2183–2195.
    OpenUrlCrossRefPubMed
  36. 36.↵
    Pitman, J. (1999). Coalescents with multiple collisions. Ann. Probab. 27, 1870–1902.
    OpenUrlCrossRefWeb of Science
  37. 37.↵
    Sagitov, S. (1999) The general coalescent with asynchronous mergers of ancestral lines, J. Appl. Probab. 36, 1116–1125.
    OpenUrlCrossRef
  38. 38.↵
    R Core Team (2012). R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. ISBN 3-900051-07-0.
  39. 39.↵
    Sargsyan, O. and Wakeley, J. (2008). A coalescent process with simultaneous multiple mergers for approximating the gene genealogies of many marine organisms. Theor. Popul. Biol. 74, 104–114.
    OpenUrlCrossRefPubMed
  40. 40.↵
    Schweinsberg, J. (2003). Coalescent processes obtained from supercritical Galton-Watson processes. Stoch. Proc. Appl. 106, 107–139.
    OpenUrlCrossRef
  41. 41.
    Staab, P. R., S. Zhu, D. Metzler, and G. Lunter (2015). scrm: efficiently simulating long sequences using the approximated coalescent with recombination. Bioinformatics 31 (10), 1680–1682.
    OpenUrlCrossRefPubMed
  42. 42.↵
    Tellier, A. and C. Lemaire (2014). Coalescence 2.0: a multiple branching of recent theoretical developments and their applications. Mol Ecol 23, 2637–2652.
    OpenUrlCrossRef
  43. 43.↵
    Than, C., Ruths, D., and Nakhleh, L. (2008) PhyloNet: a software package for analyzing and reconstructing reticulate evolutionary relationships, BMC Bioinformatics, 9, 322. doi: 10.1186/1471-2105-9-322
    OpenUrlCrossRefPubMed
  44. 44.↵
    Waters, J. M., and Roy, M. S. (2004). Phylogeography of a high-dispersal New Zealand sea-star: does upwelling block gene-flow? Mol Ecol 13, 2797–2806.
    OpenUrlCrossRefPubMed
  45. 45.↵
    Yu, Y., Than, C., Degnan, J. H. and Nakhleh, L. (2011). Coalescent histories on phylogenetic networks and detection of hybridization despite incomplete lineage sorting, Syst. Biol. 60, 138–149.
    OpenUrlCrossRefPubMed
  46. 46.↵
    Yu, Y., Degnan, J. H. and Nakhleh, L. (2012). The probability of a gene tree topology within a phylogenetic network with applications to hybridization detection PLoS Genet 8, e1002660. doi: 10.1371/journal.pgen.1002660.
    OpenUrlCrossRefPubMed
Back to top
PreviousNext
Posted July 29, 2015.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Hybrid-Lambda: simulation of multiple merger and Kingman gene genealogies in species networks and species trees
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Hybrid-Lambda: simulation of multiple merger and Kingman gene genealogies in species networks and species trees
Sha Zhu, James H. Degnan, Sharyn J. Goldstien, Bjarki Eldon
bioRxiv 023465; doi: https://doi.org/10.1101/023465
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Hybrid-Lambda: simulation of multiple merger and Kingman gene genealogies in species networks and species trees
Sha Zhu, James H. Degnan, Sharyn J. Goldstien, Bjarki Eldon
bioRxiv 023465; doi: https://doi.org/10.1101/023465

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Evolutionary Biology
Subject Areas
All Articles
  • Animal Behavior and Cognition (3579)
  • Biochemistry (7526)
  • Bioengineering (5486)
  • Bioinformatics (20703)
  • Biophysics (10261)
  • Cancer Biology (7939)
  • Cell Biology (11585)
  • Clinical Trials (138)
  • Developmental Biology (6574)
  • Ecology (10145)
  • Epidemiology (2065)
  • Evolutionary Biology (13556)
  • Genetics (9502)
  • Genomics (12796)
  • Immunology (7888)
  • Microbiology (19460)
  • Molecular Biology (7618)
  • Neuroscience (41917)
  • Paleontology (307)
  • Pathology (1253)
  • Pharmacology and Toxicology (2182)
  • Physiology (3253)
  • Plant Biology (7011)
  • Scientific Communication and Education (1291)
  • Synthetic Biology (1942)
  • Systems Biology (5410)
  • Zoology (1108)