Skip to main content

A Scheme for Adaptive Selection of Population Sizes in Approximate Bayesian Computation - Sequential Monte Carlo

  • Conference paper
  • First Online:
Computational Methods in Systems Biology (CMSB 2017)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 10545))

Included in the following conference series:

Abstract

Parameter inference and model selection in systems biology often requires likelihood-free methods, such as Approximate Bayesian Computation (ABC). In recent years, this approach has frequently been combined with a Sequential Monte Carlo (ABC-SMC) scheme. In this scheme, the approximation of the posterior distribution through a population of particles is iteratively improved by a sequential sampling strategy. However, it has been difficult to give general guidelines on how to choose the size of these populations. In this manuscript, we propose a method to adaptively and automatically select these population sizes. The method exploits the cross-validated approximation error of a kernel density estimate of the particles in the current population to select the number of particles for the subsequent population.

We found the proposed method to be robust to the initially chosen population size and to the number of posterior modes. We demonstrated that the method is applicable to parameter inference as well as to model selection. The study of a computationally demanding multiscale model confirmed the method’s scalability. In conclusion, the proposed method is applicable to a wide range of parameter and model selection tasks. The method makes the influence of the population size on the approximation error explicit simplifying the application of ABC-SMC schemes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A population is a sequence of pairs. Pairs \((w,\theta )\) can occur multiple times in a population. The curly braces \(\{\}\) denote a finite sequence (not a set).

References

  1. Beaumont, M.A.: Approximate Bayesian Computation in evolution and ecology. Annu. Rev. Ecol. Evol. Syst. 41(1), 379–406 (2010)

    Article  Google Scholar 

  2. Beaumont, M.A., Zhang, W., Balding, D.J.: Approximate Bayesian Computation in population genetics. Genetics 162(4), 2025–2035 (2002)

    Google Scholar 

  3. Bowman, A.W., Azzalini, A.: Applied Smoothing Techniques for Data Analysis: The Kernel Approach with S-Plus Illustrations. Oxford University Press, Oxford (1997)

    MATH  Google Scholar 

  4. Daigle, B.J., Roh, M.K., Petzold, L.R., Niemi, J.: Accelerated maximum likelihood parameter estimation for stochastic biochemical systems. BMC Bioinform. 13, 68 (2012)

    Article  Google Scholar 

  5. Eigen, M.: Prionics or the kinetic basis of prion diseases. Biophys. Chem. 63(1), A1–A18 (1996)

    Article  Google Scholar 

  6. Filippi, S., Barnes, C.P., Cornebise, J., Stumpf, M.P.H.: On optimality of kernels for Approximate Bayesian Computation using Sequential Monte Carlo. Stat. Appl. Genet. Mol. Biol. 12(1), 87–107 (2013)

    Article  MathSciNet  Google Scholar 

  7. Fox, C.W., Roberts, S.J.: A tutorial on variational Bayesian inference. Artifi. Intell. Rev. 38(2), 85–95 (2012)

    Article  Google Scholar 

  8. Gillespie, D.T.: Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81(25), 2340–2361 (1977)

    Article  Google Scholar 

  9. Jagiella, N., Rickert, D., Theis, F.J., Hasenauer, J.: Parallelization and high-performance computing enables automated statistical inference of multi-scale models. Cell Syst. 4(2), 194–206 (2017)

    Article  Google Scholar 

  10. Koutroumpas, K., Ballarini, P., Votsi, I., Cournède, P.H.: Bayesian parameter estimation for the WNT pathway: an infinite mixture models approach. Bioinformatics 32(17), i781–i789 (2016)

    Article  Google Scholar 

  11. Levenberg, K.: A method for the solution of certain non-linear problems in least squares. Q. Appl. Math. 2(2), 164–168 (1944)

    Article  MathSciNet  MATH  Google Scholar 

  12. Lück, A., Wolf, V.: Generalized method of moments for estimating parameters of stochastic reaction networks. BMC Syst. Biol. 10, 98 (2016)

    Article  Google Scholar 

  13. Marin, J.M., Pillai, N.S., Robert, C.P., Rousseau, J.: Relevant statistics for Bayesian model choice. J. Roy. Stat. Soc. Ser. B (Stat. Methodol.) 76(5), 833–859 (2014)

    Article  MathSciNet  Google Scholar 

  14. Marjoram, P., Molitor, J., Plagnol, V., Tavaré, S.: Markov Chain Monte Carlo without likelihoods. Proce. Natl. Acad. Sci. 100(26), 15324–15328 (2003)

    Article  Google Scholar 

  15. Marquardt, D.W.: An algorithm for least-squares estimation of nonlinear parameters. J. Soc. Ind. Appl. Math. 11(2), 431–441 (1963)

    Article  MathSciNet  MATH  Google Scholar 

  16. Moral, P.D., Doucet, A., Jasra, A.: An adaptive Sequential Monte Carlo method for Approximate Bayesian Computation. Stat. Comput. 22(5), 1009–1020 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  17. Nunes, M.A., Balding, D.J.: On optimal selection of summary statistics for approximate bayesian computation. Stat. Appl. Genet. Mol. Biol. 9(1), 34 (2010). doi:10.2202/1544-6115.1576

    Article  MathSciNet  MATH  Google Scholar 

  18. Prusiner, S.B.: Novel proteinaceous infectious particles cause Scrapie. Science 216(4542), 136–144 (1982)

    Article  Google Scholar 

  19. Salgado-Ugarte, I.H., Perez-Hernandez, M.A.: Exploring the use of variable bandwidth kernel density estimators. Stata J. 3(2), 133–147 (2003)

    Google Scholar 

  20. Silk, D., Filippi, S., Stumpf, M.P.H.: Optimizing threshold-schedules for sequential Approximate Bayesian Computation: applications to molecular systems. Stat. Appl. Genet. Mol. Biol. 12(5), 603–618 (2013)

    Article  MathSciNet  Google Scholar 

  21. Silverman, B.W.: Density Estimation for Statistics and Data Analysis, vol. 26. CRC Press, Boca Raton (1986)

    Book  MATH  Google Scholar 

  22. Sisson, S.A., Fan, Y., Tanaka, M.M.: Sequential Monte Carlo without likelihoods. Proc. Natl. Acad. Sci. 104(6), 1760–1765 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  23. de Souza, L.G.M., Haida, H., Thévenin, D., Seidel-Morgenstern, A., Janiga, G.: Model selection and parameter estimation for chemical reactions using global model structure. Comput. Chem. Eng. 58, 269–277 (2013)

    Article  Google Scholar 

  24. Sunnåker, M., Busetto, A.G., Numminen, E., Corander, J., Foll, M., Dessimoz, C.: Approximate Bayesian Computation. PLOS Comput. Biol. 9(1), e1002803 (2013)

    Article  MathSciNet  Google Scholar 

  25. Toni, T., Stumpf, M.P.H.: Simulation-based model selection for dynamical systems in systems and population biology. Bioinformatics 26(1), 104–110 (2010)

    Article  Google Scholar 

  26. Toni, T., Welch, D., Strelkowa, N., Ipsen, A., Stumpf, M.P.H.: Approximate Bayesian Computation scheme for parameter inference and model selection in dynamical systems. J. Roy. Soc. Interface 6(31), 187–202 (2009)

    Article  Google Scholar 

  27. Westerhuis, J.A., Boelens, H.F.M., Iron, D., Rothenberg, G.: Model selection and optimal sampling in high-throughput experimentation. Anal. Chem. 76(11), 3171–3178 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jan Hasenauer .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Klinger, E., Hasenauer, J. (2017). A Scheme for Adaptive Selection of Population Sizes in Approximate Bayesian Computation - Sequential Monte Carlo. In: Feret, J., Koeppl, H. (eds) Computational Methods in Systems Biology. CMSB 2017. Lecture Notes in Computer Science(), vol 10545. Springer, Cham. https://doi.org/10.1007/978-3-319-67471-1_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67471-1_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67470-4

  • Online ISBN: 978-3-319-67471-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics