Approximate Bayesian computation: Difference between revisions

Content deleted Content added
Outline basic ABC rejection procedure
Bender the Bot (talk | contribs)
m Software: HTTP to HTTPS for SourceForge
 
(308 intermediate revisions by 99 users not shown)
Line 1:
{{short description|Computational method in Bayesian statistics}}
'''Approximate Bayesian computation (ABC)''' is a family of computational techniques in [[Bayesian statistics]]. These simulation techniques operate on summary data (such as population mean, or variance) to make broad inferences with less computation than might be required if all available data were analyzed in detail. They are especially useful in situations where evaluation of the likelihood is computationally prohibitive, or whenever suitable likelihoods are not available.
{{Bayesian statistics}}
 
'''Approximate Bayesian computation''' ('''ABC''') constitutes a class of [[Computational science|computational methods]] rooted in [[Bayesian statistics]] that can be used to estimate the posterior distributions of model parameters.
ABC methods originated in population and evolutionary genetics <ref name=Pritchard1999>{{cite journal|last = Pritchard|first = J. K.|coauthors = Seielstad, M. T., Perez-Lezaun, A., and Feldman, M. T.|title = Population Growth of Human Y Chromosomes: A Study of Y Chromosome Microsatellites|journal = Mol. Biol. Evol.|volume = 16|date = 1999|pages = 1791–1798}}</ref><ref name=Beaumont>{{cite journal|last = Beaumont|first = M. A.|coauthors = Zhang, W. and [[David Balding|Balding, D. J.]]|title = Approximate Bayesian Computation in Population Genetics|journal = Genetics|volume = 162|date = Dec 2002|pages = 2025–2035|url = http://www.genetics.org/cgi/content/abstract/162/4/2025|pmid = 12524368}}</ref> but have recently also been introduced to the analysis of complex and stochastic [[dynamical systems]] <ref name=Toni2009>{{cite journal |author = Toni, T.; Welch, D.; Strelkowa, N.; Ipsen, A.; Stumpf, M.P.H. |year = 2009 |title = Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems | journal = Journal of the Royal Society Interface |volume = 6 |issue = 31 |pages = 187&ndash;202 |doi: 10.1098/​rsif.2008.0172 |url=http://rsif.royalsocietypublishing.org/content/6/31/187.abstract}}</ref>.
 
In all model-based [[statistical inference]], the [[likelihood|likelihood function]] is of central importance, since it expresses the probability of the observed data under a particular [[statistical model]], and thus quantifies the support data lend to particular values of parameters and to choices among different models. For simple models, an analytical formula for the likelihood function can typically be derived. However, for more complex models, an analytical formula might be elusive or the likelihood function might be computationally very costly to evaluate.
In standard Bayesian inference the [[posterior distribution]] is given by
 
ABC methods bypass the evaluation of the likelihood function. In this way, ABC methods widen the realm of models for which statistical inference can be considered. ABC methods are mathematically well-founded, but they inevitably make assumptions and approximations whose impact needs to be carefully assessed. Furthermore, the wider application ___domain of ABC exacerbates the challenges of [[Estimation Theory|parameter estimation]] and [[model selection]].
<math>P(\theta|D)\propto P(D|\theta) \pi(\theta)</math>
 
ABC has rapidly gained popularity over the last years and in particular for the analysis of complex problems arising in [[Biology|biological sciences]], e.g. in [[population genetics]], [[ecology]], [[epidemiology]], [[systems biology]], and in [[radio propagation]].<ref name="Bharti" />
where <math>\theta</math> are the parameters of a probability model, <math>D</math> are the observed data, and <math>\pi(\theta)</math> is the [[prior distribution]] of the parameters <math>\theta</math>. <math>P(D|\theta)</math> is the [[likelihood]] of <math>\theta</math>, that is the probability of observing the data <math>D</math> given the model with parameter <math>\theta</math>. The explicit evaluation of the likelihood is avoided in ABC approaches by considering distances between observed and data simulated from a model with parameter <math>\theta</math>. For sufficiently complex models and large data sets the probability of happening upon a simulation run that yields precisely the same dataset as the one observed will be very small, often unacceptably so. So rather than considering the data we consider a summary statistic of the data, <math>S(D)</math>, and use a distance <math>\Delta(S(D),S(X))</math> between the summary statistics of real and simulated data, <math>D</math> and <math>X</math>, respectively.
 
==History==
The first ABC-related ideas date back to the 1980s. [[Donald Rubin]], when discussing the interpretation of Bayesian statements in 1984,<ref name="Rubin" /> described a hypothetical sampling mechanism that yields a sample from the [[Posterior probability|posterior distribution]]. This scheme was more of a conceptual [[thought experiment]] to demonstrate what type of manipulations are done when inferring the posterior distributions of parameters. The description of the sampling mechanism coincides exactly with that of the [[#The ABC rejection algorithm|ABC-rejection scheme]], and this article can be considered to be the first to describe approximate Bayesian computation. However, a two-stage [[quincunx]] was constructed by [[Francis Galton]] in the late 1800s that can be seen as a physical implementation of an [[#The ABC rejection algorithm|ABC-rejection scheme]] for a single unknown (parameter) and a single observation.<ref name="Stigler2010">see figure 5 in {{cite journal|last1=Stigler|first1=Stephen M.|title=Darwin, Galton and the Statistical Enlightenment|journal=Journal of the Royal Statistical Society. Series A (Statistics in Society)|volume=173|issue=3|year=2010|pages=469–482|issn=0964-1998|doi=10.1111/j.1467-985X.2010.00643.x|s2cid=53333238 }}</ref> Another prescient point was made by Rubin when he argued that in Bayesian inference, applied statisticians should not settle for analytically tractable models only, but instead consider computational methods that allow them to estimate the posterior distribution of interest. This way, a wider range of models can be considered. These arguments are particularly relevant in the context of ABC.
 
In 1984, [[Peter Diggle]] and [[Richard Gratton]] suggested using a systematic simulation scheme to approximate the likelihood function in situations where its analytic form is [[Intractability (complexity)|intractable]].<ref name="Diggle" /> Their method was based on defining a grid in the parameter space and using it to approximate the likelihood by running several simulations for each grid point. The approximation was then improved by applying smoothing techniques to the outcomes of the simulations. While the idea of using simulation for hypothesis testing was not new,<ref name="Bartlett63" /><ref name="Hoel71" /> Diggle and Gratton seemingly introduced the first procedure using simulation to do statistical inference under a circumstance where the likelihood is intractable.
The generic ABC approach to infer the posterior probability of a parameter <math>\theta</math> is as follows:
 
Although Diggle and Gratton's approach had opened a new frontier, their method was not yet exactly identical to what is now known as ABC, as it aimed at approximating the likelihood rather than the posterior distribution. An article of [[Simon Tavaré]] and co-authors was first to propose an ABC algorithm for posterior inference.<ref name="Tavare" /> In their seminal work, inference about the genealogy of DNA sequence data was considered, and in particular the problem of deciding the posterior distribution of the time to the [[most recent common ancestor]] of the sampled individuals. Such inference is analytically intractable for many demographic models, but the authors presented ways of simulating coalescent trees under the putative models. A sample from the posterior of model parameters was obtained by accepting/rejecting proposals based on comparing the number of segregating sites in the synthetic and real data. This work was followed by an applied study on modeling the variation in human Y chromosome by [[Jonathan K. Pritchard]] and co-authors using the ABC method.<ref name="Pritchard1999" /> Finally, the term approximate Bayesian computation was established by Mark Beaumont and co-authors,<ref name="Beaumont2002" /> extending further the ABC methodology and discussing the suitability of the ABC-approach more specifically for problems in population genetics. Since then, ABC has spread to applications outside population genetics, such as systems biology, epidemiology, and [[phylogeography]].
'''1.''' Sample a candidate parameter vector <math>\theta^\ast</math> from some proposal distribution <math>\pi(\theta)</math>.
 
Approximate Bayesian computation can be understood as a kind of Bayesian version of [[indirect inference]].<ref>{{cite arXiv | eprint=1803.01999 | author1=Christopher C Drovandi | title=ABC and Indirect Inference | date=2018 | class=stat.CO }}</ref><ref name="Peters 2009">{{Cite journal |last=Peters |first=Gareth |date=2009 |title=Advances in Approximate Bayesian Computation and Trans-Dimensional Sampling Methodology |url=https://www.ssrn.com/abstract=3785580 |journal=SSRN Electronic Journal |language=en |doi=10.2139/ssrn.3785580 |issn=1556-5068|hdl=1959.4/50086 |hdl-access=free }}</ref>
'''2.''' Simulate a dataset <math>X</math> from the model with parameter <math>\theta^\ast</math>.
 
Several efficient Monte Carlo based approaches have been developed to perform sampling from the ABC posterior distribution for purposes of estimation and prediction problems. A popular choice is the SMC Samplers algorithm <ref>{{Cite journal |last1=Del Moral |first1=Pierre |last2=Doucet |first2=Arnaud |last3=Jasra |first3=Ajay |date=2006 |title=Sequential Monte Carlo Samplers |url=https://www.jstor.org/stable/3879283 |journal=Journal of the Royal Statistical Society. Series B (Statistical Methodology) |volume=68 |issue=3 |pages=411–436 |doi=10.1111/j.1467-9868.2006.00553.x |jstor=3879283 |issn=1369-7412|arxiv=cond-mat/0212648 }}</ref><ref>{{Cite journal |last1=Del Moral |first1=Pierre |last2=Doucet |first2=Arnaud |last3=Peters |first3=Gareth |date=2004 |title=Sequential Monte Carlo Samplers CUED Technical Report |url=https://www.ssrn.com/abstract=3841065 |journal=SSRN Electronic Journal |language=en |doi=10.2139/ssrn.3841065 |issn=1556-5068|url-access=subscription }}</ref><ref>{{Cite journal |last=Peters |first=Gareth |date=2005 |title=Topics in Sequential Monte Carlo Samplers |url=https://www.ssrn.com/abstract=3785582 |journal=SSRN Electronic Journal |language=en |doi=10.2139/ssrn.3785582 |issn=1556-5068|url-access=subscription }}</ref> adapted to the ABC context in the method (SMC-ABC).<ref>{{Cite journal |last1=Sisson |first1=S. A. |last2=Fan |first2=Y. |last3=Tanaka |first3=Mark M. |date=2007-02-06 |title=Sequential Monte Carlo without likelihoods |journal=Proceedings of the National Academy of Sciences |language=en |volume=104 |issue=6 |pages=1760–1765 |doi=10.1073/pnas.0607208104 |doi-access=free |issn=0027-8424 |pmc=1794282 |pmid=17264216|bibcode=2007PNAS..104.1760S }}</ref><ref name="Peters 2009"/><ref>{{Cite journal |last1=Peters |first1=G. W. |last2=Sisson |first2=S. A. |last3=Fan |first3=Y. |date=2012-11-01 |title=Likelihood-free Bayesian inference for α-stable models |url=https://www.sciencedirect.com/science/article/pii/S0167947310003786 |journal=Computational Statistics & Data Analysis |series=1st issue of the Annals of Computational and Financial Econometrics |volume=56 |issue=11 |pages=3743–3756 |doi=10.1016/j.csda.2010.10.004 |issn=0167-9473|url-access=subscription }}</ref><ref>{{Cite journal |last1=Peters |first1=Gareth W. |last2=Wüthrich |first2=Mario V. |last3=Shevchenko |first3=Pavel V. |date=2010-08-01 |title=Chain ladder method: Bayesian bootstrap versus classical bootstrap |url=https://www.sciencedirect.com/science/article/pii/S0167668710000351 |journal=Insurance: Mathematics and Economics |volume=47 |issue=1 |pages=36–51 |doi=10.1016/j.insmatheco.2010.03.007 |arxiv=1004.2548 |issn=0167-6687}}</ref>
'''3.''' If <math>\Delta(S(D),S(X))<\epsilon</math> then accept <math>\theta^\ast</math> as a sample from the posterior.
 
==Method==
For <math>\epsilon</math> sufficiently small the ABC procedure should deliver a good approximation to the true posterior, in particular if the summary statistic <math>S</math> is a [[sufficient statistic]] of the probability model. If sufficient statistics do not exist or are hard to come by, setting up a satisfying and efficient ABC approach can be challenging.
 
===Motivation===
The generic procedure outlined above can be computationally inefficient but ABC and likelihood-free inferential procedures can be combined with the standard computational approaches used in [[Bayesian inference]] such as [[Markov chain Monte Carlo]] <ref name=Marjoram>{{cite journal|last = Marjoram|first = P.|coauthors = Molitor, J., Plagnol, V. and Tavaré, S.|title = Markov chain Monte Carlo without likelihoods|journal = P Natl Acad Sci USA|volume = 100|number = 26|date = 2003|pages = 15324–15328|doi = 10.1073/pnas.0306899100|pmid = 14663152}} </ref> <ref name=Plagnol>{{cite journal|last = Plagnol|first = V.|coauthors = Tavaré, S.|title = Approximate Bayesian computation and MCMC|journal = Monte Carlo and Quasi-Monte Carlo Methods 2002|year = 2004|url = http://www-gene.cimr.cam.ac.uk/vplagnol/papers/vpst-web.pdf}} (The link is to a preprint.)</ref> and [[Sequential Monte Carlo method]] <ref name=Toni2009> </ref>approaches. In these frameworks ABC can be used to tackle otherwise computationally intractable problems.
A common incarnation of [[Bayes' theorem|Bayes’ theorem]] relates the [[conditional probability]] (or density) of a particular parameter value <math>\theta</math> given data <math>D</math> to the [[probability]] of <math>D</math> given <math>\theta</math> by the rule
 
:<math>p(\theta|D) = \frac{p(D|\theta)p(\theta)}{p(D)}</math>,
An increasing number of software implementations of ABC approaches exist <ref name=Cornuet>{{cite journal|last = Cornuet|first = J-M.|coauthors = Santos, F., Beaumont, M. A., Robert, C. P., Marin, J-M., [[David Balding|Balding, D. J.]], Guillemaud, T. and Estoup, A.|title = Inferring population history with DIY ABC: a user-friendly approach to Approximate Bayesian Computation|journal = Bioinformatics|year = 2008|url = http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn514|pmid = 18842597|doi = 10.1093/bioinformatics/btn514}} </ref>.
 
where <math>p(\theta|D)</math> denotes the posterior, <math>p(D|\theta)</math> the likelihood, <math>p(\theta)</math> the prior, and <math>p(D)</math> the evidence (also referred to as the [[marginal likelihood]] or the prior predictive probability of the data). Note that the denominator <math>p(D)</math> is normalizing the total probability of the posterior density <math>p(\theta|D)</math> to one and can be calculated that way.
==See also==
[[Markov chain Monte Carlo]]
 
The prior represents beliefs or knowledge (such as e.g. physical constraints) about <math>\theta</math> before <math>D</math> is available. Since the prior narrows down uncertainty, the posterior estimates have less variance, but might be biased. For convenience the prior is often specified by choosing a particular distribution among a set of well-known and tractable families of distributions, such that both the evaluation of prior probabilities and random generation of values of <math>\theta</math> are relatively straightforward. For certain kinds of models, it is more pragmatic to specify the prior <math>p(\theta)</math> using a factorization of the joint distribution of all the elements of <math>\theta</math> in terms of a sequence of their conditional distributions. If one is only interested in the relative posterior plausibilities of different values of <math>\theta</math>, the evidence <math>p(D)</math> can be ignored, as it constitutes a [[Normalizing constant|normalising constant]], which cancels for any ratio of posterior probabilities. It remains, however, necessary to evaluate the likelihood <math>p(D|\theta)</math> and the prior <math>p(\theta)</math>. For numerous applications, it is [[computationally expensive]], or even completely infeasible, to evaluate the likelihood,<ref name="Busetto2009a" /> which motivates the use of ABC to circumvent this issue.
[[Sequential Monte Carlo Method]]
==References==
{{reflist}}
 
===The ABC rejection algorithm===
==Softwares==
All ABC-based methods approximate the likelihood function by simulations, the outcomes of which are compared with the observed data.<ref>{{Cite journal |last=Hunter |first=Dawn |date=2006-12-08 |title=Bayesian inference, Monte Carlo sampling and operational risk |url=https://www.risk.net/journal-of-operational-risk/2160915/bayesian-inference-monte-carlo-sampling-and-operational-risk |journal=Journal of Operational Risk |volume=1 |issue=3 |pages=27–50 |language=en |doi=10.21314/jop.2006.014|url-access=subscription }}</ref><ref name="Peters 2009"/><ref name="Beaumont2010" /><ref name="Bertorelle" /><ref name="Csillery" /> More specifically, with the ABC rejection algorithm — the most basic form of ABC — a set of parameter points is first sampled from the prior distribution. Given a sampled parameter point <math>\hat{\theta}</math>, a data set <math>\hat{D}</math> is then simulated under the statistical model <math>M</math> specified by <math>\hat{\theta}</math>. If the generated <math>\hat{D}</math> is too different from the observed data <math>D</math>, the sampled parameter value is discarded. In precise terms, <math>\hat{D}</math> is accepted with tolerance <math>\epsilon \ge 0</math> if:
[http://www1.montpellier.inra.fr/CBGP/diyabc/ DIYABC] : "Do it yourself ABC".
 
:<math>\rho (\hat{D},D)\le\epsilon</math>,
 
where the distance measure <math>\rho(\hat{D},D)</math> determines the level of discrepancy between <math>\hat{D}</math> and <math>D</math> based on a given [[Metric (mathematics)|metric]] (e.g. [[Euclidean distance]]). A strictly positive tolerance is usually necessary, since the probability that the simulation outcome coincides exactly with the data (event <math>\hat{D}=D</math>) is negligible for all but trivial applications of ABC, which would in practice lead to rejection of nearly all sampled parameter points. The outcome of the ABC rejection algorithm is a sample of parameter values approximately distributed according to the desired posterior distribution, and, crucially, obtained without the need to explicitly evaluate the likelihood function.
 
[[Image:Approximate Bayesian computation conceptual overview.svg|632px|thumb|center|Parameter estimation by approximate Bayesian computation: a conceptual overview.]]
 
===Summary statistics===
The probability of generating a data set <math>\hat{D}</math> with a small distance to <math>D</math> typically decreases as the dimensionality of the data increases. This leads to a substantial decrease in the computational efficiency of the above basic ABC rejection algorithm. A common approach to lessen this problem is to replace <math>D</math> with a set of lower-dimensional [[summary statistics]] <math>S(D)</math>, which are selected to capture the relevant information in <math>D</math>. The acceptance criterion in ABC rejection algorithm becomes:
 
:<math>\rho(S(\hat{D}),S(D))\le\epsilon</math>.
 
If the summary statistics are [[Sufficient statistic|sufficient]] with respect to the model parameters <math>\theta</math>, the efficiency increase obtained in this way does not introduce any error.<ref name="Didelot" /> Indeed, by definition, sufficiency implies that all information in <math>D</math> about <math>\theta</math> is captured by <math>S(D)</math>.
 
As [[#Choice and sufficiency of summary statistics|elaborated below]], it is typically impossible, outside the [[Exponential family|exponential family of distributions]], to identify a finite-dimensional set of sufficient statistics. Nevertheless, informative but possibly insufficient summary statistics are often used in applications where inference is performed with ABC methods.
 
==Example==
[[Image:A dynamic bistable hidden Markov model.svg|326px|thumb|right| A dynamic bistable hidden Markov model]]
 
An illustrative example is a [[Bistability|bistable]] system that can be characterized by a [[Hidden Markov model|hidden Markov model (HMM)]] subject to measurement noise. Such models are employed for many biological systems: They have, for example, been used in development, [[cell signaling]], [[activation]]/deactivation, logical processing and [[non-equilibrium thermodynamics]]. For instance, the behavior of the [[Sonic hedgehog (protein)|Sonic hedgehog]] (Shh) transcription factor in ''[[Drosophila melanogaster]]'' can be modeled with an HMM.<ref name="Lai" /> The (biological) dynamical model consists of two states: A and B. If the probability of a transition from one state to the other is defined as <math>\theta</math> in both directions, then the probability to remain in the same state at each time step is <math>{1-\theta}</math>. The probability to measure the state correctly is <math>\gamma</math> (and conversely, the probability of an incorrect measurement is <math>{1-\gamma}</math>).
 
Due to the conditional dependencies between states at different time points, calculation of the likelihood of time series data is somewhat tedious, which illustrates the motivation to use ABC. A computational issue for basic ABC is the large dimensionality of the data in an application like this. The dimensionality can be reduced using the summary statistic <math>S</math>, which is the frequency of switches between the two states. The absolute difference is used as a distance measure <math>\rho(\cdot,\cdot)</math> with tolerance <math>\epsilon=2</math>. The posterior inference about the parameter <math>\theta</math> can be done following the five steps presented in.
 
'''Step 1:''' Assume that the observed data form the state sequence AAAABAABBAAAAAABAAAA, which is generated using <math>\theta=0.25</math> and <math>\gamma=0.8</math>. The associated summary statistic—the number of switches between the states in the experimental data—is <math>\omega_E=6</math>.
 
'''Step 2:''' Assuming nothing is known about <math>\theta</math>, a uniform prior in the interval <math>[0,1]</math> is employed. The parameter <math>\gamma</math> is assumed to be known and fixed to the data-generating value <math>\gamma=0.8</math>, but it could in general also be estimated from the observations. A total of <math>n</math> parameter points are drawn from the prior, and the model is simulated for each of the parameter points <math>\theta_i: \text{ } i = 1,\ldots, n</math>, which results in <math>n</math> sequences of simulated data. In this example, <math>n=5</math>, with each drawn parameter and simulated dataset recorded in [[#table1|Table 1, columns 2-3]]. In practice, <math>n</math> would need to be much larger to obtain an appropriate approximation.
 
{| class="sortable wikitable" style="float: right; margin-left: 1em; text-align: center;" id="table1"
|+Example of ABC rejection algorithm
|-
! i
! <math>\theta_i</math>
! Simulated datasets (step 2)
! Summary statistic <br/> <math>\omega_{S,i}</math> (step 3)
! Distance <math>\rho(\omega_{S,i}, \omega_E)</math> <br />(step 4)
! Outcome <br />(step 4)
|-
| 1
| 0.08
| AABAAAABAABAAABAAAAA
| 8
| 2
| accepted
|-
| 2
| 0.68
| AABBABABAAABBABABBAB
| 13
| 7
| rejected
|-
| 3
| 0.87
| BBBABBABBBBABABBBBBA
| 9
| 3
| rejected
|-
| 4
| 0.43
| AABAAAAABBABBBBBBBBA
| 6
| 0
| accepted
|-
| 5
| 0.53
| ABBBBBAABBABBABAABBB
| 9
| 3
| rejected
|}
 
'''Step 3:''' The summary statistic is computed for each sequence of simulated data <math>\omega_{S,i}: \text{ } i = 1,\ldots,n</math>.
 
'''Step 4:''' The distance between the observed and simulated transition frequencies <math>\rho(\omega_{S,i}, \omega_E) = |\omega_{S,i} - \omega_{E}|</math> is computed for all parameter points. Parameter points for which the distance is smaller than or equal to <math>\epsilon</math> are accepted as approximate samples from the posterior.[[Image:Approximate Bayesian Computation example.svg|500px|thumb|right| Posterior of <math>\theta</math> obtained in the example (red), compared to the true posterior distribution (black) and ABC simulations with large <math>n</math>. The use of the insufficient summary statistic <math>\omega</math> introduces bias, even when requiring <math>\epsilon=0</math> (light green).]]
 
'''Step 5:''' The posterior distribution is approximated with the accepted parameter points. The posterior distribution should have a non-negligible probability for parameter values in a region around the true value of <math>\theta</math> in the system if the data are sufficiently informative. In this example, the posterior probability mass is evenly split between the values 0.08 and 0.43.
 
The posterior probabilities are obtained via ABC with large <math>n</math> by utilizing the summary statistic (with <math>\epsilon = 0 </math> and <math>\epsilon = 2 </math>) and the full data sequence (with <math>\epsilon = 0 </math>). These are compared with the true posterior, which can be computed exactly and efficiently using the [[Viterbi algorithm]]. The summary statistic utilized in this example is not sufficient, as the deviation from the theoretical posterior is significant even under the stringent requirement of <math>\epsilon = 0 </math>. A much longer observed data sequence would be needed to obtain a posterior concentrated around <math>\theta = 0.25</math>, the true value of <math>\theta</math>.
 
This example application of ABC uses simplifications for illustrative purposes. More realistic applications of ABC are available in a growing number of peer-reviewed articles.<ref name="Beaumont2010" /><ref name="Bertorelle" /><ref name="Csillery" /><ref name="Marin11" /><ref>{{cite book |first=Christian P. |last=Robert |chapter=Approximate Bayesian Computation: A Survey on Recent Results |year=2016 |editor-last=Cools |editor-first=R. |editor2-last=Nuyens |editor2-first=D. |title=Monte Carlo and Quasi-Monte Carlo Methods |pages=185–205 |series=Springer Proceedings in Mathematics & Statistics |volume=163 |isbn=978-3-319-33505-6 |doi=10.1007/978-3-319-33507-0_7 }}</ref>
 
==Model comparison with ABC==
Outside of parameter estimation, the ABC framework can be used to compute the posterior probabilities of different candidate models.<ref name="Wilkinson2007" /><ref name="Grelaud" /><ref name="Toni2010" /> In such applications, one possibility is to use rejection sampling in a hierarchical manner. First, a model is sampled from the prior distribution for the models. Then, parameters are sampled from the prior distribution assigned to that model. Finally, a simulation is performed as in single-model ABC. The relative acceptance frequencies for the different models now approximate the posterior distribution for these models. Again, computational improvements for ABC in the space of models have been proposed, such as constructing a particle filter in the joint space of models and parameters.<ref name="Toni2010" />
 
Once the posterior probabilities of the models have been estimated, one can make full use of the techniques of [[Bayesian model comparison]]. For instance, to compare the relative plausibilities of two models <math>M_1</math> and <math>M_2</math>, one can compute their posterior ratio, which is related to the [[Bayes factor]] <math>B_{1,2}</math>:
 
:<math>\frac{p(M_1 |D)}{p(M_2|D)}=\frac{p(D|M_1)}{p(D|M_2)}\frac{p(M_1)}{p(M_2)} = B_{1,2}\frac{p(M_1)}{p(M_2)}</math>.
 
If the model priors are equal—that is, <math>p(M_1)=p(M_2)</math>—the Bayes factor equals the posterior ratio.
 
In practice, [[#Bayes factor with ABC and summary statistics|as discussed below]], these measures can be highly sensitive to the choice of parameter prior distributions and summary statistics, and thus conclusions of model comparison should be drawn with caution.
 
==Pitfalls and remedies==
 
{| class="sortable wikitable" style="float: right; margin-left: 1em;" width="75%" id="table2"
|+Potential risks and remedies in ABC-based statistical inference
 
|-
! Error source
! Potential issue
! Solution
! Subsection
|-
| Nonzero tolerance <math>\epsilon</math>
| The inexactness introduces bias into the computed posterior distribution.
| Theoretical/practical studies of the sensitivity of the posterior distribution to the tolerance. Noisy ABC.
| [[#Approximation of the posterior]]
|-
| Insufficient summary statistics
| The information loss causes inflated credible intervals.
| Automatic selection/semi-automatic identification of sufficient statistics. Model validation checks (e.g., Templeton 2009<ref name="Templeton2009b" />).
| [[#Choice and sufficiency of summary statistics]]
|-
| Small number of models/incorrectly specified models
| The investigated models are not representative/lack predictive power.
| Careful selection of models. Evaluation of the predictive power.
| [[#Small number of models]]
|-
| Priors and parameter ranges
| Conclusions may be sensitive to the choice of priors. Model choice may be meaningless.
| Check sensitivity of Bayes factors to the choice of priors. Some theoretical results regarding choice of priors are available. Use alternative methods for model validation.
| [[#Prior distribution and parameter ranges]]
|-
| Curse of dimensionality
| Low parameter acceptance rates. Model errors cannot be distinguished from an insufficient exploration of the parameter space. Risk of overfitting.
| Methods for model reduction if applicable. Methods to speed up the parameter exploration. Quality controls to detect overfitting.
| [[#Curse of dimensionality]]
|-
| Model ranking with summary statistics
| The computation of Bayes factors on summary statistics may not be related to the Bayes factors on the original data, which may therefore render the results meaningless.
| Only use summary statistics that fulfill the necessary and sufficient conditions to produce a consistent Bayesian model choice. Use alternative methods for model validation.
| [[#Bayes factor with ABC and summary statistics]]
|-
| Implementation
| Low protection to common assumptions in the simulation and the inference process.
| Sanity checks of results. Standardization of software.
| [[#Indispensable quality controls]]
|}
 
As for all statistical methods, a number of assumptions and approximations are inherently required for the application of ABC-based methods to real modeling problems. For example, setting the [[#The ABC rejection algorithm|tolerance parameter <math>\epsilon </math>]] to zero ensures an exact result, but typically makes computations prohibitively expensive. Thus, values of <math>\epsilon</math> larger than zero are used in practice, which introduces a bias. Likewise, sufficient statistics are typically not available and instead, other summary statistics are used, which introduces an additional bias due to the loss of information. Additional sources of bias- for example, in the context of model selection—may be more subtle.<ref name="Didelot" /><ref name="Robert" />
 
At the same time, some of the criticisms that have been directed at the ABC methods, in particular within the field of [[phylogeography]],<ref name="Templeton2009b" /><ref name="Templeton2008" /><ref name="Templeton2009a" /> are not specific to ABC and apply to all Bayesian methods or even all statistical methods (e.g., the choice of prior distribution and parameter ranges).<ref name="Beaumont2010" /><ref name="Berger" /> However, because of the ability of ABC-methods to handle much more complex models, some of these general pitfalls are of particular relevance in the context of ABC analyses.
 
This section discusses these potential risks and reviews possible ways to address them.
 
===Approximation of the posterior===
A non-negligible <math>\epsilon</math> comes with the price that one samples from <math>p(\theta|\rho(\hat{D},D)\le\epsilon)</math> instead of the true posterior <math>p(\theta|D)</math>. With a sufficiently small tolerance, and a sensible distance measure, the resulting distribution <math>p(\theta|\rho(\hat{D},D)\le\epsilon)</math> should often approximate the actual target distribution <math>p(\theta|D)</math> reasonably well. On the other hand, a tolerance that is large enough that every point in the parameter space becomes accepted will yield a replica of the prior distribution. There are empirical studies of the difference between <math>p(\theta|\rho(\hat{D},D)\le\epsilon)</math> and <math>p(\theta|D)</math> as a function of <math>\epsilon</math>,<ref name="Sisson" /><ref name="Peters 2009"/> and theoretical results for an upper <math>\epsilon</math>-dependent bound for the error in parameter estimates.<ref name="Dean" /> The accuracy of the posterior (defined as the expected quadratic loss) delivered by ABC as a function of <math>\epsilon</math> has also been investigated.<ref name="Fearnhead" /> However, the convergence of the distributions when <math>\epsilon</math> approaches zero, and how it depends on the distance measure used, is an important topic that has yet to be investigated in greater detail. In particular, it remains difficult to disentangle errors introduced by this approximation from errors due to model mis-specification.<ref name="Beaumont2010" />
 
As an attempt to correct some of the error due to a non-zero <math>\epsilon</math>, the usage of local linear weighted regression with ABC to reduce the variance of the posterior estimates has been suggested.<ref name="Beaumont2002" /> The method assigns weights to the parameters according to how well simulated summaries adhere to the observed ones and performs linear regression between the summaries and the weighted parameters in the vicinity of observed summaries. The obtained regression coefficients are used to correct sampled parameters in the direction of observed summaries. An improvement was suggested in the form of nonlinear regression using a feed-forward neural network model.<ref name="Blum2010" /> However, it has been shown that the posterior distributions obtained with these approaches are not always consistent with the prior distribution, which did lead to a reformulation of the regression adjustment that respects the prior distribution.<ref name="Leuenberger2009" />
 
Finally, statistical inference using ABC with a non-zero tolerance <math>\epsilon</math> is not inherently flawed: under the assumption of measurement errors, the optimal <math>\epsilon</math> can in fact be shown to be not zero.<ref name="Fearnhead" /><ref name="Wilkinson" /> Indeed, the bias caused by a non-zero tolerance can be characterized and compensated by introducing a specific form of noise to the summary statistics. Asymptotic consistency for such “noisy ABC”, has been established, together with formulas for the asymptotic variance of the parameter estimates for a fixed tolerance.<ref name="Fearnhead" />
 
===Choice and sufficiency of summary statistics===
Summary statistics may be used to increase the acceptance rate of ABC for high-dimensional data. Low-dimensional sufficient statistics are optimal for this purpose, as they capture all relevant information present in the data in the simplest possible form.<ref name="Csillery" /><ref>{{Cite journal |last1=Peters |first1=Gareth William |last2=Wuthrich |first2=Mario V. |last3=Shevchenko |first3=Pavel V. |date=2009 |title=Chain Ladder Method: Bayesian Bootstrap Versus Classical Bootstrap |url=https://dx.doi.org/10.2139/ssrn.2980411 |journal=SSRN Electronic Journal |doi=10.2139/ssrn.2980411 |arxiv=1004.2548 |issn=1556-5068}}</ref><ref>{{cite arXiv|last1=Peters |first1=G. W. |title=Likelihood-free Bayesian inference for alpha-stable models |date=2009-12-23 |last2=Sisson |first2=S. A. |last3=Fan |first3=Y.|class=stat.CO |eprint=0912.4729 }}</ref> However, low-dimensional sufficient statistics are typically unattainable for statistical models where ABC-based inference is most relevant, and consequently, some [[heuristic]] is usually necessary to identify useful low-dimensional summary statistics. The use of a set of poorly chosen summary statistics will often lead to inflated [[credible interval]]s due to the implied loss of information,<ref name="Csillery" /> which can also bias the discrimination between models. A review of methods for choosing summary statistics is available,<ref name="Blum12" /> which may provide valuable guidance in practice.
 
One approach to capture most of the information present in data would be to use many statistics, but the accuracy and stability of ABC appears to decrease rapidly with an increasing numbers of summary statistics.<ref name="Beaumont2010" /><ref name="Csillery" /> Instead, a better strategy is to focus on the relevant statistics only—relevancy depending on the whole inference problem, on the model used, and on the data at hand.<ref name="Nunes" />
 
An algorithm has been proposed for identifying a representative subset of summary statistics, by iteratively assessing whether an additional statistic introduces a meaningful modification of the posterior.<ref name="Joyce" /> One of the challenges here is that a large ABC approximation error may heavily influence the conclusions about the usefulness of a statistic at any stage of the procedure. Another method<ref name="Nunes" /> decomposes into two main steps. First, a reference approximation of the posterior is constructed by minimizing the [[Entropy (statistical thermodynamics)|entropy]]. Sets of candidate summaries are then evaluated by comparing the ABC-approximated posteriors with the reference posterior.
 
With both of these strategies, a subset of statistics is selected from a large set of candidate statistics. Instead, the [[partial least squares regression]] approach uses information from all the candidate statistics, each being weighted appropriately.<ref name="Wegmann" /> Recently, a method for constructing summaries in a semi-automatic manner has attained a considerable interest.<ref name="Fearnhead" /> This method is based on the observation that the optimal choice of summary statistics, when minimizing the quadratic loss of the parameter point estimates, can be obtained through the posterior mean of the parameters, which is approximated by performing a linear regression based on the simulated data. Summary statistics for model selection have been obtained using [[multinomial logistic regression]] on simulated data, treating competing models as the label to predict.<ref name="Prangle" />
 
Methods for the identification of summary statistics that could also simultaneously assess the influence on the approximation of the posterior would be of substantial value.<ref name="Marjoram" /> This is because the choice of summary statistics and the choice of tolerance constitute two sources of error in the resulting posterior distribution. These errors may corrupt the ranking of models and may also lead to incorrect model predictions.
 
===Bayes factor with ABC and summary statistics===
It has been shown that the combination of insufficient summary statistics and ABC for model selection can be problematic.<ref name="Didelot" /><ref name="Robert" /> Indeed, if one lets the Bayes factor based on the summary statistic <math>S(D)</math> be denoted by <math>B_{1,2}^s</math>, the relation between <math>B_{1,2}</math> and <math>B_{1,2}^s</math> takes the form:<ref name="Didelot" />
 
:<math>B_{1,2}=\frac{p(D|M_1)}{p(D|M_2)}=\frac{p(D|S(D),M_1)}{p(D|S(D),M_2)} \frac{p(S(D)|M_1)}{p(S(D)|M_2)}=\frac{p(D|S(D),M_1)}{p(D|S(D),M_2)} B_{1,2}^s</math>.
 
Thus, a summary statistic <math>S(D)</math> is sufficient for comparing two models <math>M_1</math> and <math>M_2</math> if and only if:
 
:<math>p(D|S(D),M_1)=p(D|S(D),M_2)</math>,
 
which results in that <math>B_{1,2}=B_{1,2}^s</math>. It is also clear from the equation above that there might be a huge difference between <math>B_{1,2}</math> and <math>B_{1,2}^s</math> if the condition is not satisfied, as can be demonstrated by toy examples.<ref name="Didelot" /><ref name="Grelaud" /><ref name="Robert" /> Crucially, it was shown that sufficiency for <math>M_1</math> or <math>M_2</math> alone, or for both models, does not guarantee sufficiency for ranking the models.<ref name="Didelot" /> However, it was also shown that any [[Sufficient statistic|sufficient summary statistic]] for a model <math>M</math> in which both <math>M_1</math> and <math>M_2</math> are [[Multilevel model|nested]] is valid for ranking the [[Multilevel model|nested models]].<ref name="Didelot" />
 
The computation of Bayes factors on <math>S(D)</math> may therefore be misleading for model selection purposes, unless the ratio between the Bayes factors on <math>D</math> and <math>S(D)</math> would be available, or at least could be approximated reasonably well. Alternatively, necessary and sufficient conditions on summary statistics for a consistent Bayesian model choice have recently been derived,<ref name="Marin" /> which can provide useful guidance.
 
However, this issue is only relevant for model selection when the dimension of the data has been reduced. ABC-based inference, in which the actual data sets are directly compared—as is the case for some systems biology applications (e.g., see <ref name="Toni" />)—circumvents this problem.
 
===Indispensable quality controls===
As the above discussion makes clear, any ABC analysis requires choices and trade-offs that can have a considerable impact on its outcomes. Specifically, the choice of competing models/hypotheses, the number of simulations, the choice of summary statistics, or the acceptance threshold cannot currently be based on general rules, but the effect of these choices should be evaluated and tested in each study.<ref name="Bertorelle" />
 
A number of [[Heuristic (computer science)|heuristic approaches]] to the quality control of ABC have been proposed, such as the quantification of the fraction of parameter variance explained by the summary statistics.<ref name="Bertorelle" /> A common class of methods aims at assessing whether or not the inference yields valid results, regardless of the actually observed data. For instance, given a set of parameter values, which are typically drawn from the prior or the posterior distributions for a model, one can generate a large number of artificial datasets. In this way, the quality and robustness of ABC inference can be assessed in a controlled setting, by gauging how well the chosen ABC inference method recovers the true parameter values, and also models if multiple structurally different models are considered simultaneously.
 
Another class of methods assesses whether the inference was successful in light of the given observed data, for example, by comparing the [[posterior predictive distribution]] of summary statistics to the summary statistics observed.<ref name="Bertorelle" /> Beyond that, [[Cross-validation (statistics)|cross-validation]] techniques<ref name="Arlot" /> and [[Predictive analytics|predictive checks]]<ref name="Dawid" /><ref name="Vehtari" /> represent promising future strategies to evaluate the stability and out-of-sample predictive validity of ABC inferences. This is particularly important when modeling large data sets, because then the posterior support of a particular model can appear overwhelmingly conclusive, even if all proposed models in fact are poor representations of the stochastic system underlying the observation data. Out-of-sample predictive checks can reveal potential systematic biases within a model and provide clues on to how to improve its structure or parametrization.
 
Fundamentally novel approaches for model choice that incorporate quality control as an integral step in the process have recently been proposed. ABC allows, by construction, estimation of the discrepancies between the observed data and the model predictions, with respect to a comprehensive set of statistics. These statistics are not necessarily the same as those used in the acceptance criterion. The resulting discrepancy distributions have been used for selecting models that are in agreement with many aspects of the data simultaneously,<ref name="Ratmann" /> and model inconsistency is detected from conflicting and co-dependent summaries. Another quality-control-based method for model selection employs ABC to approximate the effective number of model parameters and the deviance of the posterior predictive distributions of summaries and parameters.<ref name="Francois" /> The deviance information criterion is then used as measure of model fit. It has also been shown that the models preferred based on this criterion can conflict with those supported by [[Bayes factor]]s. For this reason, it is useful to combine different methods for model selection to obtain correct conclusions.
 
Quality controls are achievable and indeed performed in many ABC-based works, but for certain problems, the assessment of the impact of the method-related parameters can be challenging. However, the rapidly increasing use of ABC can be expected to provide a more thorough understanding of the limitations and applicability of the method.
 
===General risks in statistical inference exacerbated in ABC===
This section reviews risks that are strictly speaking not specific to ABC, but also relevant for other statistical methods as well. However, the flexibility offered by ABC to analyze very complex models makes them highly relevant to discuss here.
 
====Prior distribution and parameter ranges====
The specification of the range and the prior distribution of parameters strongly benefits from previous knowledge about the properties of the system. One criticism has been that in some studies the “parameter ranges and distributions are only guessed based upon the subjective opinion of the investigators”,<ref name="Templeton2010" /> which is connected to classical objections of Bayesian approaches.<ref name="Beaumont2010b" />
 
With any computational method, it is typically necessary to constrain the investigated parameter ranges. The parameter ranges should if possible be defined based on known properties of the studied system, but may for practical applications necessitate an educated guess. However, theoretical results regarding [[Prior distribution#Uninformative priors|objective priors]] are available, which may for example be based on the [[principle of indifference]] or the [[principle of maximum entropy]].<ref name="Jaynes" /><ref name="Berger2006" /> On the other hand, automated or semi-automated methods for choosing a prior distribution often yield [[Prior probability#Improper priors|improper densities]]. As most ABC procedures require generating samples from the prior, improper priors are not directly applicable to ABC.
 
One should also keep the purpose of the analysis in mind when choosing the prior distribution. In principle, uninformative and flat priors, that exaggerate our subjective ignorance about the parameters, may still yield reasonable parameter estimates. However, Bayes factors are highly sensitive to the prior distribution of parameters. Conclusions on model choice based on Bayes factor can be misleading unless the sensitivity of conclusions to the choice of priors is carefully considered.
 
====Small number of models====
Model-based methods have been criticized for not exhaustively covering the hypothesis space.<ref name="Templeton2009a" /> Indeed, model-based studies often revolve around a small number of models, and due to the high computational cost to evaluate a single model in some instances, it may then be difficult to cover a large part of the hypothesis space.
 
An upper limit to the number of considered candidate models is typically set by the substantial effort required to define the models and to choose between many alternative options.<ref name="Bertorelle" /> There is no commonly accepted ABC-specific procedure for model construction, so experience and prior knowledge are used instead.<ref name="Csillery" /> Although more robust procedures for ''a priori'' model choice and formulation would be beneficial, there is no one-size-fits-all strategy for model development in statistics: sensible characterization of complex systems will always necessitate a great deal of detective work and use of expert knowledge from the problem ___domain.
 
Some opponents of ABC contend that since only few models—subjectively chosen and probably all wrong—can be realistically considered, ABC analyses provide only limited insight.<ref name="Templeton2009a" /> However, there is an important distinction between identifying a plausible null hypothesis and assessing the relative fit of alternative hypotheses.<ref name="Beaumont2010" /> Since useful null hypotheses, that potentially hold true, can extremely seldom be put forward in the context of complex models, predictive ability of statistical models as explanations of complex phenomena is far more important than the test of a statistical null hypothesis in this context. It is also common to average over the investigated models, weighted based on their relative plausibility, to infer model features (e.g., parameter values) and to make predictions.
 
====Large datasets====
Large data sets may constitute a computational bottleneck for model-based methods. It was, for example, pointed out that in some ABC-based analyses, part of the data have to be omitted.<ref name="Templeton2009a" /> A number of authors have argued that large data sets are not a practical limitation,<ref name="Bertorelle" /><ref name="Beaumont2010b" /> although the severity of this issue depends strongly on the characteristics of the models. Several aspects of a modeling problem can contribute to the computational complexity, such as the sample size, number of observed variables or features, time or spatial resolution, etc. However, with increasing computing power, this issue will potentially be less important.
 
Instead of sampling parameters for each simulation from the prior, it has been proposed alternatively to combine the [[Metropolis–Hastings algorithm|Metropolis-Hastings algorithm]] with ABC, which was reported to result in a higher acceptance rate than for plain ABC.<ref name="Marjoram" /> Naturally, such an approach inherits the general burdens of MCMC methods, such as the difficulty to assess convergence, correlation among the samples from the posterior,<ref name="Sisson" /> and relatively poor parallelizability.<ref name="Bertorelle" />
 
Likewise, the ideas of [[Particle filter|sequential Monte Carlo]] (SMC) and population Monte Carlo (PMC) methods have been adapted to the ABC setting.<ref name="Sisson" /><ref name="Beaumont2009" /> The general idea is to iteratively approach the posterior from the prior through a sequence of target distributions. An advantage of such methods, compared to ABC-MCMC, is that the samples from the resulting posterior are independent. In addition, with sequential methods the tolerance levels must not be specified prior to the analysis, but are adjusted adaptively.<ref name="DelMoral" />
 
It is relatively straightforward to parallelize a number of steps in ABC algorithms based on rejection sampling and [[Particle filter|sequential Monte Carlo]] methods. It has also been demonstrated that parallel algorithms may yield significant speedups for MCMC-based inference in phylogenetics,<ref name="Feng" /> which may be a tractable approach also for ABC-based methods. Yet an adequate model for a complex system is very likely to require intensive computation irrespectively of the chosen method of inference, and it is up to the user to select a method that is suitable for the particular application in question.
 
====Curse of dimensionality====
High-dimensional data sets and high-dimensional parameter spaces can require an extremely large number of parameter points to be simulated in ABC-based studies to obtain a reasonable level of accuracy for the posterior inferences. In such situations, the computational cost is severely increased and may in the worst case render the computational analysis intractable. These are examples of well-known phenomena, which are usually referred to with the umbrella term [[curse of dimensionality]].<ref name="Bellman" />
 
To assess how severely the dimensionality of a data set affects the analysis within the context of ABC, analytical formulas have been derived for the error of the ABC estimators as functions of the dimension of the summary statistics.<ref name="Blum10" /><ref name="Fearnhead12" /> In addition, Blum and François have investigated how the dimension of the summary statistics is related to the mean squared error for different correction adjustments to the error of ABC estimators. It was also argued that dimension reduction techniques are useful to avoid the curse-of-dimensionality, due to a potentially lower-dimensional underlying structure of summary statistics.<ref name="Blum10" /> Motivated by minimizing the quadratic loss of ABC estimators, Fearnhead and Prangle have proposed a scheme to project (possibly high-dimensional) data into estimates of the parameter posterior means; these means, now having the same dimension as the parameters, are then used as summary statistics for ABC.<ref name="Fearnhead12" />
 
ABC can be used to infer problems in high-dimensional parameter spaces, although one should account for the possibility of overfitting (e.g., see the model selection methods in <ref name="Ratmann" /> and <ref name="Francois" />). However, the probability of accepting the simulated values for the parameters under a given tolerance with the ABC rejection algorithm typically decreases exponentially with increasing dimensionality of the parameter space (due to the global acceptance criterion).<ref name="Csillery" /> Although no computational method (based on ABC or not) seems to be able to break the curse-of-dimensionality, methods have recently been developed to handle high-dimensional parameter spaces under certain assumptions (e.g., based on polynomial approximation on sparse grids,<ref name="Gerstner" /> which could potentially heavily reduce the simulation times for ABC). However, the applicability of such methods is problem dependent, and the difficulty of exploring parameter spaces should in general not be underestimated. For example, the introduction of deterministic global parameter estimation led to reports that the global optima obtained in several previous studies of low-dimensional problems were incorrect.<ref name="Singer" /> For certain problems, it might therefore be difficult to know whether the model is incorrect or, [[#Small number of models|as discussed above]], whether the explored region of the parameter space is inappropriate.<ref name="Templeton2009a" /> More pragmatic approaches are to cut the scope of the problem through model reduction,<ref name="Csillery" /> discretisation of variables and the use of canonical models such as noisy models. Noisy models exploit information on the conditional independence between variables.<ref>{{cite journal|last1= Cardenas |first1=IC|title= On the use of Bayesian networks as a meta-modeling approach to analyse uncertainties in slope stability analysis|journal =Georisk: Assessment and Management of Risk for Engineered Systems and Geohazards|date=2019|volume=13|issue=1|pages=53–65|doi=10.1080/17499518.2018.1498524|bibcode=2019GAMRE..13...53C |s2cid=216590427}}</ref>
 
==Software==
A number of software packages are currently available for application of ABC to particular classes of statistical models.
 
{| class="sortable wikitable" style="float: right; margin-left: 1em; text-align: center;" id="table3"
|+Software incorporating ABC
|-
! Software
! Keywords and features
! Reference
|-
| [http://github.com/icb-dcm/pyabc pyABC]
| Python framework for efficient distributed ABC-SMC (Sequential Monte Carlo).
| <ref name="Klinger2017" />
|-
| [[PyMC]]
| A Python package for Bayesian statistical modeling and probabilistic machine learning.
| <ref name="Salvatier2016" />
|-
| [http://www1.montpellier.inra.fr/CBGP/diyabc/ DIY-ABC]
| Software for fit of genetic data to complex situations. Comparison of competing models. Parameter estimation. Computation of bias and precision measures for a given model and known parameters values.
| <ref name="Cornuet08" />
|-
| [https://cran.r-project.org/web/packages/abc abc<br> R package]
| Several ABC algorithms for performing parameter estimation and model selection. Nonlinear heteroscedastic regression methods for ABC. Cross-validation tool.
| <ref name="Csillery12" /><ref>{{cite web |url= https://cran.r-project.org/web/packages/abc/vignettes/abcvignette.pdf |title=Approximate Bayesian Computation (ABC) in R: A Vignette |last1=Csillery |first1=K |last2=Francois |first2=O |last3=Blum |first3=MGB |date=2012-02-21| access-date=10 May 2013}}</ref>
|-
| [https://cran.r-project.org/web/packages/EasyABC/index.html EasyABC<br> R package]
| Several algorithms for performing efficient ABC sampling schemes, including 4 sequential sampling schemes and 3 MCMC schemes.
| <ref>{{cite journal |title=EasyABC: performing efficient approximate Bayesian computation sampling schemes using R. |last1=Jabot |first1=F |last2=Faure |first2=T |last3=Dumoulin |first3=N |doi=10.1111/2041-210X.12050 |volume=4 |issue = 7|journal=Methods in Ecology and Evolution |pages=684–687 |year=2013|bibcode=2013MEcEv...4..684J |doi-access=free }}</ref><ref>{{cite web |url=https://cran.r-project.org/web/packages/EasyABC/vignettes/EasyABC.pdf |title=EasyABC: a vignette |last1=Jabot |first1=F |last2=Faure |first2=T |last3=Dumoulin |first3=N |date=2013-06-03 |access-date=2016-07-19 |archive-date=2016-08-18 |archive-url=https://web.archive.org/web/20160818132912/https://cran.r-project.org/web/packages/EasyABC/vignettes/EasyABC.pdf |url-status=dead }}</ref>
|-
| [http://www.theosysbio.bio.ic.ac.uk/resources/abc-sysbio ABC-SysBio]
| Python package. Parameter inference and model selection for dynamical systems. Combines ABC rejection sampler, ABC SMC for parameter inference, and ABC SMC for model selection. Compatible with models written in Systems Biology Markup Language (SBML). Deterministic and stochastic models.
| <ref name="Liepe10" />
|-
| [https://archive.today/20130219204330/http://www.cmpg.iee.unibe.ch/content/softwares__services/computer_programs/abctoolbox ABCtoolbox]
| Open source programs for various ABC algorithms including rejection sampling, MCMC without likelihood, a particle-based sampler, and ABC-GLM. Compatibility with most simulation and summary statistics computation programs.
| <ref name="Wegmann2010" />
|-
| [https://msbayes.sourceforge.net/ msBayes]
| Open source software package consisting of several C and R programs that are run with a Perl "front-end". Hierarchical coalescent models. Population genetic data from multiple co-distributed species.
| <ref name="Hickerson07" />
|-
| [https://code.google.com/p/popabc/ PopABC]
| Software package for inference of the pattern of demographic divergence. Coalescent simulation. Bayesian model choice.
| <ref name="Lopes09" />
|-
| [http://genomics.jun.alaska.edu ONeSAMP]
| Web-based program to estimate the effective population size from a sample of microsatellite genotypes. Estimates of effective population size, together with 95% credible limits.
| <ref name="Tallmon08" />
|-
| [https://web.archive.org/web/20100429043632/http://www.etoology.net/index.php/software/genetics/112-abc4f.html ABC4F]
| Software for estimation of F-statistics for dominant data.
| <ref name="Foll08" />
|-
| [https://web.archive.org/web/20130425104634/http://compbio.igc.gulbenkian.pt/pcg/pcg_software.html#2BAD 2BAD]
| 2-event Bayesian ADmixture. Software allowing up to two independent admixture events with up to three parental populations. Estimation of several parameters (admixture, effective sizes, etc.). Comparison of pairs of admixture models.
| <ref name="Bray10" />
|-
| [https://elfi.readthedocs.io/en/latest/ ELFI]
| Engine for Likelihood-Free Inference. ELFI is a statistical software package written in Python for Approximate Bayesian Computation (ABC), also known e.g. as likelihood-free inference, simulator-based inference, approximative Bayesian inference etc.
| <ref name="Kangas16" />
|-
| [https://abcpy.readthedocs.io/en/latest/ ABCpy]
| Python package for ABC and other likelihood-free inference schemes. Several state-of-the-art algorithms available. Provides quick way to integrate existing generative (from C++, R etc.), user-friendly parallelization using MPI or Spark and summary statistics learning (with neural network or linear regression).
| <ref>{{cite journal |title=ABCpy: A High-Performance Computing Perspective to Approximate Bayesian Computation |last1=Dutta |first1=R |last2=Schoengens |first2=M |last3=Pacchiardi |first3=L |last4=Ummadisingu |first4=A |last5=Widmer |first5=N |last6=Onnela |first6=J. P. |last7=Mira |first7=A|journal=Journal of Statistical Software |author7-link=Antonietta Mira |year=2021|volume=100 |issue=7 |doi=10.18637/jss.v100.i07 |doi-access=free|arxiv=1711.04694 |s2cid=88516340 }}</ref>
|}
The suitability of individual software packages depends on the specific application at hand, the computer system environment, and the algorithms required.
 
==See also==
* [[Markov chain Monte Carlo]]
* [[Empirical Bayes]]
* [[Method of moments (statistics)]]
 
==References==
{{Academic peer reviewed|Q4781761|doi-access=free}}
 
{{reflist|35em|refs=
<ref name="Beaumont2010">{{cite journal | last1 = Beaumont | first1 = MA | year = 2010 | title = Approximate Bayesian Computation in Evolution and Ecology | journal = Annual Review of Ecology, Evolution, and Systematics | volume = 41 | pages = 379–406 | doi=10.1146/annurev-ecolsys-102209-144621}}</ref>
<ref name="Bharti">{{cite journal | last1 = Bharti | first1 = A | last2 = Briol | first2 = F.-X. | last3 = Pedersen | first3 = T | year = 2021 | title = A General Method for Calibrating Stochastic Radio Channel Models with Kernels | journal = IEEE Transactions on Antennas and Propagation | volume = 70 | issue = 6 | pages = 3986–4001 | doi=10.1109/TAP.2021.3083761| arxiv = 2012.09612 | s2cid = 233880538 }}</ref>
<ref name="Bertorelle">{{cite journal | last1 = Bertorelle | first1 = G | last2 = Benazzo | first2 = A | last3 = Mona | first3 = S | year = 2010 | title = ABC as a flexible framework to estimate demography over space and time: some cons, many pros | journal = Molecular Ecology | volume = 19 | issue = 13| pages = 2609–2625 | doi=10.1111/j.1365-294x.2010.04690.x| pmid = 20561199 | bibcode = 2010MolEc..19.2609B | s2cid = 12129604 | doi-access = free }}</ref>
<ref name="Csillery">{{cite journal | last1 = Csilléry | first1 = K | last2 = Blum | first2 = MGB | last3 = Gaggiotti | first3 = OE | last4 = François | first4 = O | year = 2010 | title = Approximate Bayesian Computation (ABC) in practice | journal = Trends in Ecology & Evolution | volume = 25 | issue = 7| pages = 410–418 | doi=10.1016/j.tree.2010.04.001| pmid = 20488578 | bibcode = 2010TEcoE..25..410C | s2cid = 13957079 }}</ref>
<ref name="Rubin">{{cite journal | last1 = Rubin | first1 = DB | year = 1984 | title = Bayesianly Justifiable and Relevant Frequency Calculations for the Applied Statistician | journal = The Annals of Statistics | volume = 12 | issue = 4| pages = 1151–1172 | doi=10.1214/aos/1176346785| doi-access = free }}</ref>
<ref name="Marjoram">{{cite journal | last1 = Marjoram | first1 = P | last2 = Molitor | first2 = J | last3 = Plagnol | first3 = V | last4 = Tavare | first4 = S | year = 2003 | title = Markov chain Monte Carlo without likelihoods | journal = Proc Natl Acad Sci U S A | volume = 100 | issue = 26| pages = 15324–15328 | doi=10.1073/pnas.0306899100| pmid = 14663152 | pmc = 307566 | bibcode = 2003PNAS..10015324M | doi-access = free }}</ref>
<ref name="Sisson">{{cite journal | last1 = Sisson | first1 = SA | last2 = Fan | first2 = Y | last3 = Tanaka | first3 = MM | year = 2007 | title = Sequential Monte Carlo without likelihoods | journal = Proc Natl Acad Sci U S A | volume = 104 | issue = 6| pages = 1760–1765 | doi=10.1073/pnas.0607208104| pmid = 17264216 | pmc = 1794282 | bibcode = 2007PNAS..104.1760S | doi-access = free }}</ref>
<ref name="Wegmann">{{cite journal | last1 = Wegmann | first1 = D | last2 = Leuenberger | first2 = C | last3 = Excoffier | first3 = L | year = 2009 | title = Efficient approximate Bayesian computation coupled with Markov chain Monte Carlo without likelihood | journal = Genetics | volume = 182 | issue = 4| pages = 1207–1218 | doi=10.1534/genetics.109.102509| pmid = 19506307 | pmc = 2728860 }}</ref>
<ref name="Templeton2008">{{cite journal | last1 = Templeton | first1 = AR | year = 2008 | title = Nested clade analysis: an extensively validated method for strong phylogeographic inference | journal = Molecular Ecology | volume = 17 | issue = 8| pages = 1877–1880 | doi=10.1111/j.1365-294x.2008.03731.x| pmid = 18346121 | pmc = 2746708| bibcode = 2008MolEc..17.1877T }}</ref>
<ref name="Templeton2009a">{{cite journal | last1 = Templeton | first1 = AR | year = 2009 | title = Statistical hypothesis testing in intraspecific phylogeography: nested clade phylogeographical analysis vs. approximate Bayesian computation | journal = Molecular Ecology | volume = 18 | issue = 2| pages = 319–331 | doi=10.1111/j.1365-294x.2008.04026.x| pmid = 19192182 | pmc = 2696056| bibcode = 2009MolEc..18..319T }}</ref>
<ref name="Templeton2009b">{{cite journal | last1 = Templeton | first1 = AR | year = 2009 | title = Why does a method that fails continue to be used? The answer | journal = Evolution | volume = 63 | issue = 4| pages = 807–812 | doi=10.1111/j.1558-5646.2008.00600.x| pmid = 19335340 | pmc = 2693665 }}</ref>
<ref name="Berger">{{cite journal | last1 = Berger | first1 = JO | last2 = Fienberg | first2 = SE | last3 = Raftery | first3 = AE | last4 = Robert | first4 = CP | year = 2010 | title = Incoherent phylogeographic inference | journal = Proceedings of the National Academy of Sciences of the United States of America | volume = 107 | issue = 41| pages = E157 | doi=10.1073/pnas.1008762107| pmid = 20870964 | bibcode = 2010PNAS..107E.157B | pmc = 2955098 | doi-access = free }}</ref>
<ref name="Didelot">{{cite journal | last1 = Didelot | first1 = X | last2 = Everitt | first2 = RG | last3 = Johansen | first3 = AM | last4 = Lawson | first4 = DJ | year = 2011 | title = Likelihood-free estimation of model evidence | journal = Bayesian Analysis | volume = 6 | pages = 49–76 | doi=10.1214/11-ba602| doi-access = free }}</ref>
<ref name="Robert">{{cite journal | last1 = Robert | first1 = CP | last2 = Cornuet | first2 = J-M | last3 = Marin | first3 = J-M | last4 = Pillai | first4 = NS | year = 2011 | title = Lack of confidence in approximate Bayesian computation model choice | journal = Proc Natl Acad Sci U S A | volume = 108 | issue = 37| pages = 15112–15117 | doi=10.1073/pnas.1102900108| pmid = 21876135 | pmc = 3174657 | bibcode = 2011PNAS..10815112R | doi-access = free }}</ref>
<ref name="Busetto2009a">Busetto A.G., Buhmann J. Stable Bayesian Parameter Estimation for Biological Dynamical Systems.; 2009. IEEE Computer Society Press pp. 148-157.</ref>
<!--<ref name="Busetto2009b">Busetto A, Ong C, Buhmann J. Optimized Expected Information Gain for Nonlinear Dynamical Systems. Int. Conf. Proc. Series; 2009. Association for Computing Machinery (ACM) pp. 97-104.</ref>-->
<!--<ref name="Jeffreys">Jeffreys H (1961) Theory of probability: Clarendon Press, Oxford.</ref>-->
<!--<ref name="Kass">{{cite journal | last1 = Kass | first1 = R | last2 = Raftery | first2 = AE | year = 1995 | title = Bayes factors | journal = Journal of the American Statistical Association | volume = 90 | pages = 773–795 | doi=10.1080/01621459.1995.10476572}}</ref>-->
<!--<ref name="Vyshemirsky">{{cite journal | last1 = Vyshemirsky | first1 = V | last2 = Girolami | first2 = MA | year = 2008 | title = Bayesian ranking of biochemical system models | journal = Bioinformatics | volume = 24 | pages = 833–839 | doi=10.1093/bioinformatics/btm607}}</ref>-->
<ref name="Arlot">{{cite journal | last1 = Arlot | first1 = S | last2 = Celisse | first2 = A | year = 2010 | title = A survey of cross-validation procedures for model selection | journal = Statistics Surveys | volume = 4 | pages = 40–79 | doi=10.1214/09-ss054| arxiv = 0907.4728 | s2cid = 14332192 }}</ref>
<ref name="Dawid">{{cite journal | last1 = Dawid | first1 = A | title = Present position and potential developments: Some personal views: Statistical theory: The prequential approach | journal = Journal of the Royal Statistical Society, Series A | volume = 1984 | pages = 278–292 }}</ref>
<ref name="Vehtari">{{cite journal | last1 = Vehtari | first1 = A | last2 = Lampinen | first2 = J | year = 2002 | title = Bayesian model assessment and comparison using cross-validation predictive densities | journal = Neural Computation | volume = 14 | issue = 10| pages = 2439–2468 | doi=10.1162/08997660260293292| pmid = 12396570 | citeseerx = 10.1.1.16.3206 | s2cid = 366285 }}</ref>
<ref name="Ratmann">{{cite journal | last1 = Ratmann | first1 = O | last2 = Andrieu | first2 = C | last3 = Wiuf | first3 = C | last4 = Richardson | first4 = S | year = 2009 | title = Model criticism based on likelihood-free inference, with an application to protein network evolution | journal = Proceedings of the National Academy of Sciences of the United States of America | volume = 106 | issue = 26| pages = 10576–10581 | doi=10.1073/pnas.0807882106| pmid = 19525398 | pmc = 2695753 | bibcode = 2009PNAS..10610576R | doi-access = free }}</ref>
<ref name="Francois">{{cite journal | last1 = Francois | first1 = O | last2 = Laval | first2 = G | year = 2011 | title = Deviance Information Criteria for Model Selection in Approximate Bayesian Computation | journal = Stat Appl Genet Mol Biol | volume = 10 | page = Article 33 | doi=10.2202/1544-6115.1678| arxiv = 1105.0269 | bibcode = 2011arXiv1105.0269F | s2cid = 11143942 }}</ref>
<ref name="Beaumont2009">{{cite journal | last1 = Beaumont | first1 = MA | last2 = Cornuet | first2 = J-M | last3 = Marin | first3 = J-M | last4 = Robert | first4 = CP | year = 2009 | title = Adaptive approximate Bayesian computation | journal = Biometrika | volume = 96 | issue = 4| pages = 983–990 | doi=10.1093/biomet/asp052| arxiv = 0805.2256 | s2cid = 16579245 }}</ref>
<ref name="DelMoral">Del Moral P, Doucet A, Jasra A (2011) An adaptive sequential Monte Carlo method for approximate Bayesian computation. Statistics and computing.</ref>
<ref name="Beaumont2002">{{cite journal | last1 = Beaumont | first1 = MA | last2 = Zhang | first2 = W | last3 = Balding | first3 = DJ | year = 2002 | title = Approximate Bayesian Computation in Population Genetics | journal = Genetics | volume = 162 | issue = 4| pages = 2025–2035 | doi = 10.1093/genetics/162.4.2025 | pmid = 12524368 | pmc = 1462356 }}</ref>
<ref name="Blum2010">{{cite journal | last1 = Blum | first1 = M | last2 = Francois | first2 = O | year = 2010 | title = Non-linear regression models for approximate Bayesian computation | journal = Stat Comp | volume = 20 | pages = 63–73 | doi=10.1007/s11222-009-9116-0| arxiv = 0809.4178 | s2cid = 2403203 }}</ref>
<ref name="Leuenberger2009">{{cite journal | last1 = Leuenberger | first1 = C | last2 = Wegmann | first2 = D | year = 2009 | title = Bayesian Computation and Model Selection Without Likelihoods | journal = Genetics | volume = 184 | issue = 1| pages = 243–252 | doi=10.1534/genetics.109.109058| pmid = 19786619 | pmc = 2815920 }}</ref>
<ref name="Beaumont2010b">{{cite journal | last1 = Beaumont | first1 = MA | last2 = Nielsen | first2 = R | last3 = Robert | first3 = C | last4 = Hey | first4 = J | last5 = Gaggiotti | first5 = O |display-authors=et al | year = 2010 | title = In defence of model-based inference in phylogeography | journal = Molecular Ecology | volume = 19 | issue = 3| pages = 436–446 | doi=10.1111/j.1365-294x.2009.04515.x| pmid = 29284924 | pmc = 5743441 | bibcode = 2010MolEc..19..436B }}</ref>
<!-- <ref name="Csillery2010">{{cite journal | last1 = Csilléry | first1 = K | last2 = Blum | first2 = MGB | last3 = Gaggiotti | first3 = OE | last4 = Francois | first4 = O | year = 2010 | title = Invalid arguments against ABC: Reply to AR Templeton | journal = Trends in Ecology & Evolution | volume = 25 | pages = 490–491 | doi=10.1016/j.tree.2010.06.011}}</ref> -->
<ref name="Templeton2010">{{cite journal | last1 = Templeton | first1 = AR | year = 2010 | title = Coherent and incoherent inference in phylogeography and human evolution | journal = Proceedings of the National Academy of Sciences of the United States of America | volume = 107 | issue = 14| pages = 6376–6381 | doi=10.1073/pnas.0910647107| pmid = 20308555 | pmc = 2851988 | bibcode = 2010PNAS..107.6376T| doi-access = free }}</ref>
<!--<ref name="Fagundes">{{cite journal | last1 = Fagundes | first1 = NJR | last2 = Ray | first2 = N | last3 = Beaumont | first3 = M | last4 = Neuenschwander | first4 = S | last5 = Salzano | first5 = FM |display-authors=et al | year = 2007 | title = Statistical evaluation of alternative models of human evolution | journal = Proceedings of the National Academy of Sciences of the United States of America | volume = 104 | pages = 17614–17619 | doi=10.1073/pnas.0708280104 | pmid=17978179 | pmc=2077041}}</ref>-->
<!-- <ref name="Gelfand">{{cite journal | last1 = Gelfand | first1 = AE | last2 = Dey | first2 = DK | year = 1994 | title = Bayesian model choice: Asymptotics and exact calculations | journal = J R Stat Soc Ser B | volume = 56 | pages = 501–514 }}</ref> -->
<!-- <ref name="Bernardo">Bernardo JM, Smith AFM (1994) Bayesian Theory: John Wiley.</ref> -->
<!-- <ref name="Box">Box G, Draper NR (1987) Empirical Model-Building and Response Surfaces: John Wiley and Sons, Oxford.</ref> -->
<!-- <ref name="Excoffier">{{cite journal | last1 = Excoffier | first1 = L | last2 = Foll | first2 = M | year = 2011 | title = fastsimcoal: a continuous-time coalescent simulator of genomic diversity under arbitrarily complex evolutionary scenarios | journal = Bioinformatics | volume = 27 | pages = 1332–1334 | doi=10.1093/bioinformatics/btr124 | pmid=21398675}}</ref> -->
<ref name="Wegmann2010">{{cite journal | last1 = Wegmann | first1 = D | last2 = Leuenberger | first2 = C | last3 = Neuenschwander | first3 = S | last4 = Excoffier | first4 = L | year = 2010 | title = ABCtoolbox: a versatile toolkit for approximate Bayesian computations | journal = BMC Bioinformatics | volume = 11 | page = 116 | doi=10.1186/1471-2105-11-116| pmid = 20202215 | pmc = 2848233 | doi-access = free }}</ref>
<!-- <ref name="Cornuet">{{cite journal | last1 = Cornuet | first1 = J-M | last2 = Santos | first2 = F | last3 = Beaumont | first3 = MA | last4 = Robert | first4 = CP | last5 = Marin | first5 = J-M |display-authors=et al | year = 2008 | title = Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation | journal = Bioinformatics | volume = 24 | pages = 2713–2719 | doi=10.1093/bioinformatics/btn514 | pmid=18842597 | pmc=2639274}}</ref> -->
<!-- <ref name="Templeton2010b">{{cite journal | last1 = Templeton | first1 = AR | year = 2010 | title = Coalescent-based, maximum likelihood inference in phylogeography | journal = Molecular Ecology | volume = 19 | pages = 431–435 | doi=10.1111/j.1365-294x.2009.04514.x}}</ref> -->
<ref name="Jaynes">Jaynes ET (1968) Prior Probabilities. IEEE Transactions on Systems Science and Cybernetics 4.</ref>
<ref name="Feng">{{cite journal | last1 = Feng | first1 = X | last2 = Buell | first2 = DA | last3 = Rose | first3 = JR | last4 = Waddellb | first4 = PJ | year = 2003 | title = Parallel Algorithms for Bayesian Phylogenetic Inference | journal = Journal of Parallel and Distributed Computing | volume = 63 | issue = 7–8| pages = 707–718 | doi=10.1016/s0743-7315(03)00079-0| citeseerx = 10.1.1.109.7764 }}</ref>
<ref name="Bellman">Bellman R (1961) Adaptive Control Processes: A Guided Tour: Princeton University Press.</ref>
<ref name="Gerstner">{{cite journal | last1 = Gerstner | first1 = T | last2 = Griebel | first2 = M | year = 2003 | title = Dimension-Adaptive Tensor-Product Quadrature | journal = Computing | volume = 71 | pages = 65–87 | doi=10.1007/s00607-003-0015-5| citeseerx = 10.1.1.16.2434 | s2cid = 16184111 }}</ref>
<ref name="Singer">{{cite journal | last1 = Singer | first1 = AB | last2 = Taylor | first2 = JW | last3 = Barton | first3 = PI | last4 = Green | first4 = WH | year = 2006 | title = Global dynamic optimization for parameter estimation in chemical kinetics | journal = J Phys Chem A | volume = 110 | issue = 3| pages = 971–976 | doi=10.1021/jp0548873| pmid = 16419997 | bibcode = 2006JPCA..110..971S }}</ref>
<ref name="Dean">{{cite arXiv | eprint=1103.5399 | last1=Dean | first1=Thomas A. | last2=Singh | first2=Sumeetpal S. | last3=Jasra | first3=Ajay | last4=Peters | first4=Gareth W. | title=Parameter Estimation for Hidden Markov Models with Intractable Likelihoods | date=2011 | class=math.ST }}</ref>
<ref name="Fearnhead">{{cite arXiv | eprint=1004.1112 | last1=Fearnhead | first1=Paul | last2=Prangle | first2=Dennis | title=Constructing Summary Statistics for Approximate Bayesian Computation: Semi-automatic ABC | date=2010 | class=stat.ME }}</ref>
<ref name="Wilkinson">{{cite journal | arxiv=0811.3355 | doi=10.1515/sagmb-2013-0010 | title=Approximate Bayesian computation (ABC) gives exact results under the assumption of model error | date=2013 | last1=Wilkinson | first1=Richard David | journal=Statistical Applications in Genetics and Molecular Biology | volume=12 | issue=2 | pmid=23652634 }}</ref>
<ref name="Nunes">{{cite journal | last1 = Nunes | first1 = MA | last2 = Balding | first2 = DJ | year = 2010 | title = On optimal selection of summary statistics for approximate Bayesian computation | journal = Stat Appl Genet Mol Biol | volume = 9 | page = Article 34 | doi=10.2202/1544-6115.1576| pmid = 20887273 | s2cid = 207319754 }}</ref>
<ref name="Joyce">{{cite journal | last1 = Joyce | first1 = P | last2 = Marjoram | first2 = P | year = 2008 | title = Approximately sufficient statistics and bayesian computation | journal = Stat Appl Genet Mol Biol | volume = 7 | issue = 1| page = Article 26 | doi=10.2202/1544-6115.1389| pmid = 18764775 | s2cid = 38232110 }}</ref>
<ref name="Grelaud">{{cite journal | last1 = Grelaud | first1 = A | last2 = Marin | first2 = J-M | last3 = Robert | first3 = C | last4 = Rodolphe | first4 = F | last5 = Tally | first5 = F | year = 2009 | title = Likelihood-free methods for model choice in Gibbs random fields | journal = Bayesian Analysis | volume = 3 | pages = 427–442 }}</ref>
<ref name="Marin">{{cite arXiv | eprint=1110.4700 | last1=Marin | first1=J. -M. | last2=Pillai | first2=N. | last3=Robert | first3=C. P. | last4=Rousseau | first4=J. | title=Relevant statistics for Bayesian model choice | date=2011 | class=math.ST }}</ref>
<ref name="Toni">{{cite journal | last1 = Toni | first1 = T | last2 = Welch | first2 = D | last3 = Strelkowa | first3 = N | last4 = Ipsen | first4 = A | last5 = Stumpf | first5 = M | year = 2007 | title = Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems | journal = J R Soc Interface | volume = 6 | issue = 31| pages = 187–202 | pmid = 19205079 | pmc = 2658655 | doi = 10.1098/rsif.2008.0172 }}</ref>
<ref name="Tavare">{{cite journal | last1 = Tavaré | first1 = S | last2 = Balding | first2 = DJ | last3 = Griffiths | first3 = RC | last4 = Donnelly | first4 = P | year = 1997 | title = Inferring Coalescence Times from DNA Sequence Data | journal = Genetics | volume = 145 | issue = 2 | pages = 505–518 | doi = 10.1093/genetics/145.2.505 | pmc = 1207814 | pmid=9071603}}</ref>
<ref name="Toni2010">{{cite journal | doi=10.1093/bioinformatics/btp619 | title=Simulation-based model selection for dynamical systems in systems and population biology | date=2010 | last1=Toni | first1=Tina | last2=Stumpf | first2=Michael P. H. | journal=Bioinformatics | volume=26 | issue=1 | pages=104–110 | pmid=19880371 | pmc=2796821 | arxiv=0911.1705 }}</ref>
.<ref name="Pritchard1999">{{cite journal | last1 = Pritchard | first1 = JK | last2 = Seielstad | first2 = MT | last3 = Perez-Lezaun | first3 = A |display-authors=et al | year = 1999 | title = Population Growth of Human Y Chromosomes: A Study of Y Chromosome Microsatellites | journal = Molecular Biology and Evolution | volume = 16 | issue = 12| pages = 1791–1798 | doi=10.1093/oxfordjournals.molbev.a026091| pmid = 10605120 | doi-access = free }}</ref>
<ref name="Diggle">{{cite journal | last1 = Diggle | first1 = PJ | year = 1984 | title = Monte Carlo Methods of Inference for Implicit Statistical Models | journal = Journal of the Royal Statistical Society, Series B | volume = 46 | issue = 2 | pages = 193–227 | doi = 10.1111/j.2517-6161.1984.tb01290.x }}</ref>
<ref name="Hoel71">{{cite journal | last1 = Hoel | first1 = DG | last2 = Mitchell | first2 = TJ | year = 1971 | title = The simulation, fitting and testing of a stochastic cellular proliferation model | journal = Biometrics | volume = 27 | issue = 1| pages = 191–199 | doi=10.2307/2528937| jstor = 2528937 | pmid = 4926451 }}</ref>
<ref name="Lai">{{cite journal | last1 = Lai | first1 = K | last2 = Robertson | first2 = MJ | last3 = Schaffer | first3 = DV | year = 2004 | title = The sonic hedgehog signaling system as a bistable genetic switch | journal = Biophys. J. | volume = 86 | issue = 5| pages = 2748–2757 | doi=10.1016/s0006-3495(04)74328-3 | pmid = 15111393 | bibcode=2004BpJ....86.2748L | pmc=1304145}}</ref>
<ref name="Bartlett63">{{cite journal | last1 = Bartlett | first1 = MS | year = 1963 | title = The spectral analysis of point processes | journal = Journal of the Royal Statistical Society, Series B | volume = 25 | issue = 2 | pages = 264–296 | doi = 10.1111/j.2517-6161.1963.tb00508.x }}</ref>
<ref name="Blum12">{{cite journal | arxiv=1202.3819 | doi=10.1214/12-STS406 | title=A Comparative Review of Dimension Reduction Methods in Approximate Bayesian Computation | date=2013 | last1=Blum | first1=M. G. B. | last2=Nunes | first2=M. A. | last3=Prangle | first3=D. | last4=Sisson | first4=S. A. | journal=Statistical Science | volume=28 | issue=2 }}</ref>
<ref name="Fearnhead12">{{cite journal | last1 = Fearnhead | first1 = P | last2 = Prangle | first2 = D | year = 2012 | title = Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation | journal = Journal of the Royal Statistical Society, Series B | volume = 74 | issue = 3| pages = 419–474 | doi=10.1111/j.1467-9868.2011.01010.x| citeseerx = 10.1.1.760.7753 | s2cid = 53861241 }}</ref>
<ref name="Blum10">Blum MGB (2010) Approximate Bayesian Computation: a nonparametric perspective, ''Journal of the American Statistical Association'' (105): 1178-1187</ref>
<!--<ref name="Marin11">Jean-Michel Marin, Pierre Pudlo, Christian P. Robert and Robin J. Ryder (2011) Approximate Bayesian computational methods, Statistics and Computing</ref>-->
<ref name="Cornuet08">{{cite journal | last1 = Cornuet | first1 = J-M | last2 = Santos | first2 = F | last3 = Beaumont | first3 = M |display-authors=et al | year = 2008 | title = Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation | journal = Bioinformatics | volume = 24 | issue = 23| pages = 2713–2719 | doi=10.1093/bioinformatics/btn514| pmid = 18842597 | pmc = 2639274 }}</ref>
<ref name="Csillery12">{{cite journal | last1 = Csilléry | first1 = K | last2 = François | first2 = O | last3 = Blum | first3 = MGB | year = 2012 | title = abc: an R package for approximate Bayesian computation (ABC) | journal = Methods in Ecology and Evolution | volume = 3 | issue = 3| pages = 475–479 | doi=10.1111/j.2041-210x.2011.00179.x| arxiv = 1106.2793 | bibcode = 2012MEcEv...3..475C | s2cid = 16679366 }}</ref>
<ref name="Liepe10">{{cite journal | last1 = Liepe | first1 = J | last2 = Barnes | first2 = C | last3 = Cule | first3 = E | last4 = Erguler | first4 = K | last5 = Kirk | first5 = P | last6 = Toni | first6 = T | last7 = Stumpf | first7 = MP | year = 2010 | title = ABC-SysBio—approximate Bayesian computation in Python with GPU support | journal = Bioinformatics | volume = 26 | issue = 14| pages = 1797–1799 | doi=10.1093/bioinformatics/btq278| pmid = 20591907 | pmc = 2894518 }}</ref>
<ref name="Hickerson07">{{cite journal | last1 = Hickerson | first1 = MJ | last2 = Stahl | first2 = E | last3 = Takebayashi | first3 = N | year = 2007 | title = msBayes: Pipeline for testing comparative phylogeographic histories using hierarchical approximate Bayesian computation | journal = BMC Bioinformatics | volume = 8 | issue = 268| pages = 1471–2105 | doi=10.1186/1471-2105-8-268| pmid = 17655753 | pmc = 1949838 | doi-access = free }}</ref>
<ref name="Lopes09">{{cite journal | last1 = Lopes | first1 = JS | last2 = Balding | first2 = D | last3 = Beaumont | first3 = MA | year = 2009 | title = PopABC: a program to infer historical demographic parameters | journal = Bioinformatics | volume = 25 | issue = 20| pages = 2747–2749 | doi=10.1093/bioinformatics/btp487| pmid = 19679678 | doi-access = }}</ref>
<ref name="Tallmon08">{{cite journal | last1 = Tallmon | first1 = DA | last2 = Koyuk | first2 = A | last3 = Luikart | first3 = G | last4 = Beaumont | first4 = MA | year = 2008 | title = COMPUTER PROGRAMS: onesamp: a program to estimate effective population size using approximate Bayesian computation | journal = Molecular Ecology Resources | volume = 8 | issue = 2| pages = 299–301 | doi=10.1111/j.1471-8286.2007.01997.x| pmid = 21585773 | s2cid = 9848290 }}</ref>
<ref name="Foll08">{{cite journal | last1 = Foll | first1 = M | last2 = Baumont | first2 = MA | last3 = Gaggiotti | first3 = OE | year = 2008 | title = An Approximate Bayesian Computation approach to overcome biases that arise when using AFLP markers to study population structure | journal = Genetics | volume = 179 | issue = 2| pages = 927–939 | doi=10.1534/genetics.107.084541| pmid = 18505879 | pmc = 2429886 }}</ref>
<ref name="Bray10">{{cite journal | last1 = Bray | first1 = TC | last2 = Sousa | first2 = VC | last3 = Parreira | first3 = B | last4 = Bruford | first4 = MW | last5 = Chikhi | first5 = L | year = 2010 | title = 2BAD: an application to estimate the parental contributions during two independent admisture events | journal = Molecular Ecology Resources | volume = 10 | issue = 3| pages = 538–541 | doi=10.1111/j.1755-0998.2009.02766.x| pmid = 21565053 | hdl = 10400.7/205 | s2cid = 6528668 | hdl-access = free }}</ref>
<ref name="Marin11">{{cite journal | last1 = Marin | first1 = JM | last2 = Pudlo | first2 = P | last3 = Robert | first3 = CP | last4 = Ryder | first4 = RJ | year = 2012 | title = Approximate Bayesian computational methods | journal = Statistics and Computing | volume = 22 | issue = 6| pages = 1167–1180 | doi=10.1007/s11222-011-9288-2| arxiv = 1101.0955 | s2cid = 40304979 }}</ref>
<ref name="Wilkinson2007">Wilkinson, R. G. (2007). Bayesian Estimation of Primate Divergence Times, Ph.D. thesis, University of Cambridge.</ref>
<ref name="Berger2006">{{cite journal |last= Berger |first= J.O. |year= 2006 |title= The case for objective Bayesian analysis |journal= Bayesian Analysis |volume= 1 |issue= pages 385–402 and 457–464 |pages= 385–402 |doi= 10.1214/06-BA115 |doi-access= free }}</ref>
<ref name="Kangas16">{{cite journal |last1= Kangasrääsiö |first1= Antti |last2= Lintusaari |first2= Jarno |last3= Skytén |first3= Kusti |last4= Järvenpää |first4= Marko |last5= Vuollekoski |first5= Henri |last6= Gutmann |first6= Michael |last7= Vehtari |first7= Aki |last8= Corander |first8= Jukka |last9= Kaski |first9= Samuel|year= 2016 |title= ELFI: Engine for Likelihood-Free Inference |url=http://approximateinference.org/accepted/KangasraasioEtAl2016.pdf |journal= NIPS 2016 Workshop on Advances in Approximate Bayesian Inference|bibcode= 2017arXiv170800707L |arxiv= 1708.00707 }}</ref>
<ref name="Klinger2017">Klinger, E.; Rickert, D.; Hasenauer, J. (2017). pyABC: distributed, likelihood-free inference.</ref>
<ref name="Salvatier2016">{{cite journal | doi=10.7717/peerj-cs.55 | doi-access=free | title=Probabilistic programming in Python using PyMC3 | date=2016 | last1=Salvatier | first1=John | last2=Wiecki | first2=Thomas V. | last3=Fonnesbeck | first3=Christopher | journal=PeerJ Computer Science | volume=2 | pages=e55 | arxiv=1507.08050 }}</ref>
<ref name="Prangle">{{cite journal | doi=10.1515/sagmb-2013-0012 | title=Semi-automatic selection of summary statistics for ABC model choice| date=2014 | last1=Prangle | first1=Dennis | last2=Fearnhead | first2=Paul | last3=Cox | first3=Murray P. | last4=Biggs | first4=Patrick J. | last5=French | first5=Nigel P. | journal=Stat Appl Genet Mol Biol | volume=13| issue=1| pages=67–82 | pmid=24323893| arxiv=1302.5624 }}</ref>
}}
 
==External links==
* {{cite web |url= http://darrenjw.wordpress.com/2013/03/31/introduction-to-approximate-bayesian-computation-abc/| title=Introduction to Approximate Bayesian Computation| author=Darren Wilkinson| date=March 31, 2013|access-date=2013-03-31}}
* {{cite web |url= http://www.sumsar.net/blog/2014/10/tiny-data-and-the-socks-of-karl-broman/| title=Tiny Data, Approximate Bayesian Computation and the Socks of Karl Broman| author= Rasmus Bååth| date=October 20, 2014|access-date=2015-01-22}}
 
{{DEFAULTSORT:Approximate Bayesian Computation}}
[[Category:Bayesian statistics]]
[[Category:Statistical approximations]]