Content deleted Content added
No edit summary |
m deprecated parameters cleanup(5) format cite template dates (4), combine day, month & year, rm dupe cite field(s), using Project:AWB |
||
Line 1:
'''Approximate Bayesian computation (ABC)''' is a family of computational techniques in [[Bayesian statistics]]. These simulation techniques operate on summary data (such as population mean, or variance) to make broad inferences with less computation than might be required if all available data were analyzed in detail. They are especially useful in situations where evaluation of the likelihood is computationally prohibitive, or whenever suitable likelihoods are not available.
ABC methods originated in population and evolutionary genetics <ref name=Pritchard1999>{{cite journal|last = Pritchard|first = J. K.|coauthors = Seielstad, M. T., Perez-Lezaun, A., and Feldman, M. T.|title = Population Growth of Human Y Chromosomes: A Study of Y Chromosome Microsatellites|journal = Mol. Biol. Evol.|volume = 16|
==Overview==
Line 9:
:<math>P(\theta|D)\propto P(D|\theta) \pi(\theta)</math>
where <math>\theta</math> are the parameters of a probability model, <math>D</math> are the observed data, and <math>\pi(\theta)</math> is the [[prior distribution]] of the parameters <math>\theta</math>. <math>P(D|\theta)</math> is the [[likelihood]] of <math>\theta</math>, that is the probability of observing the data <math>D</math> given the model with parameter <math>\theta</math>.
The explicit evaluation of the likelihood <math>P(D|\theta)</math> is avoided in ABC approaches by considering distances between observed and data simulated from a model with parameter <math>\theta</math>. For sufficiently complex models and large data sets the probability of happening upon a simulation run that yields precisely the same dataset as the one observed will be very small, often unacceptably so. So rather than considering the data we consider a summary statistic of the data, <math>S(D)</math>, and use a distance <math>\Delta(S(D),S(X))</math> between the summary statistics of real and simulated data, <math>D</math> and <math>X</math>, respectively.
Line 21:
For <math>\epsilon</math> sufficiently small the ABC procedure should deliver a good approximation to the true posterior, in particular if the summary statistic <math>S</math> is a [[sufficient statistic]] of the probability model. If sufficient statistics do not exist or are hard to come by, setting up a satisfying and efficient ABC approach can be challenging.
The generic procedure outlined above can be computationally inefficient but ABC and likelihood-free inferential procedures can be combined with the standard computational approaches used in [[Bayesian inference]] such as [[Markov chain Monte Carlo]] <ref name=Marjoram>{{cite journal|last = Marjoram|first = P.|coauthors = Molitor, J., Plagnol, V. and Tavaré, S.|title = Markov chain Monte Carlo without likelihoods|journal = P Natl Acad Sci USA|volume = 100|number = 26|
While ABC and related likelihood-free methods have overwhelmingly be employed for parameter estimation, they can also be used for [[model selection]], as the whole apparatus of Bayesian model selection can be adapted to the ABC framework <ref name= Toni2009b>{{cite journal |author = Toni, T.; Stumpf, M.P.H. |year = 2009 |title = Simulation-based model selection for dynamical systems in systems and population biology | journal = Bioinformatics |volume = (in press) |doi = doi:10.1093/bioinformatics/btp619 |url=http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btp619}}</ref>.
An increasing number of software implementations of ABC approaches exist <ref name=Cornuet>{{cite journal|last = Cornuet|first = J-M.|coauthors = Santos, F., Beaumont, M. A., Robert, C. P., Marin, J-M., [[David Balding|Balding, D. J.]], Guillemaud, T. and Estoup, A.|title = Inferring population history with DIY ABC: a user-friendly approach to Approximate Bayesian Computation|journal = Bioinformatics|year = 2008|url = http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn514|pmid = 18842597|doi = 10.1093/bioinformatics/btn514|volume = 24|pages = 2713}}
==See also==
Line 39:
[http://abc-sysbio.sourceforge.net/ ABC SysBio] : A Tool for parameter inference and model selection in systems biology.
[[Category:Bayesian statistics]]
|