Revision as of 03:08, 22 November 2021 edit Closed Limelike Curves (talk \| contribs) Extended confirmed users, Pending changes reviewers 8,350 edits →Intuition ← Previous edit		Revision as of 21:43, 2 March 2022 edit undo TedPavlic (talk \| contribs) Extended confirmed users, Pending changes reviewers 3,736 edits →Intuition: Fixed notes about original Metropolis et al. (1953) paper that were incorrect. Boltzmann distribution was NOT the proposal distribution in the original Metropolis algorithm. The Metropolis algorithm produced samples from the Boltzmann distribution; its suggested (but arbitrary) proposal distribution was uniform. Next edit →
Line 27: Let <math>f(x)</math> be a function that is proportional to the desired probability density function <math>P(x)</math> (a.k.a. a target distribution). # Initialization: Choose an arbitrary point <math>x_t</math> to be the first observation in the sample and choose an arbitrary probability density <math>g(x\mid y)</math> (sometimes written <math>Q(x\mid y)</math>) that suggests a candidate for the next sample value <math>x</math>, given the previous sample value <math>y</math>. In this section, <math>g</math> is assumed to be symmetric; in other words, it must satisfy <math>g(x\mid y) = g(y\mid x)</math>. A usual choice is to let <math>g(x\mid y)</math> be a [[Gaussian distribution]] centered at <math>y</math>, so that points closer to <math>y</math> are more likely to be visited next, making the sequence of samples into a [[random walk]] {{efn\|In the original paper ~~however~~by Metropolis et al. (1953), <math>g(x\mid y)</math> was ~~actually~~suggested ~~the~~to ~~[[Boltzmann~~be ~~distribution]],~~a asrandom ittranslation ~~was~~with ~~applied~~uniform ~~to physical systems in the~~density ~~context~~over ofsome ~~[[statistical~~prescribed ~~mechanics]]~~range.}}. The function <math>g</math> is referred to as the ''proposal density'' or ''jumping distribution''. # For each iteration ''t'': #* ''Generate'' a candidate <math>x'</math> for the next sample by picking from the distribution <math>g(x'\mid x_t)</math>. #* ''Calculate'' the ''acceptance ratio'' <math>\alpha = f(x')/f(x_t)</math>, which will be used to decide whether to accept or reject the candidate{{efn\|In the original paper by Metropolis et al. (1953), <math>f</math> was actually the [[Boltzmann distribution]], as it was applied to physical systems in the context of [[statistical mechanics]] (e.g., a maximal-entropy distribution of microstates for a given temperature at thermal equilibrium). Consequently, the acceptance ratio was itself an exponential of the difference in the parameters of the numerator and denominator of this ratio.}}. Because ''f'' is proportional to the density of ''P'', we have that <math>\alpha = f(x')/f(x_t) = P(x')/P(x_t)</math>. #* ''Accept or reject'': #** Generate a uniform random number <math>u \in [0, 1]</math>.

Metropolis–Hastings algorithm: Difference between revisions