Revision as of 02:02, 27 February 2025 edit Cosmia Nebula (talk \| contribs) Extended confirmed users 11,305 edits →Their equivalence Tag: Visual edit ← Previous edit		Revision as of 12:18, 12 March 2025 edit undo Gruber76 (talk \| contribs) Extended confirmed users 669 edits m →Non-equilibrium thermodynamics: Word choice ("learn" meaning to teach is an archaic use of the word. The paper refers to what they do with models as "train") Next edit →
Line 14: === Non-equilibrium thermodynamics === Diffusion models were introduced in 2015 as a method to ~~learn~~train a model that can sample from a highly complex probability distribution. They used techniques from [[non-equilibrium thermodynamics]], especially [[diffusion]].<ref>{{Cite journal \|last1=Sohl-Dickstein \|first1=Jascha \|last2=Weiss \|first2=Eric \|last3=Maheswaranathan \|first3=Niru \|last4=Ganguli \|first4=Surya \|date=2015-06-01 \|title=Deep Unsupervised Learning using Nonequilibrium Thermodynamics \|url=http://proceedings.mlr.press/v37/sohl-dickstein15.pdf \|journal=Proceedings of the 32nd International Conference on Machine Learning \|language=en \|publisher=PMLR \|volume=37 \|pages=2256–2265\|arxiv=1503.03585 }}</ref> Consider, for example, how one might model the distribution of all naturally-occurring photos. Each image is a point in the space of all images, and the distribution of naturally-occurring photos is a "cloud" in space, which, by repeatedly adding noise to the images, diffuses out to the rest of the image space, until the cloud becomes all but indistinguishable from a [[Normal distribution\|Gaussian distribution]] <math>\mathcal{N}(0, I)</math>. A model that can approximately undo the diffusion can then be used to sample from the original distribution. This is studied in "non-equilibrium" thermodynamics, as the starting distribution is not in equilibrium, unlike the final distribution.

Diffusion model: Difference between revisions