{{Short description|Deep learning algorithm}}{{About|the technique in generative statistical modeling|3=Diffusion (disambiguation)}}{{Machine learning|Artificial neural network}}
In [[machine learning]], '''diffusion models''', also known as '''diffusion-based generative models''' or '''score-based generative models''', are a class of [[latent variable model|latent variable]] [[generative model|generative]] models. A diffusion model consists of two major components: the forward diffusion process and the reverse sampling process. The goal of diffusion models is to learn a [[diffusion process]] for a given dataset, such that the process can generate new elements that are distributed similarly to the original dataset. A diffusion model models data as generated by a diffusion process, whereby a new datum performs a [[Wiener process|random walk with drift]] through the space of all possible data.<ref name="song"/> A trained diffusion model can be sampled in many ways, with different efficiency and quality. Sampling is generally achieved by numerically solving a deterministic or stochastic differential equation.<ref name="song"/> In particular, the sampling trajectory exhibits strong regularity under deterministic sampling.<ref name="chen">{{cite arXiv |last1=Chen |first1=Defang |last2=Zhou |first2=Zhenyu |last3=Wang |first3=Can |last4=Shen |first4=Chunhua |last5=Lyu |first5=Siwei |title=On the Trajectory Regularity of ODE-based Diffusion Sampling |date=2024 |class=cs.LG |eprint=2405.11326}}</ref>
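The forward diffusion process described above can be sketched as follows. This is a minimal illustration, not part of the cited sources: it uses the closed-form noising step of the denoising diffusion probabilistic model (DDPM) formulation, where the noised datum at step ''t'' is a weighted combination of the original datum and Gaussian noise. The linear noise schedule and step count here are illustrative choices.

```python
import numpy as np

def forward_diffuse(x0, t, betas, rng):
    """Sample x_t from q(x_t | x_0) in closed form (DDPM forward process).

    x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps,
    with eps drawn from a standard Gaussian.
    """
    alphas = 1.0 - betas
    alpha_bar = np.cumprod(alphas)[t]      # cumulative product up to step t
    eps = rng.standard_normal(x0.shape)    # Gaussian noise
    return np.sqrt(alpha_bar) * x0 + np.sqrt(1.0 - alpha_bar) * eps

rng = np.random.default_rng(0)
betas = np.linspace(1e-4, 0.02, 1000)      # illustrative linear noise schedule
x0 = rng.standard_normal(8)                # a toy "datum" in 8 dimensions
xT = forward_diffuse(x0, 999, betas, rng)  # at the final step, x_T is nearly
                                           # isotropic Gaussian noise
```

After many steps the cumulative product of the alphas is close to zero, so almost all information about the original datum is destroyed; the reverse sampling process is trained to undo this noising step by step.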
There are various equivalent formalisms, including [[Markov chain]]s, denoising diffusion probabilistic models, noise conditioned score networks, and stochastic differential equations.<ref>{{cite journal |last1=Croitoru |first1=Florinel-Alin |last2=Hondru |first2=Vlad |last3=Ionescu |first3=Radu Tudor |last4=Shah |first4=Mubarak |date=2023 |title=Diffusion Models in Vision: A Survey |journal=IEEE Transactions on Pattern Analysis and Machine Intelligence |volume=45 |issue=9 |pages=10850–10869 |arxiv=2209.04747 |doi=10.1109/TPAMI.2023.3261988 |pmid=37030794 |s2cid=252199918}}</ref> They are typically trained using [[Variational Bayesian methods|variational inference]].<ref name="ho" /> The model responsible for denoising is typically called its "[[#Choice of architecture|backbone]]". The backbone may be of any kind, but it is typically a [[U-Net|U-net]] or [[Transformer (deep learning architecture)|transformer]].