Content deleted Content added
m link inserted |
m →Formal definition: Add hyperlink to sigma-algebra page |
||
(16 intermediate revisions by 10 users not shown) | |||
Line 1:
{{Short description|Concept in probability theory}}
In [[probability theory]], a '''Markov kernel''' (also known as a '''stochastic kernel''' or '''probability kernel''') is a map that in the general theory of [[Markov process]]es plays the role that the [[Stochastic matrix|transition matrix]] does in the theory of Markov processes with a [[finite set|finite]] [[state space]].<ref>{{Cite book | last1 = Reiss | first1 = R. D. | title = A Course on Point Processes | doi = 10.1007/978-1-4613-9308-5 | series = Springer Series in Statistics | year = 1993 | isbn = 978-1-4613-9310-8 }}</ref>
== Formal definition ==
Let <math>(X,\mathcal A)</math> and <math>(Y,\mathcal B)</math> be [[measurable space]]s. A ''Markov kernel'' with source <math>(X,\mathcal A)</math> and target <math>(Y,\mathcal B)</math>, sometimes written as <math>\kappa:(X,\mathcal{A})\to(Y,\mathcal{B})</math>, is a
# For every (fixed) <math>
# For every (fixed) <math>
In other words it associates to each point <math>x \in X</math> a [[probability measure]] <math>\kappa(dy|x): B \mapsto \kappa(B, x)</math> on <math>(Y,\mathcal B)</math> such that, for every measurable set <math>B\in\mathcal B</math>, the map <math>x\mapsto \kappa(B, x)</math> is measurable with respect to the [[Σ-algebra|<math>\sigma</math>-algebra <math>\mathcal A</math>]].<ref>{{cite book |last1=Klenke |first1=Achim |title=Probability Theory: A Comprehensive Course|series=Universitext |year=2014 |publisher=Springer|page=180|edition=2|doi=10.1007/978-1-4471-5361-0|isbn=978-1-4471-5360-3 }}</ref>
== Examples ==
===[[Simple random walk]] on the integers ===
Take <math>X=Y=\Z</math>, and <math> \mathcal A = \mathcal B = \mathcal P(\Z)</math> (the [[power set]] of <math>\Z</math>). Then a Markov kernel is fully determined by the probability it assigns to
:<math>\kappa(B|n )=\sum_{m \in B}\kappa(\{m\}|n), \qquad \forall n \in \mathbb{Z}, \, \forall B \in \mathcal B</math>.
Now the random walk <math>\kappa</math> that goes to the right with probability <math>p</math> and to the left with probability <math>1 - p</math> is defined by
Line 43 ⟶ 44:
Note that the indicator function <math>\mathbf{1}_{f^{-1}(B)}</math> is <math>\mathcal{A}</math>-measurable for all <math>B \in \mathcal{B}</math> iff <math>f</math> is measurable.
This example allows us to think of a Markov kernel as a generalised function with a (in general) random rather than certain value. That is, it is a [[multivalued function]] where the values are not equally weighted.
===[[Galton–Watson process]]===
As a less obvious example, take <math>X = \N, \mathcal A = \mathcal P(\N)</math>, and <math>(Y, \mathcal B)</math> the real numbers <math>\R</math> with the standard sigma algebra of [[Borel set]]s. Then
:<math>\kappa(B|n)=\begin{cases} \mathbf{1}_B(0) & n=0\\ \Pr(\xi_1 + \cdots + \xi_x \in B) & n \neq 0 \\ \end{cases} </math>
== Composition of Markov Kernels
Given measurable spaces <math>(X, \mathcal A)</math>, <math>(Y, \mathcal B) </math> we consider a Markov kernel <math>\kappa: \mathcal B \times X \to [0,1]</math> as a morphism <math>\kappa: X \to Y</math>. Intuitively, rather than assigning to each <math>x \in X</math> a sharply defined point <math> y \in Y</math> the kernel assigns a "fuzzy" point in <math>Y</math> which is only known with some level of uncertainty, much like actual physical measurements. If we have a third measurable space <math>(Z, \mathcal C)</math>, and probability kernels <math>\kappa: X \to Y</math> and <math>\lambda: Y \to Z</math>, we can define a composition <math>\lambda \circ \kappa : X \to Z</math> by the [[Chapman-Kolmogorov equation]]
:<math>(\lambda \circ \kappa) (dz|x) = \int_Y \lambda(dz | y)\kappa(dy|x)</math>.
The composition is associative by
This composition defines the structure of a [[category (mathematics)|category]] on the measurable spaces with Markov kernels as morphisms, first defined by Lawvere
== Probability Space defined by Probability Distribution and a Markov Kernel==
A
:<math> P_Y(B) = \int_X \int_B \kappa(dy|x) P_X(dx) = \int_X \kappa(B|x)P_X(dx) = \mathbb{E}_{P_X}\kappa(B|\cdot)
▲:<math> P_Y(B) = \int_X \int_B \kappa(dy|x) P_X(dx) = \int_X \kappa(B|x)P_X(dx) = \mathbb{E}_{P_X}\kappa(B|\cdot) </math>
== Properties ==
Line 84:
can be any type of (non negative) measure, not necessarily a probability measure.
== External links ==
* [https://ncatlab.org/nlab/show/Markov+kernel Markov kernel] in [https://ncatlab.org/ nLab].
== References ==
Line 89 ⟶ 93:
* {{citation|first1=Heinz|last1=Bauer|title=Probability Theory|publisher=de Gruyter|year=1996|isbn=3-11-013935-9}}
:: §36. Kernels and semigroups of kernels
== See also ==
* [[Category of Markov kernels]]
[[Category:Markov processes]]
|