Conditional probability: Difference between revisions

Tom Minka (talk | contribs)
Conditioning on an event of probability zero: Add equation number and reference
Instead of conditioning on {{mvar|X}} being ''exactly'' {{mvar|x}}, we could condition on it being within distance <math>\epsilon</math> of {{mvar|x}}. The event <math>B = \{ x-\epsilon < X < x+\epsilon \}</math> generally has nonzero probability and hence can be conditioned on.
We can then take the [[limit (mathematics)|limit]]
{{NumBlk|::|<math>\lim_{\epsilon \to 0} P(A \mid x-\epsilon < X < x+\epsilon).</math>|{{EquationRef|1}}}}
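This limit can be checked numerically. The following Monte Carlo sketch (the bivariate normal pair, the event <math>A = \{Y > 0\}</math>, and the point <math>x = 1</math> are choices made for this illustration, not part of the article's setup) estimates <math>P(A \mid x-\epsilon < X < x+\epsilon)</math> for shrinking windows and compares it with the closed-form conditional probability:

```python
# Illustration of the limit (1): condition on X lying within epsilon of x
# instead of on the zero-probability event {X = x}.
# Assumed setup for this sketch: (X, Y) bivariate standard normal with
# correlation rho, A = {Y > 0}, x = 1.
import math
import random

random.seed(0)
rho = 0.5
x = 1.0
n = 1_000_000

# Draw (X, Y) pairs: Y = rho*X + sqrt(1 - rho^2)*Z with Z independent of X.
samples = []
for _ in range(n):
    X = random.gauss(0.0, 1.0)
    Y = rho * X + math.sqrt(1 - rho**2) * random.gauss(0.0, 1.0)
    samples.append((X, Y))

def cond_prob(eps):
    """Estimate P(Y > 0 | x - eps < X < x + eps)."""
    window = [Y for (X, Y) in samples if x - eps < X < x + eps]
    return sum(y > 0 for y in window) / len(window)

# For this setup, Y | X = x is N(rho*x, 1 - rho^2), so the limit is
# Phi(rho*x / sqrt(1 - rho^2)), where Phi is the standard normal CDF.
z = rho * x / math.sqrt(1 - rho**2)
exact = 0.5 * (1 + math.erf(z / math.sqrt(2)))
for eps in (0.5, 0.1, 0.02):
    print(eps, cond_prob(eps), exact)
```

As <math>\epsilon</math> shrinks, the window estimates approach the closed-form value, at the cost of using fewer and fewer of the samples.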
 
For example, if two continuous random variables {{mvar|X}} and {{mvar|Y}} have a joint density <math>f_{X,Y}(x,y)</math>, then by [[L'Hôpital's rule]] and the [[Leibniz integral rule]], upon differentiation with respect to <math>\epsilon</math>:
For a measurable set {{mvar|U}},
:<math>\lim_{\epsilon \to 0} P(Y \in U \mid x_0-\epsilon < X < x_0+\epsilon) = \frac{\int_U f_{X,Y}(x_0, y) \, dy}{f_X(x_0)}.</math>
The resulting limit is the [[conditional probability distribution]] of {{mvar|Y}} given {{mvar|X}} and exists when the denominator, the probability density <math>f_X(x_0)</math>, is strictly positive.
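The L'Hôpital/Leibniz step can be sketched as follows (assuming <math>f_{X,Y}</math> and <math>f_X</math> are continuous at <math>x_0</math> and <math>f_X(x_0) > 0</math>). For a measurable set {{mvar|U}},
:<math>P(Y \in U \mid x_0-\epsilon < X < x_0+\epsilon) = \frac{\int_{x_0-\epsilon}^{x_0+\epsilon} \int_U f_{X,Y}(x,y) \, dy \, dx}{\int_{x_0-\epsilon}^{x_0+\epsilon} f_X(x) \, dx}.</math>
Numerator and denominator both tend to zero as <math>\epsilon \to 0</math>, so by L'Hôpital's rule, differentiating each with respect to <math>\epsilon</math> via the Leibniz integral rule gives
:<math>\lim_{\epsilon \to 0} \frac{\int_U f_{X,Y}(x_0+\epsilon,y) \, dy + \int_U f_{X,Y}(x_0-\epsilon,y) \, dy}{f_X(x_0+\epsilon) + f_X(x_0-\epsilon)} = \frac{\int_U f_{X,Y}(x_0,y) \, dy}{f_X(x_0)},</math>
by continuity of the densities at <math>x_0</math>.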
 
It is tempting to ''define'' the undefined probability <math>P(A \mid X=x)</math> using this limit ({{EquationNote|1}}), but this cannot be done in a consistent manner. In particular, it is possible to find random variables {{mvar|X}} and {{mvar|W}} and values {{mvar|x}}, {{mvar|w}} such that the events <math>\{X = x\}</math> and <math>\{W = w\}</math> are identical but the resulting limits are not:<ref>{{cite web |last1=Gal |first1=Yarin |title=The Borel–Kolmogorov paradox |url=https://www.cs.ox.ac.uk/people/yarin.gal/website/PDFs/Short-talk-03-2014.pdf}}</ref>
:<math>\lim_{\epsilon \to 0} P(A \mid x-\epsilon < X < x+\epsilon) \neq \lim_{\epsilon \to 0} P(A \mid w-\epsilon < W < w+\epsilon).</math>
The [[Borel–Kolmogorov paradox]] demonstrates this with a geometrical argument.
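A small simulation makes the inconsistency concrete. The pair below is a hypothetical example constructed for this sketch (it is not the example from the cited talk): {{mvar|W}} is a strictly increasing function of {{mvar|X}}, so <math>\{X = 0\}</math> and <math>\{W = 0\}</math> are the same event, yet the two limits differ because the window <math>-\epsilon < W < \epsilon</math> corresponds to the asymmetric window <math>-\epsilon < X < 2\epsilon</math>:

```python
# Hypothetical example of the inconsistency: X ~ N(0,1) and
# W = X for X <= 0, W = X/2 for X > 0. W is strictly increasing in X,
# so {X = 0} = {W = 0}, but conditioning on small windows around 0
# in X versus in W gives different limits for A = {X > 0}.
import random

random.seed(0)
xs = [random.gauss(0.0, 1.0) for _ in range(1_000_000)]

def w(x):
    return x if x <= 0 else x / 2

def cond_on_X(eps):
    """Estimate P(X > 0 | -eps < X < eps)."""
    window = [x for x in xs if -eps < x < eps]
    return sum(x > 0 for x in window) / len(window)

def cond_on_W(eps):
    """Estimate P(X > 0 | -eps < W < eps)."""
    window = [x for x in xs if -eps < w(x) < eps]
    return sum(x > 0 for x in window) / len(window)

# cond_on_X tends to 1/2, while cond_on_W tends to 2/3, since
# -eps < W < eps is the window -eps < X < 2*eps on the X scale.
for eps in (0.2, 0.05, 0.01):
    print(eps, cond_on_X(eps), cond_on_W(eps))
```

Either limit is a reasonable candidate for "the" conditional probability given the event <math>\{X = 0\} = \{W = 0\}</math>, which is why no consistent definition by this route is possible.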