Content deleted Content added
→Exchangeability and the i.i.d statistical model: replacing puffery |
Link suggestions feature: 2 links added. |
||
(42 intermediate revisions by 26 users not shown) | |||
Line 1:
{{Short description|Concept in statistics}}
In [[statistics]], an '''exchangeable sequence of random variables''' (also sometimes '''interchangeable''')<ref name="ChowTeicher"/> is a sequence ''X''<sub>1</sub>, ''X''<sub>2</sub>, ''X''<sub>3</sub>, ... (which may be finitely or infinitely long) whose [[joint probability distribution]] does not change when the positions in the sequence in which finitely many of them appear are altered. In other words, the joint distribution is invariant to finite permutation. Thus, for example the sequences
: <math> X_1, X_2, X_3, X_4, X_5, X_6 \quad \text{ and } \quad X_3, X_6, X_1, X_5, X_2, X_4 </math>
both have the same joint probability distribution.
It is closely related to the use of [[independent and identically distributed random variables]] in statistical models. Exchangeable sequences of random variables arise in cases of [[simple random sampling]].
== Definition ==
Formally, an '''exchangeable sequence of random variables''' is a finite or infinite sequence ''X''<sub>1</sub>, ''X''<sub>2</sub>, ''X''<sub>3</sub>, ... of [[random variable]]s such that for any finite [[permutation]] σ of the indices 1, 2, 3, ..., (the permutation acts on only finitely many indices, with the rest fixed), the [[joint probability distribution]] of the permuted sequence▼
▲is a finite or infinite sequence ''X''<sub>1</sub>, ''X''<sub>2</sub>, ''X''<sub>3</sub>, ... of [[random variable]]s such that for any finite [[permutation]] σ of the indices 1, 2, 3, ..., (the permutation acts on only finitely many indices, with the rest fixed), the [[joint probability distribution]] of the permuted sequence
:<math> X_{\sigma(1)}, X_{\sigma(2)}, X_{\sigma(3)}, \dots</math>
Line 10 ⟶ 16:
* Chow, Yuan Shih and Teicher, Henry, ''Probability theory. Independence, interchangeability, martingales,'' Springer Texts in Statistics, 3rd ed., Springer, New York, 1997. xxii+488 pp. {{ISBN|0-387-98228-0}}</ref><ref>Aldous, David J., ''Exchangeability and related topics'', in: École d'Été de Probabilités de Saint-Flour XIII — 1983, Lecture Notes in Math. 1117, pp. 1–198, Springer, Berlin, 1985. {{ISBN|978-3-540-15203-3}} {{doi|10.1007/BFb0099421}}</ref>
(A sequence ''E''<sub>1</sub>, ''E''<sub>2</sub>, ''E''<sub>3</sub>, ... of events is said to be exchangeable precisely if the sequence of its [[indicator function]]s is exchangeable.) The distribution function ''F''<sub>''X''<sub>
== History ==
The concept was introduced by [[William Ernest Johnson]] in his 1924 book ''Logic, Part III: The Logical Foundations of Science''.<ref>
== Exchangeability and the i.i.d. statistical model ==
The property of exchangeability is closely related to the use of [[independent and identically
This means that infinite sequences of exchangeable random variables can be regarded equivalently as sequences of conditionally i.i.d. random variables, based on some underlying distributional form. (Note that this equivalence does not quite hold for finite exchangeability. However, for finite vectors of random variables there is a close approximation to the i.i.d. model.) An infinite exchangeable sequence is [[strictly stationary]] and so a [[law of large numbers]] in the form of [[
'''The
:
(This is the [[
:
If the distribution function <math>F_\
:
These equations show the joint distribution or density characterised as a mixture distribution based on the underlying limiting empirical distribution (or a parameter indexing this distribution).
Note that not all finite exchangeable sequences are mixtures of i.i.d. To see this, consider sampling without replacement from a [[finite set]] until no elements are left. The resulting sequence is exchangeable, but not a mixture of i.i.d. Indeed, conditioned on all other elements in the sequence, the remaining element is known.
== Covariance and
Exchangeable sequences have some basic [[covariance and correlation]] properties which mean that they are generally positively correlated. For infinite sequences of exchangeable random variables, the covariance between the random variables is equal to the variance of the mean of the underlying distribution function.<ref name="O'Neill"/> For finite exchangeable sequences the covariance is also a fixed value which does not depend on the particular random variables in the sequence. There is a weaker lower bound than for infinite exchangeability and it is possible for negative correlation to exist.
The finite sequence result may be proved as follows. Using the fact that the values are exchangeable, we have
\begin{align}
0 & \le \operatorname{var}(X_1 + \cdots + X_n) \\
Line 64 ⟶ 69:
We can then solve the inequality for the covariance yielding the stated lower bound. The non-negativity of the covariance for the infinite sequence can then be obtained as a limiting result from this finite sequence result.
== Examples ==
* Any [[convex combination]] or [[mixture distribution]] of [[iid]] sequences of random variables is exchangeable. A converse proposition is [[de Finetti's theorem]].<ref>Spizzichino, Fabio ''Subjective probability models for lifetimes''. Monographs on Statistics and Applied Probability, 91. ''Chapman & Hall/CRC'', Boca Raton, FL, 2001. xx+248 pp. {{ISBN|1-58488-060-0}}
</ref>
* Suppose an [[urn model|urn]] contains
* Suppose an urn contains <math>n</math> red and <math>m</math> blue marbles. Further suppose a marble is drawn from the urn and then replaced, with an extra marble of the same colour. Let <math>X_i</math> be the indicator random variable of the event that the <math>i</math>-th marble drawn is red. Then <math>\left\{ X_i \right\}_{i\in \N}</math> is an exchangeable sequence. This model is called [[Polya's urn]].
* Let <math>(X, Y)</math> have a [[bivariate normal distribution]] with parameters <math>\mu = 0</math>, <math>\sigma_x = \sigma_y = 1</math> and an arbitrary [[Pearson product-moment correlation coefficient|correlation coefficient]] <math>\rho\in (-1, 1)</math>. The random variables <math>X</math> and <math>Y</math> are then exchangeable, but independent only if <math>\rho=0</math>. The [[density function]] is <math>p(x, y) = p(y, x) \propto \exp\left[-\frac{1}{2(1-\rho^2)}(x^2+y^2-2\rho xy)\right].</math>
Line 78 ⟶ 84:
Exchangeable random variables arise in the study of [[U statistic]]s, particularly in the Hoeffding decomposition.<ref>{{cite book |last=Borovskikh | first=Yu. V. | title=''U''-statistics in Banach spaces | publisher=VSP | ___location=Utrecht | year=1996 | pages=365–376 | isbn=90-6764-200-2 | mr=1419498|chapter=Chapter 10 Dependent variables}}</ref>
Exchangeability is a key assumption of the distribution-free inference method of [[conformal prediction]].<ref>{{cite journal |first1=Glenn |last1=Shafer |first2=Vladimir |last2=Vovk |title=A Tutorial on Conformal Prediction |journal=Journal of Machine Learning Research |volume=9 |year=2008 |pages=371–421 |url=https://www.jmlr.org/papers/v9/shafer08a.html }}</ref>
==See also==
* [[De Finetti theorem]]
* [[Resampling (statistics)#Permutation tests|Permutation tests]], a statistical test based on exchanging between groups▼
* [[
* [[Resampling (statistics)|Resampling]]
▲*
==
{{Reflist}}
==
* Aldous, David J., ''Exchangeability and related topics'', in: École d'Été de Probabilités de Saint-Flour XIII — 1983, Lecture Notes in Math. 1117, pp. 1–198, Springer, Berlin, 1985. {{ISBN|978-3-540-15203-3}} {{doi|10.1007/BFb0099421}}
* Chow, Yuan Shih and Teicher, Henry, ''Probability theory. Independence, interchangeability, martingales,'' Springer Texts in Statistics, 3rd ed., Springer, New York, 1997. xxii+488 pp. {{ISBN|0-387-98228-0}}
* {{cite book |last=Dawid |first=A. Philip |chapter=Exchangeability and its ramifications |pages=19–30 |title=Bayesian Theory and Applications |editor-first=Paul |editor-last=Damien |editor2-first=Petros |editor2-last=Dellaportas |editor3-first=Nicholas G. |editor3-last=Polson |editor4-first=David A. |editor4-last=Stephens |display-editors=1 |publisher=Oxford University Press |year=2013 |isbn=978-0-19-969560-7 }}
* [[Olav Kallenberg|Kallenberg, O.]], ''Probabilistic symmetries and invariance principles''. Springer-Verlag, New York (2005). 510 pp. {{ISBN|0-387-25115-4}}.
* Kingman, J. F. C., ''Uses of exchangeability'', Ann. Probability 6 (1978) 83–197 {{MR|494344}} {{JSTOR|2243211}}
* O'Neill, B. (2009) Exchangeability, Correlation and Bayes' Effect. ''International Statistical Review'' '''77(2)''', pp. 241–250. {{ISBN|978-3-540-15203-3}} {{doi|10.1111/j.1751-5823.2008.00059.x}}
* {{cite book|title=Limit theorems for sums of exchangeable random variables|first1=Robert Lee|last1=Taylor|first2=Peter Z.|last2=Daffer|first3=Ronald F.|last3=Patterson
{{DEFAULTSORT:Exchangeable Random Variables}}
|