Cantor's theorem: Difference between revisions

Content deleted Content added
No edit summary
WillF63 (talk | contribs)
When A is countably infinite: Added "Indeed," to the beginning of a sentence so that the sentence does not start with a mathematical symbol, as is often accepted as bad style. I think this does not change the meaning of the sentence
 
(380 intermediate revisions by more than 100 users not shown)
Line 1:
{{Short description|Every set is smaller than its power set}}
[[Category:Set theory]]
{{For|other theorems bearing Cantor's name}}
[[File:Hasse diagram of powerset of 3.svg|thumb|The cardinality of the set {''x'', ''y'', ''z''}, is three, while there are eight elements in its power set (3 < 2<sup title="Order theory">3</sup> = 8), here [[order theory|ordered]] by [[Inclusion (set theory)|inclusion]].]]
{{SpecialChars}}
 
In mathematical [[set theory]], '''Cantor's theorem''' is a fundamental result which states that, for any [[Set (mathematics)|set]] <math>A</math>, the set of all [[subset]]s of <math>A,</math> known as the [[power set]] of <math>A,</math> has a strictly greater [[cardinality]] than <math>A</math> itself.
''Note: in order to fully understand this article you may want to refer to the set theory portion of the [[Table of mathematical symbols]].''
 
For [[finite set]]s, Cantor's theorem can be seen to be true by simple [[enumeration]] of the number of subsets. Counting the [[empty set]] as a subset, a set with <math>n</math> elements has a total of <math>2^n</math> subsets, and the theorem holds because <math>2^n > n</math> for all [[non-negative integers]].
In Zermelo-Fränkel [[set theory]], '''Cantor's theorem''' states that the [[power set]] ([[set]] of all [[subset]]s) of any set ''A'' has a strictly greater [[cardinality]] than that of ''A''. Cantor's theorem is obvious for finite sets, but surprisingly it holds true for infinite sets as well. In particular, the [[power set]] of a [[countable set|countably infinite]] set is ''un''-countably infinite. To illustrate the validity of Cantor's theorem for infinite sets, just test an infinite set in the proof below.
 
Much more significant is Cantor's discovery of an argument that is applicable to any set, and shows that the theorem holds for [[infinite set|infinite]] sets also. As a consequence, the cardinality of the [[real number]]s, which is the same as that of the power set of the [[integer]]s, is strictly larger than the cardinality of the integers; see [[Cardinality of the continuum]] for details.
==The proof==
 
The theorem is named for [[Georg Cantor]], who first stated and proved it at the end of the 19th century. Cantor's theorem had immediate and important consequences for the [[philosophy of mathematics]]. For instance, by iteratively taking the power set of an infinite set and applying Cantor's theorem, we obtain an endless hierarchy of infinite cardinals, each strictly larger than the one before it. Consequently, the theorem implies that there is no largest [[cardinal number]] (colloquially, "there's no largest infinity").
The proof is a quick [[diagonal argument]]. Let ''f'' be any one-to-one function from ''A'' into the [[power set]] of ''A''. It must be shown that ''f'' is necessarily not surjective. To do that, it is enough to exhibit a subset of ''A'' that is not in the image of ''f''. That subset is
 
==Proof==
:<math>B=\left\{\,x\in A : x\not\in f(x)\,\right\}.</math>
Cantor's argument is elegant and remarkably simple. The complete proof is presented below, with detailed explanations to follow.
 
{{math theorem|name=Theorem (Cantor)|math_statement=Let <math>f</math> be a map from set <math>A</math> to its power set <math>\mathcal{P}(A)</math>. Then <math>f : A \to \mathcal{P}(A)</math> is not [[surjective]]. As a consequence, <math>\operatorname{card}(A) < \operatorname{card}(\mathcal{P}(A))</math> holds for any set <math>A</math>.}}
To show that ''B'' is not in the image of ''f'', suppose that ''B'' ''is'' in the image of ''f''. Then for some ''y'' &isin; ''A'', we have ''f''(''y'') = ''B''. Now consider whether ''y'' &isin; ''B'' or not. If ''y'' &isin; ''B'', then ''y''&isin; ''f''(''y''), but that implies, by definition of ''B'', that ''y'' &notin; ''B''. On the other hand if it is ''y''&notin; ''B'', then ''y'' &notin; ''f''(''y'') and therefore ''y''&isin; ''B''. Either way, we get a contradiction.
{{math proof| <math> B=\{x \in A \mid x \notin f(x)\}</math> exists via the [[axiom schema of specification]], and <math> B \in \mathcal{P}(A) </math> because <math> B \subseteq A </math>. <br> Assume <math> f </math> is surjective. <br> Then there exists a <math>\xi \in A</math> such that <math>f(\xi)=B</math>. <br> From &nbsp;''for all'' <math>x</math> ''in'' <math>A \ [ x \in B \iff x \notin f(x) ]</math> , we deduce &nbsp;<math>\xi \in B \iff \xi \notin f(\xi) </math> &nbsp;via [[universal instantiation]]. <br> The previous deduction yields a contradiction of the form <math>\varphi \Leftrightarrow \lnot \varphi</math>, since <math>f(\xi)=B</math>. <br> Therefore, <math>f</math> is not surjective, via [[reductio ad absurdum]]. <br> We know [[injective function|injective maps]] from <math>A</math> to <math>\mathcal{P}(A)</math> exist. For example, a function <math>g : A \to \mathcal{P}(A)</math> such that <math>g(x) = \{x\}</math>. <br> Consequently, <math>\operatorname{card}(A) < \operatorname{card}(\mathcal{P}(A))</math>. ∎}}
 
By definition of cardinality, we have <math>\operatorname{card}(X) < \operatorname{card}(Y)</math> for any two sets <math>X</math> and <math>Y</math> if and only if there is an [[injective function]] but no [[Bijective Function|bijective function]] from <math>X</math> {{nowrap|to <math>Y</math>.}} It suffices to show that there is no surjection from <math>X</math> {{nowrap|to <math>Y</math>}}. This is the heart of Cantor's theorem: there is no surjective function from any set <math>A</math> to its power set. To establish this, it is enough to show that no function <math>f</math> (that maps elements in <math>A</math> to subsets of <math>A</math>) can reach every possible subset, i.e., we just need to demonstrate the existence of a subset of <math>A</math> that is not equal to <math>f(x)</math> for any <math>x \in A</math>. Recalling that each <math>f(x)</math> is a subset of <math>A</math>, such a subset is given by the following construction, sometimes called the [[Cantor's diagonal argument|Cantor diagonal set]] of <math>f</math>:<ref name="Dasgupta2013">{{cite book|author=Abhijit Dasgupta|title=Set Theory: With an Introduction to Real Point Sets|year=2013|publisher=[[Springer Science & Business Media]]|isbn=978-1-4614-8854-5|pages=362–363}}</ref><ref name="Paulson1992">{{cite book|author=Lawrence Paulson|title=Set Theory as a Computational Logic |url=https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-271.pdf|year=1992|publisher=University of Cambridge Computer Laboratory|page=14}}</ref>
Because of the double occurrence of ''x'' in the expression "''x'' &notin; ''x''", this is a [[diagonal argument]].
 
:<math>B=\{x\in A \mid x\not\in f(x)\}.</math>
==A more concrete account of the same proof of Cantor's theorem==
 
This means, by definition, that for all <math>x\in A</math>, <math>x\in B</math> if and only if <math>x\notin f(x)</math>. For all <math>x</math> the sets <math>B</math> and <math>f(x)</math> cannot be equal because <math>B</math> was constructed from elements of <math>A</math> whose [[Image (mathematics)|images]] under <math>f</math> did not include themselves. For all <math>x\in A</math> either <math>x\in f(x)</math> or <math>x\notin f(x)</math>. If <math>x\in f(x)</math> then <math>f(x)</math> cannot equal <math>B</math> because <math>x\in f(x)</math> by assumption and <math>x\notin B</math> by definition. If <math>x\notin f(x)</math> then <math>f(x)</math> cannot equal <math>B</math> because <math>x\notin f(x)</math> by assumption and <math>x\in B</math> by the definition of <math>B</math>.
Here is a more concrete explanation of the proof of Cantor's theorem given above, which is perhaps easier to understand. This is merely an expanded version of the above proof. Once again this is a [[proof by contradiction]], where we suppose something is true and then by contradiction prove that it is not. Let's suppose that an infinite [[set]] '''''X''''' is [[bijective]] with its [[power set]] '''P'''('''''X'''''). By definition this means that the [[cardinality]] of '''''X''''' is equal to the [[cardinality]] of '''P'''('''''X''''').
 
Equivalently, and slightly more formally, we have just proved that the existence of <math>\xi \in A</math> such that <math>f(\xi )=B</math> implies the following [[contradiction]]:
Let us define '''''X''''' as the following infinite [[set]]:
 
:<math>X=\begin{\,a, b, c, d, e...\aligned}</math>
\xi\in B &\iff \xi\notin f(\xi) && \text{(by definition of }B\text{)}; \\
\xi \in B &\iff \xi \in f(\xi) && \text{(by assumption that }f(\xi)=B\text{)}. \\
 
\end{aligned}</math>
Each letter represents an [[element_(math)|element]] of the infinite [[set]] '''''X''''' (note that the [[set]] does not stop at z, it continues on infinitely with letter and number combinations i.e. ...a2, b2,...).
 
Therefore, by [[reductio ad absurdum]], the assumption must be false.<ref name="Priest2002"/> Thus there is no <math>\xi \in A</math> such that <math>f(\xi )=B</math> ; in other words, <math>B</math> is not in the image of <math>f</math> and <math>f</math> does not map onto every element of the power set of <math>A</math>, i.e., <math>f</math> is not surjective.
Let us see a sample of what '''P'''('''''X''''') looks like:
 
Finally, to complete the proof, we need to exhibit an injective function from <math>A</math> to its power set. Finding such a function is trivial: just map <math>x</math> to the singleton set <math>\{x\}</math>. The argument is now complete, and we have established the strict inequality for any set <math>A</math> that <math>\operatorname{card}(A) < \operatorname{card}(\mathcal{P}(A))</math>.
<math>P(X)=\{\,\{\,a, b\}, \{\,a, b, c\}, \{\,d\}, \{\,a, e\}, \{\,c, d, f\}...\}</math>
 
Another way to think of the proof is that <math>B</math>, empty or non-empty, is always in the power set of <math>A</math>. For <math>f</math> to be [[Surjective function|onto]], some element of <math>A</math> must map to <math>B</math>. But that leads to a contradiction: no element of <math>B</math> can map to <math>B</math> because that would contradict the criterion of membership in <math>B</math>, thus the element mapping to <math>B</math> must not be an element of <math>B</math> meaning that it satisfies the criterion for membership in <math>B</math>, another contradiction. So the assumption that an element of <math>A</math> maps to <math>B</math> must be false; and <math>f</math> cannot be onto.
Now that we know what we are dealing with let us attempt to pair off each [[element_(math)|element]] of '''''X''''' with each [[element_(math)|element]] of '''P'''('''''X''''') to show that these infinite [[set]]s are [[bijective]]. In other words we will attempt to pair off each [[element_(math)|element]] of the infinite [[set]] '''''X''''' with an [[element_(math)|element]] from the infinite [[set]] '''P'''('''''X''''') so that no [[element_(math)|element]] from either infinite [[set]] remains unpaired. Such an attempt to pair off each [[element_(math)|element]] would look like this:
 
Because of the double occurrence of <math>x</math> in the expression "<math>x\in f(x)</math>", this is a [[Cantor's diagonal argument|diagonal argument]]. For a countable (or finite) set, the argument of the proof given above can be illustrated by constructing a table in which
<math>X\begin{Bmatrix} a & \Longleftrightarrow & \{\,d, e\}\\ b & \Longleftrightarrow & \{\,a,b,c\} \\ c & \Longleftrightarrow & \{\,d,e,f\} \\ d & \Longleftrightarrow & \{\,a,c,e\} \\ \vdots & \vdots & \vdots \end{Bmatrix}P(X)</math>
# each row is labelled by a unique <math>x</math> from <math>A=\{x_1 ,x_2 , \ldots \}</math>, in this order. <math>A</math> is assumed to admit a [[Total order|linear order]] so that such table can be constructed.
# each column of the table is labelled by a unique <math>y</math> from the [[power set]] of <math>A</math>; the columns are ordered by the argument to <math>f</math>, i.e. the column labels are <math>f(x_1),f(x_2)</math>, ..., in this order.
# the intersection of each row <math>x</math> and column <math>y</math> records a true/false bit whether <math>x\in y</math>.
Given the order chosen for the row and column labels, the main diagonal <math>D</math> of this table thus records whether <math>x\in f(x)</math> for each <math>x\in A</math>. One such table will be the following:
<math display="block">\begin{array}{cccccc}
& f(x_1) & f(x_2) & f(x_3) & f(x_4) & \cdots \\
\hline
x_1 & {\color{red} T} & T & F & T & \cdots \\
x_2 & T & {\color{red} F} & F & F & \cdots \\
x_3 & F & F & {\color{red} T} & T & \cdots \\
x_4 & F & T & T & {\color{red} T} & \cdots \\
\vdots & \vdots & \vdots & \vdots & \vdots & \ddots
\end{array}</math>
The set <math>B</math> constructed in the previous paragraphs coincides with the row labels for the subset of entries on this main diagonal <math>D</math> (which in above example, coloured red) where the table records that <math>x\in f(x)</math> is false.<ref name="Priest2002">{{cite book|author=Graham Priest|title=Beyond the Limits of Thought|year=2002|publisher=Oxford University Press|isbn=978-0-19-925405-7|pages=118–119}}<!--note that the page numbers differ between the OUP and CUP editions of Priest's book!--></ref> Each row records the values of the [[indicator function]] of the set corresponding to the column. The indicator function of <math>B</math> coincides with the [[logical negation|logically negated]] (swap "true" and "false") entries of the main diagonal. Thus the indicator function of <math>B</math> does not agree with any column in at least one entry. Consequently, no column represents <math>B</math>.
 
Despite the simplicity of the above proof, it is rather difficult for an [[automated theorem prover]] to produce it. The main difficulty lies in an automated discovery of the Cantor diagonal set. [[Lawrence Paulson]] noted in 1992 that [[Otter (theorem prover)|Otter]] could not do it, whereas [[Isabelle (proof assistant)|Isabelle]] could, albeit with a certain amount of direction in terms of tactics that might perhaps be considered cheating.<ref name="Paulson1992"/>
Take note that some [[element_(math)|elements]] of '''''X''''' are paired with [[subset]]s that do not contain them. For instance, in our example the [[element_(math)|element]] <math>\boldsymbol a </math> is paired with the [[subset ]] <math>\{\,d,e\}</math>. Other [[element_(math)|elements]] of '''''X''''' are paired with [[subset]]s that do contain them. For instance, the [[element_(math)|element]] <math>\boldsymbol b </math> is paired with the [[subset ]] <math>\{\,a,b,c\}</math>.
 
==When ''A'' is countably infinite==
Using this idea, let us build a special [[subset ]]. Let us build a [[subset ]] '''''D''''' which consists of all [[element_(math)|elements]] of '''''X''''' which are paired with [[subset]]s that do not contain them. By definition our [[power set]] '''P'''('''''X''''') must contain this [[subset ]] '''''D'''''. Therefore, this [[subset ]] '''''D''''' must be placed in our column of [[subset]]s to be paired off. However, this causes a problem because with what [[element_(math)|element]] of '''''X''''' can '''''D''''' be paired with? If we pair an [[element_(math)|element]] of '''''X''''' off with '''''D''''' we must decide whether or not to place this [[element_(math)|element]] into '''''D'''''. If we decide to keep this [[element_(math)|element]] out of '''''D''''' we are immediately forced to place it into '''''D''''' by the very definition of '''''D'''''. Once we place the [[element_(math)|element]] into '''''D''''' we are immediately forced to remove the [[element_(math)|element]] from '''''D''''', once again by the definition of '''''D'''''. This is a contradiction because the [[element_(math)|element]] cannot be both inside and outside of '''''D''''' at the same time. Therefore, there is no [[element_(math)|element]] of '''''X''''' which can be paired with '''''D''''' and we have contradicted our original supposition, that there is a [[bijection]] between '''''X''''' and '''P'''('''''X''''').
Let us examine the proof for the specific case when <math>A</math> is [[countably infinite]]. [[Without loss of generality]], we may take <math>A = \mathbb{N} = \{1,2,3,\ldots\}</math>, the set of [[natural number]]s.
 
Suppose that <math>\mathbb{N}</math> is [[equinumerous]] with its [[power set]] <math>\mathcal{P}(\mathbb{N})</math>. Let us see a sample of what <math>\mathcal{P}(\mathbb{N})</math> looks like:
Through this [[proof by contradiction]] we have proven that the [[cardinality]] of '''''X''''' and '''P'''('''''X''''') can not be equal. We also know that the [[cardinality]] of '''P'''('''''X''''') can not be less than the [[cardinality]] of '''''X''''' because '''P'''('''''X''''') contains all of the [[singleton (mathematics) | singletons]] of '''''X''''', by definition. Therefore, only one possibility remains and that is the [[cardinality]] of '''P'''('''''X''''') is greater than the [[cardinality]] of '''''X''''', and this proves Cantor's theorem.
 
:<math>\mathcal{P}(\mathbb{N})=\{\varnothing,\{1, 2\}, \{1, 2, 3\}, \{4\}, \{1, 5\}, \{3, 4, 6\}, \{2, 4, 6,\dots\},\dots\}.</math>
== History ==
 
Indeed, <math>\mathcal{P}(\mathbb{N})</math> contains infinite subsets of <math>\mathbb{N}</math>, e.g. the set of all positive even numbers <math>\{2, 4, 6,\ldots\}=\{2k:k\in \mathbb{N}\}</math>, along with the [[empty set]] <math>\varnothing</math>.
Cantor gave essentially this proof in a paper published in 1891 ("Ueber eine elementare Frage der Mannigfaltigkeitslehre"), where the [[Cantor's diagonal argument|diagonal argument]] for the uncountability of the [[real number|reals]] also first appears (he had [[Cantor's first uncountability proof|earlier proved the uncountability of the reals by other methods]]). The version of this argument he gave in that paper was phrased in terms of indicator functions on a set rather than subsets of a set. He showed that if ''f'' is a function defined on ''X'' whose values are 2-valued functions on ''X'', then the 2-valued function ''G''(''x'') = 1 &minus; ''f''(''x'')(''x'') is not in the range of ''f''.
 
Now that we have an idea of what the elements of <math>\mathcal{P}(\mathbb{N})</math> are, let us attempt to pair off each [[element (math)|element]] of <math>\mathbb{N}</math> with each element of <math>\mathcal{P}(\mathbb{N})</math> to show that these infinite sets are equinumerous. In other words, we will attempt to pair off each element of <math>\mathbb{N}</math> with an element from the infinite set <math>\mathcal{P}(\mathbb{N})</math>, so that no element from either infinite set remains unpaired. Such an attempt to pair elements would look like this:
Russell has a very similar proof in ''Principles of Mathematics'' (1903, section 348, where he shows that that there are more [[propositional function]]s than objects. "For suppose a correlation of all objects and some propositional functions to have been affected [sic], and let phi-x be the correlate of x. Then "not-phi-x(x)," i.e. "phi-x does not hold of x" is a propositional function not contained in this correlation; for it is true or false of x according as phi-x is false or true of x, and therefore it differs from phi-x for every value of x." He attributes the idea behind the proof to Cantor.
 
:<math>\mathbb{N}\begin{Bmatrix} 1 & \longleftrightarrow & \{4, 5\}\\ 2 & \longleftrightarrow & \{1, 2, 3\} \\ 3 & \longleftrightarrow & \{4, 5, 6\} \\ 4 & \longleftrightarrow & \{1, 3, 5\} \\ \vdots & \vdots & \vdots \end{Bmatrix}\mathcal{P}(\mathbb{N}).</math>
[[Ernst Zermelo]] has a theorem (which he calls "Cantor's Theorem") that is identical to the form above in the paper that became the foundation of modern set theory ("Untersuchungen über die Grundlagen der Mengenlehre I"), published in 1908. See [[Zermelo set theory]].
 
Given such a pairing, some natural numbers are paired with [[subset]]s that contain the very same number. For instance, in our example the number 2 is paired with the subset {1, 2, 3}, which contains 2 as a member. Let us call such numbers ''selfish''. Other natural numbers are paired with [[subset]]s that do not contain them. For instance, in our example the number 1 is paired with the subset {4, 5}, which does not contain the number 1. Call these numbers ''non-selfish''. Likewise, 3 and 4 are non-selfish.
For one consequence of Cantor's theorem, see [[beth number]]s.
 
Using this idea, let us build a special set of natural numbers. This set will provide the [[proof by contradiction|contradiction]] we seek. Let <math>B</math> be the set of ''all'' non-selfish natural numbers. By definition, the [[power set]] <math>\mathcal{P}(\mathbb{N})</math> contains all sets of natural numbers, and so it contains this set <math>B</math> as an element. If the mapping is bijective, <math>B</math> must be paired off with some natural number, say <math>b</math>. However, this causes a problem. If <math>b</math> is in <math>B</math>, then <math>b</math> is selfish because it is in the corresponding set, which contradicts the definition of <math>B</math>. If <math>b</math> is not in <math>B</math>, then it is non-selfish and it should instead be a member of <math>B</math>. Therefore, no such element <math>b</math> which maps to <math>B</math> can exist.
[[Category:Set theory]][[Category:Theorems]]
 
Since there is no natural number which can be paired with <math>B</math>, we have contradicted our original supposition, that there is a [[bijection]] between <math>\mathbb{N}</math> and <math>\mathcal{P}(\mathbb{N})</math>.
 
Note that the set <math>B</math> may be empty. This would mean that every natural number <math>x</math> maps to a subset of natural numbers that contains <math>x</math>. Then, every number maps to a nonempty set and no number maps to the empty set. But the empty set is a member of <math>\mathcal{P}(\mathbb{N})</math>, so the mapping still does not cover <math>\mathcal{P}(\mathbb{N})</math>.
 
Through this [[proof by contradiction]] we have proven that the [[cardinality]] of <math>\mathbb{N}</math> and <math>\mathcal{P}(\mathbb{N})</math> cannot be equal. We also know that the [[cardinality]] of <math>\mathcal{P}(\mathbb{N})</math> cannot be less than the [[cardinality]] of <math>\mathbb{N}</math> because <math>\mathcal{P}(\mathbb{N})</math> contains all [[singleton (mathematics)|singleton]]s, by definition, and these singletons form a "copy" of <math>\mathbb{N}</math> inside of <math>\mathcal{P}(\mathbb{N})</math>. Therefore, only one possibility remains, and that is that the [[cardinality]] of <math>\mathcal{P}(\mathbb{N})</math> is strictly greater than the [[cardinality]] of <math>\mathbb{N}</math>, proving Cantor's theorem.
 
==Related paradoxes==
Cantor's theorem and its proof are closely related to two [[paradoxes of set theory]].
 
[[Cantor's paradox]] is the name given to a contradiction following from Cantor's theorem together with the assumption that there is a set containing all sets, the [[universal set]] <math>V</math>. In order to distinguish this paradox from the next one discussed below, it is important to note what this contradiction is. By Cantor's theorem <math>|\mathcal{P}(X)| > |X|</math> for any set <math>X</math>. On the other hand, all elements of <math>\mathcal{P}(V)</math> are sets, and thus contained in <math>V</math>, therefore <math>|\mathcal{P}(V)| \leq |V|</math>.<ref name="Dasgupta2013"/>
 
Another paradox can be derived from the proof of Cantor's theorem by instantiating the function ''f'' with the [[identity function]]; this turns Cantor's diagonal set into what is sometimes called the ''Russell set'' of a given set ''A'':<ref name="Dasgupta2013"/>
 
:<math>R_A=\left\{\,x\in A : x\not\in x\,\right\}.</math>
 
The proof of Cantor's theorem is straightforwardly adapted to show that assuming a set of all sets ''U'' exists, then considering its Russell set ''R''<sub>''U''</sub> leads to the contradiction:
:<math>R_U \in R_U \iff R_U \notin R_U.</math>
 
This argument is known as [[Russell's paradox]].<ref name="Dasgupta2013"/> As a point of subtlety, the version of Russell's paradox we have presented here is actually a theorem of [[Zermelo]];<ref name="Ebbinghaus2007"/> we can conclude from the contradiction obtained that we must reject the hypothesis that ''R''<sub>''U''</sub>∈''U'', thus disproving the existence of a set containing all sets. This was possible because we have used [[Axiom schema of specification|restricted comprehension]] (as featured in [[ZFC]]) in the definition of ''R''<sub>''A''</sub> above, which in turn entailed that
 
:<math>R_U \in R_U \iff (R_U \in U \wedge R_U \notin R_U). </math>
 
Had we used [[unrestricted comprehension]] (as in [[Frege]]'s system for instance) by defining the Russell set simply as <math>R=\left\{\,x : x\not\in x\,\right\}</math>, then the axiom system itself would have entailed the contradiction, with no further hypotheses needed.<ref name="Ebbinghaus2007">{{cite book|author=Heinz-Dieter Ebbinghaus|title=Ernst Zermelo: An Approach to His Life and Work|url=https://archive.org/details/ernstzermeloappr00ebbi_571|url-access=limited|year=2007|publisher=Springer Science & Business Media|isbn=978-3-540-49553-6|pages=[https://archive.org/details/ernstzermeloappr00ebbi_571/page/n97 86]–87}}</ref>
 
Despite the syntactical similarities between the Russell set (in either variant) and the Cantor diagonal set, [[Alonzo Church]] emphasized that Russell's paradox is independent of considerations of cardinality and its underlying notions like one-to-one correspondence.<ref>Church, A. [1974] "Set theory with a universal set." in ''Proceedings of the Tarski Symposium. Proceedings of Symposia in Pure Mathematics XXV,'' ed. L. Henkin, Providence RI, Second printing with additions 1979, pp. 297−308. {{ISBN|978-0-8218-7360-1}}. Also published in ''International Logic Review'' 15 pp. 11−23.</ref>
 
==History==
Cantor gave essentially this proof in a paper published in 1891 "Über eine elementare Frage der Mannigfaltigkeitslehre",<ref>{{Citation|language=de|first=Georg|last=Cantor|url=http://gdz.sub.uni-goettingen.de/dms/load/img/?PID=GDZPPN002113910&physid=PHYS_0084 <!--http://resolver.sub.uni-goettingen.de/purl?GDZPPN002113910-->|title=Über eine elementare Frage der Mannigfaltigskeitslehre|journal=Jahresbericht der Deutschen Mathematiker-Vereinigung|volume=1|year=1891|pages=75–78}}, also in ''Georg Cantor, Gesammelte Abhandlungen mathematischen und philosophischen Inhalts'', E. Zermelo, 1932.</ref> where the [[Cantor's diagonal argument|diagonal argument]] for the uncountability of the [[real number|reals]] also first appears (he had [[Cantor's first uncountability proof|earlier proved the uncountability of the reals by other methods]]). The version of this argument he gave in that paper was phrased in terms of indicator functions on a set rather than subsets of a set.<ref>A. Kanamori, "[https://math.bu.edu/people/aki/8.pdf The Empty Set, the Singleton, and the Ordered Pair]", p.276. Bulletin of Symbolic Logic vol. 9, no. 3, (2003). Accessed 21 August 2023.</ref> He showed that if ''f'' is a function defined on ''X'' whose values are 2-valued functions on ''X'', then the 2-valued function ''G''(''x'') = 1 &minus; ''f''(''x'')(''x'') is not in the range of ''f''.
 
[[Bertrand Russell]] has a very similar proof in ''[[Principles of Mathematics]]'' (1903, section 348), where he shows that there are more [[propositional function]]s than objects. "For suppose a correlation of all objects and some propositional functions to have been affected, and let phi-''x'' be the correlate of ''x''. Then "not-phi-''x''(''x'')," i.e. "phi-''x'' does not hold of ''x''" is a propositional function not contained in this correlation; for it is true or false of ''x'' according as phi-''x'' is false or true of ''x'', and therefore it differs from phi-''x'' for every value of ''x''." He attributes the idea behind the proof to Cantor.
 
[[Ernst Zermelo]] has a theorem (which he calls "Cantor's Theorem") that is identical to the form above in the paper that became the foundation of modern set theory ("Untersuchungen über die Grundlagen der Mengenlehre I"), published in 1908. See [[Zermelo set theory]].
 
==Generalizations==
[[Lawvere's fixed-point theorem]] provides for a broad generalization of Cantor's theorem to any [[Category (mathematics)|category]] with [[Product (category theory)|finite products]] in the following way:<ref name="LawvereSchanuel2009">{{cite book|author1=F. William Lawvere|author2=Stephen H. Schanuel|title=Conceptual Mathematics: A First Introduction to Categories|year=2009|publisher=Cambridge University Press|isbn=978-0-521-89485-2|at=Session 29|url-access=registration|url=https://archive.org/details/conceptualmathem00lawv}}</ref> let <math>\mathcal{C}</math> be such a category, and let <math>1</math> be a terminal object in <math>\mathcal{C}</math>. Suppose that <math>Y</math> is an object in <math>\mathcal{C}</math> and that there exists an endomorphism <math>\alpha : Y \to Y</math> that does not have any fixed points; that is, there is no morphism <math>y:1 \to Y</math> that satisfies <math>\alpha \circ y = y</math>. Then there is no object <math>T</math> of <math>\mathcal{C}</math> such that a morphism <math>f: T \times T \to Y</math> can parameterize all morphisms <math>T \to Y</math>. In other words, for every object <math>T</math> and every morphism <math>f : T \times T \to Y</math>, an attempt to write maps <math>T \to Y</math> as maps of the form <math>f(-,x) : T \to Y</math> must leave out at least one map <math>T \to Y</math>.
 
==See also==
*[[Schröder–Bernstein theorem]]
*[[Cantor's first uncountability proof]]
*[[Controversy over Cantor's theory]]
 
==References==
{{reflist}}
*[[Paul Halmos|Halmos, Paul]], ''[[Naive Set Theory (book)|Naive Set Theory]]''. Princeton, NJ: D. Van Nostrand Company, 1960. Reprinted by [[Springer-Verlag]], New York, 1974. {{ISBN|0-387-90092-6}} (Springer-Verlag edition). Reprinted by Martino Fine Books, 2011. {{ISBN|978-1-61427-131-4}} (Paperback edition).
*{{Citation|last=Jech|first=Thomas|author-link=Thomas Jech|year=2002|title=Set Theory|edition=3rd millennium|series=Springer Monographs in Mathematics|publisher=Springer|isbn=3-540-44085-2}}
 
==External links==
* {{springer|title=Cantor theorem|id=p/c020260}}
* {{MathWorld|title=Cantor's Theorem|id=CantorsTheorem}}
 
{{Metalogic}}
{{Set theory}}
{{Mathematical logic}}
 
[[Category:1891 introductions]]
[[Category:1891 in science]]
[[Category:Set theory]]
[[Category:Theorems in the foundations of mathematics]]
[[Category:Cardinal numbers]]
[[Category:Georg Cantor]]