Probabilistic method

Latest revision as of 01:18, 19 May 2025
{{short description|Nonconstructive method for mathematical proofs}}
In [[mathematics]], the '''probabilistic method''' is a [[nonconstructive proof|nonconstructive]] method, primarily used in [[combinatorics]] and pioneered by [[Paul Erdős]], for [[mathematical proof|proving]] the existence of a prescribed kind of mathematical object. It works by showing that if one randomly chooses objects from a specified class, the [[probability]] that the result is of the prescribed kind is strictly greater than zero. Although the proof uses probability, the final conclusion is determined for ''certain'', without any possible error.

This method has now been applied to other areas of [[mathematics]] such as [[number theory]], [[linear algebra]], and [[real analysis]], as well as in [[computer science]] (e.g. [[randomized rounding]]), and [[information theory]].

==Introduction==
If every object in a collection of objects fails to have a certain property, then the probability that a random object chosen from the collection has that property is zero. Thus, by [[contraposition]], if the probability that a random object chosen from the collection has that property is nonzero, then some object in the collection must possess the property.

Similarly, showing that the probability is (strictly) less than 1 can be used to prove the existence of an object that does ''not'' satisfy the prescribed properties.

Another way to use the probabilistic method is by calculating the [[expected value]] of some [[random variable]]. If it can be shown that the random variable can take on a value less than the expected value, this proves that the random variable can also take on some value greater than the expected value.
Alternatively, the probabilistic method can also be used to guarantee the existence of an element of a sample space whose value is greater than or equal to the calculated expected value, since the non-existence of such an element would imply that every element in the sample space has a value less than the expected value, a contradiction.

Common tools used in the probabilistic method include [[Markov's inequality]], the [[Chernoff bound]], and the [[Lovász local lemma]].

==Two examples due to Erdős==
Although others before him proved [[theorem]]s via the probabilistic method (for example, Szele's 1943 result that there exist [[tournament (graph theory)|tournaments]] containing a large number of [[Hamiltonian cycle]]s), many of the best-known proofs using this method are due to Erdős. The first example below describes one such result from 1947 that gives a proof of a lower bound for the [[Ramsey's theorem|Ramsey number]] {{math|''R''(''r'', ''r'')}}.

===First example===
Suppose we have a [[complete graph]] on {{mvar|n}} [[vertex (graph theory)|vertices]]. We wish to show (for small enough values of {{mvar|n}}) that it is possible to color the [[edge (graph theory)|edges]] of the [[graph (discrete mathematics)|graph]] in two colors (say red and blue) so that there is no complete [[subgraph (graph theory)|subgraph]] on {{mvar|r}} vertices which is monochromatic (every edge colored the same color).

To do so, we color the graph randomly. Color each edge independently with probability {{math|1/2}} of being red and {{math|1/2}} of being blue. We calculate the expected number of monochromatic subgraphs on {{mvar|r}} vertices as follows:

For any set <math>S_r</math> of <math>r</math> vertices from our graph, define the variable <math>X(S_r)</math> to be {{math|1}} if every edge amongst the <math>r</math> vertices is the same color, and {{math|0}} otherwise.
Note that the number of monochromatic <math>r</math>-subgraphs is the sum of <math>X(S_r)</math> over all possible [[subset]]s <math>S_r</math>. For any individual set <math>S_r^i</math>, the [[expected value]] of <math>X(S_r^i)</math> is simply the probability that all of the <math>C(r, 2)</math> edges in <math>S_r^i</math> are the same color:

:<math>E[X(S_r^i)] = 2 \cdot 2^{-{r \choose 2}}</math>

(the factor of {{math|2}} comes because there are two possible colors).

This holds true for any of the <math>C(n, r)</math> possible subsets we could have chosen, i.e. <math>i</math> ranges from {{math|1}} to <math>C(n,r)</math>. So we have that the sum of <math>E[X(S_r^i)]</math> over all <math>S_r^i</math> is

:<math>\sum_{i=1}^{C(n,r)} E[X(S_r^i)] = {n \choose r}2^{1-{r \choose 2}}.</math>

The sum of expectations is the expectation of the sum (''regardless'' of whether the variables are [[statistical independence|independent]]), so the expectation of the sum (the expected number of all monochromatic <math>r</math>-subgraphs) is

:<math>E\left[\sum_{i=1}^{C(n,r)} X(S_r^i)\right] = {n \choose r}2^{1-{r \choose 2}}.</math>

Consider what happens if this value is less than {{math|1}}.
Since the expected number of monochromatic {{mvar|r}}-subgraphs is strictly less than {{math|1}}, there exists a coloring in which the number of monochromatic {{mvar|r}}-subgraphs is strictly less than {{math|1}}. The number of monochromatic {{mvar|r}}-subgraphs in this coloring is a non-negative [[integer]], hence it must be {{math|0}} ({{math|0}} is the only non-negative integer less than {{math|1}}). It follows that if

:<math>E\left[\sum_{i=1}^{C(n,r)} X(S_r^i)\right] = {n \choose r}2^{1-{r \choose 2}} < 1</math>

(which holds, for example, for {{math|''n'' {{=}} 5}} and {{math|''r'' {{=}} 4}}), there must exist a coloring in which there are no monochromatic {{mvar|r}}-subgraphs.{{efn|The same fact can be proved without probability, using a simple counting argument:
* The total number of ''r''-subgraphs is <math>{n \choose r}</math>.
* Each ''r''-subgraph has <math>{r \choose 2}</math> edges and thus can be colored in <math>2^{r \choose 2}</math> different ways.
* Of these colorings, only 2 colorings are 'bad' for that subgraph (the colorings in which all edges are red or all edges are blue).
* Each bad coloring of a given subgraph extends to the remaining edges of the complete graph in <math>2^{{n \choose 2} - {r \choose 2}}</math> ways. Hence, the total number of colorings that are bad for some (at least one) subgraph is at most <math>2 {n \choose r} 2^{{n \choose 2} - {r \choose 2}}</math>.
* Hence, if <math>2 {n \choose r} 2^{{n \choose 2} - {r \choose 2}} < 2^{n \choose 2}</math>, which is equivalent to <math>{n \choose r}2^{1-{r \choose 2}} < 1</math>, there must be at least one coloring which is not 'bad' for any subgraph.
}}

By definition of the [[Ramsey number]], this implies that {{math|''R''(''r'', ''r'')}} must be bigger than {{mvar|n}}. In particular, {{math|''R''(''r'', ''r'')}} must grow at least [[exponential growth|exponentially]] with {{mvar|r}}.

A weakness of this argument is that it is entirely [[nonconstructive proof|nonconstructive]]. Even though it proves (for example) that almost every coloring of the complete graph on {{math|(1.1)<sup>''r''</sup>}} vertices contains no monochromatic {{mvar|r}}-subgraph, it gives no explicit example of such a coloring. The problem of finding such a coloring has been [[open problem|open]] for more than 50 years.
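
The case {{math|''n'' {{=}} 5}}, {{math|''r'' {{=}} 4}} mentioned above is small enough to check by exhaustive computation. The following Python sketch is only an illustration under stated assumptions: the function names (<code>expected_monochromatic</code>, <code>count_monochromatic</code>, <code>average_monochromatic</code>, <code>find_good_coloring</code>) are hypothetical and do not come from any cited source, and the brute-force search is feasible only for very small {{mvar|n}}. It evaluates the expected value exactly, compares it with the average over all colorings, and searches for a coloring with no monochromatic {{mvar|r}}-subgraph.

```python
from fractions import Fraction
from itertools import combinations
from math import comb


def expected_monochromatic(n, r):
    """Exact value of C(n, r) * 2^(1 - C(r, 2))."""
    return comb(n, r) * Fraction(2) ** (1 - comb(r, 2))


def all_colorings(n):
    """Yield every 2-coloring of the edges of the complete graph on n vertices.

    Each coloring is a dict mapping an edge (u, v), u < v, to 0 or 1.
    """
    edges = list(combinations(range(n), 2))
    for mask in range(2 ** len(edges)):
        yield {edge: (mask >> i) & 1 for i, edge in enumerate(edges)}


def count_monochromatic(n, r, coloring):
    """Count the r-vertex subsets whose edges all have the same color."""
    count = 0
    for subset in combinations(range(n), r):
        colors = {coloring[edge] for edge in combinations(subset, 2)}
        if len(colors) == 1:
            count += 1
    return count


def average_monochromatic(n, r):
    """Average number of monochromatic r-subgraphs over all colorings."""
    total = 0
    number_of_colorings = 0
    for coloring in all_colorings(n):
        total += count_monochromatic(n, r, coloring)
        number_of_colorings += 1
    return Fraction(total, number_of_colorings)


def find_good_coloring(n, r):
    """Return some coloring with no monochromatic r-subgraph, or None."""
    for coloring in all_colorings(n):
        if count_monochromatic(n, r, coloring) == 0:
            return coloring
    return None


if __name__ == "__main__":
    print(expected_monochromatic(5, 4))   # 5/32
    print(average_monochromatic(5, 4))    # 5/32
    print(find_good_coloring(5, 4) is not None)  # True
```

Because the expected value {{math|5/32}} is less than {{math|1}}, the search is guaranteed to succeed here. Unlike the probabilistic argument, the search enumerates up to <math>2^{n \choose 2}</math> colorings, which quickly becomes impractical as {{mvar|n}} grows.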
===Second example===
A 1959 paper of Erdős (see reference cited below) addressed the following problem in [[graph theory]]: given positive integers {{mvar|g}} and {{mvar|k}}, does there exist a graph {{mvar|G}} containing only [[cycle (graph theory)|cycles]] of length at least {{mvar|g}}, such that the [[chromatic number]] of {{mvar|G}} is at least {{mvar|k}}?

It can be shown that such a graph exists for any {{mvar|g}} and {{mvar|k}}, and the proof is reasonably simple. Let {{mvar|n}} be very large and consider a random graph {{mvar|G}} on {{mvar|n}} vertices, where every edge in {{mvar|G}} exists with probability {{math|''p'' {{=}} ''n''<sup>1/''g''−1</sup>}}. We show that with positive probability, {{mvar|G}} satisfies the following two properties:

:'''Property 1.''' {{mvar|G}} contains at most {{math|''n''/2}} cycles of length less than {{mvar|g}}.

'''Proof.''' Let {{mvar|X}} be the number of cycles of length less than {{mvar|g}}. The number of cycles of length {{mvar|i}} in the complete graph on {{mvar|n}} vertices is

:<math>\frac{n!}{2\cdot i \cdot (n-i)!} \le \frac{n^i}{2}</math>

[...]

when

:<math>y = \left \lceil \frac{n}{2k} \right \rceil\!.</math>

Thus, for sufficiently large {{mvar|n}}, property 2 holds with a probability of more than {{math|1/2}}.

For sufficiently large {{mvar|n}}, the probability that a graph from the distribution has both properties is positive, as the events for these properties cannot be disjoint (if they were, their probabilities would sum up to more than 1).

Here comes the trick: since {{mvar|G}} has these two properties, we can remove at most {{math|''n''/2}} vertices from {{mvar|G}} to obtain a new graph {{math|''G′''}} on <math>n'\geq n/2</math> vertices that contains only cycles of length at least {{mvar|g}}. We can see that this new graph has no independent set of size <math>\left \lceil \frac{n'}{k} \right\rceil</math>.
{{math|''G′''}} can only be partitioned into at least {{mvar|k}} independent sets, and, hence, has chromatic number at least {{mvar|k}}.

This result gives a hint as to why the computation of the [[chromatic number]] of a graph is so difficult: even when there are no local reasons (such as small cycles) for a graph to require many colors, the chromatic number can still be arbitrarily large.

==See also==
{{Portal|Mathematics}}
* [[Interactive proof system]]
* [[Las Vegas algorithm]]
* [[Incompressibility method]]
* [[Method of conditional probabilities]]
* [[Probabilistic proofs of non-probabilistic theorems]]
* [[Random graph]]

== Additional resources ==
* [https://ocw.mit.edu/courses/18-226-probabilistic-methods-in-combinatorics-fall-2022/ Probabilistic Methods in Combinatorics], MIT OpenCourseWare

==References==
* Alon, Noga; Spencer, Joel H. (2000). ''The probabilistic method'' (2ed). New York: Wiley-Interscience. {{isbn|0-471-37046-0}}.
* {{cite journal |doi=10.4153/CJM-1959-003-9 |author=Erdős, P. |year=1959 |title=Graph theory and probability |journal=Can. J. Math. |volume=11 |pages=34–38 |mr=0102081 |s2cid=122784453 |doi-access=free }}
* {{cite journal |doi=10.4153/CJM-1961-029-9 |author=Erdős, P. |year=1961 |title=Graph theory and probability, II |journal=Can. J. Math. |volume=13 |pages=346–352 |mr=0120168 |url=https://www.cambridge.org/core/journals/canadian-journal-of-mathematics/article/graph-theory-and-probability-ii/38F46DC839201178C2EEC2B14B1647BC |citeseerx=10.1.1.210.6669 |s2cid=15134755 }}
* [[Jiří Matoušek (mathematician)|J. Matoušek]], J. Vondrak. [https://web.archive.org/web/20120205002452/http://kam.mff.cuni.cz/~matousek/prob-ln-2pp.ps.gz The Probabilistic Method]. Lecture notes.
* Alon, N and Krivelevich, M (2006). [http://www.math.tau.ac.il/~nogaa/PDFS/epc7.pdf Extremal and Probabilistic Combinatorics]
* Elishakoff I., Probabilistic Methods in the Theory of Structures: Random Strength of Materials, Random Vibration, and Buckling, World Scientific, Singapore, {{ISBN|978-981-3149-84-7}}, 2017
* Elishakoff I., Lin Y.K. and Zhu L.P., Probabilistic and Convex Modeling of Acoustically Excited Structures, Elsevier Science Publishers, Amsterdam, 1994, VIII + 296 pp.; {{ISBN|0-444-81624-0}}
{{Reflist}}

==Footnotes==
{{notelist}}

{{Authority control}}

[[Category:Combinatorics]]
[[Category:Mathematical proofs]]
[[Category:Probabilistic arguments|method]]