Content deleted Content added
Cleaned up using AutoEd |
Polygnotus (talk | contribs) m →Further reading: clean up, replaced: explaind → explained |
||
(43 intermediate revisions by 21 users not shown) | |||
Line 1:
In [[mathematics]] and [[computer science]], the '''method of conditional probabilities'''<ref name='spencer'>{{Citation
In [[mathematics]] and [[computer science]], the [[probabilistic method]] is used to prove the existence of mathematical objects with desired combinatorial properties. The proofs are probabilistic — they work by showing that a random object, chosen from some probability distribution, has the desired properties with positive probability. Consequently, they are [[nonconstructive proof|nonconstructive]] — they don't explicitly describe an efficient method for computing the desired objects.▼
| title=Ten lectures on the probabilistic method▼
| last=Spencer|first=Joel H.|authorlink=Joel Spencer▼
| year=1987▼
| publisher=SIAM▼
| isbn=978-0-89871-325-1}}</ref><ref name='raghavan'>▼
| title= Probabilistic construction of deterministic algorithms: approximating packing integer programs▼
| first = Prabhakar | last = Raghavan | authorlink=Prabhakar Raghavan▼
| journal=[[Journal of Computer and System Sciences]]▼
| volume=37▼
| issue=2▼
| pages=130–143▼
| year = 1988▼
}}
</ref> is a systematic method for converting [[non-constructive]] probabilistic existence proofs into efficient [[Deterministic algorithm|deterministic algorithms]] that explicitly construct the desired object.<ref>[http://algnotes.info/on/background/probabilistic-method/method-of-conditional-probabilities/ The probabilistic method — method of conditional probabilities], blog entry by Neal E. Young, accessed 19/04/2012 and 14/09/2023.</ref>
▲
The
The method is particularly relevant in the context of [[randomized rounding]] (which uses the probabilistic method to design [[approximation algorithm]]s).
When applying the method of conditional probabilities, the technical term '''pessimistic estimator''' refers to a quantity used in place of the true conditional probability (or conditional expectation) underlying the proof.
== Overview ==
: ''We first show the existence of a provably good approximate solution using the [[probabilistic method]]... [We then] show that the probabilistic existence proof can be converted, in a very precise sense, into a deterministic approximation algorithm.''
[[File:Method of conditional probabilities.png|thumb|450px
To apply the method to a probabilistic proof, the randomly chosen object in the proof must be choosable by a random experiment that consists of a sequence of "small" random choices.
Line 23 ⟶ 42:
: ''Probabilistic proof.'' If the three coins are flipped randomly, the expected number of tails is 1.5. Thus, there must be some outcome (way of flipping the coins) so that the number of tails is at least 1.5. Since the number of tails is an integer, in such an outcome there are at least 2 tails. ''QED''
In this example the random experiment consists of flipping three fair coins. The experiment is illustrated by the rooted tree in the adjacent diagram
To apply the method of conditional probabilities, one focuses on the ''conditional probability of failure, given the choices so far'' as the experiment proceeds step by step.
Line 38 ⟶ 57:
== Efficiency ==
In a typical application of the method, the goal is to be able to implement the resulting deterministic process by a reasonably efficient algorithm (
In the ideal case, given a partial state (a node in the tree), the conditional probability of failure (the label on the node) can be efficiently and exactly computed. (The example above is like this.) If this is so, then the algorithm can select the next node to go to by computing the conditional probabilities at each of the children of the current node, then moving to any child whose conditional probability is less than 1. As discussed above, there is guaranteed to be such a node.
Line 44 ⟶ 63:
Unfortunately, in most applications, the conditional probability of failure is not easy to compute efficiently. There are two standard and related techniques for dealing with this:
=== Using a conditional expectation ===
In this case, to keep the conditional probability of failure below 1, it suffices to keep the conditional expectation of ''Q'' below (or above) the threshold. To do this, instead of computing the conditional probability of failure, the algorithm computes the conditional expectation of ''Q'' and proceeds accordingly: at each interior node, there is some child whose conditional expectation is at most (at least) the node's conditional expectation; the algorithm moves from the current node to such a child, thus keeping the conditional expectation below (above) the threshold.
=== Using a pessimistic estimator ===
== Example using conditional expectations ==
Line 56 ⟶ 77:
=== Max-Cut Lemma ===
Given any undirected [[Graph (discrete mathematics)|graph]] ''G'' = (''V'', ''E''), the [[Max cut]] problem is to color each vertex of the graph with one of two colors (say black or white) so as to maximize the number of edges whose endpoints have different colors. (Say such an edge is ''cut''.)
'''Max-Cut Lemma:'''
<blockquote>'''Probabilistic proof.''' Color each vertex black or white by flipping a fair coin. By calculation, for any edge e in ''E'', the probability that it is cut is 1/2. Thus, by [[Expected value#Linearity|linearity of expectation]], the expected number of edges cut is |''E''|/2. Thus, there exists a coloring that cuts at least |''E''|/2 edges. ''QED''</blockquote>▼
▲Color each vertex black or white by flipping a fair coin. By calculation, for any edge e in ''E'', the probability that it is cut is 1/2. Thus, by [[Expected value#Linearity|linearity of expectation]], the expected number of edges cut is |''E''|/2. Thus, there exists a coloring that cuts at least |''E''|/2 edges. ''QED''
=== The method of conditional probabilities with conditional expectations ===
Line 70 ⟶ 89:
Next, replace the random choice at each step by a deterministic choice, so as to keep the conditional probability of failure, given the vertices colored so far, below 1. (Here ''failure'' means that finally fewer than |''E''|/2 edges are cut.)
In this case, the conditional probability of failure is not easy to calculate. Indeed, the original proof did not calculate the probability of failure directly; instead, the proof worked by showing that the expected number of cut edges was at least |''E''|/2.
Let random variable ''Q'' be the number of edges cut. To keep the conditional probability of failure below 1, it suffices to keep the conditional expectation of ''Q'' at or above the threshold |''E''|/2.
Given that some of the vertices are colored already, what is this conditional expectation? Following the logic of the original proof, the conditional expectation of the number of cut edges is
Line 84 ⟶ 103:
1. For each vertex ''u'' in ''V'' (in any order):
2. Consider the already-colored neighboring vertices of ''u''.
3. Among these vertices, if more are black than white, then color ''u'' white.
4. Otherwise, color ''u'' black.
Because of its derivation, this deterministic algorithm is guaranteed to cut at least half the edges of the given graph.
== Example using pessimistic estimators ==
Line 100 ⟶ 119:
: Any graph ''G'' = (''V'', ''E'') contains an [[Independent set (graph theory)|independent set]] of size at least |''V''|/(''D''+1), where ''D'' = 2|''E''|/|''V''| is the average degree of the graph.
=== Probabilistic proof of
Consider the following random process for constructing an independent set ''S'':
1. Initialize ''S'' to be the empty set.
2. For each vertex ''u'' in ''V'' in random order:
3. If no neighbors of ''u'' are in ''S'', add ''u'' to ''S''
4. Return ''S''.
Clearly the process computes an independent set. Any vertex ''u'' that is considered before all of its neighbors will be added to ''S''. Thus, letting ''d''(''u'') denote the degree of ''u'', the probability that ''u'' is added to ''S'' is at least
: <math>\sum_{u\in V} \frac{1}{d(u)+1} ~\ge~ \frac{|V|}{D+1}.</math>
(The inequality above follows because 1/(''x''+1) is [[Convex function|convex]] in ''x'', so the left-hand side is minimized, subject to the sum of the degrees being fixed at 2|''E''|, when each ''d''(''u'') = ''D'' = 2|''E''|/|''V''|.) ''QED''
Line 117 ⟶ 136:
In this case, the random process has |''V''| steps. Each step considers some not-yet considered vertex ''u'' and adds ''u'' to ''S'' if none of its neighbors have yet been added. Let random variable ''Q'' be the number of vertices added to ''S''. The proof shows that ''E''[''Q''] ≥ |''V''|/(''D''+1).
We will replace each random step by a deterministic step that keeps the conditional expectation of ''Q'' at or above |''V''|/(''D''+1). This will ensure a successful outcome, that is, one in which the independent set ''S'' has size at least |''V''|/(''D''+1), realizing the bound in
Given that the first t steps have been taken, let ''S''<
: <math>|S^{(t)}| ~+~ \sum_{w\in R^{(t)}} \frac{1}{d(w)+1}. </math>
Let ''Q''<
The proof showed that the pessimistic estimator is initially at least |''V''|/(''D''+1). (That is, ''Q''<
Let ''u'' be the vertex considered by the algorithm in the next ((''t''+1)-st) step.
If ''u'' already has a neighbor in ''S'', then ''u'' is not added to ''S'' and (by inspection of ''Q''<
By calculation, if ''u'' is chosen randomly from the remaining vertices, the expected increase in the pessimistic estimator is non-negative. ['''The calculation
Thus, there must exist some choice of ''u'' that keeps the pessimistic estimator from decreasing.
Line 139 ⟶ 158:
The algorithm below chooses each vertex ''u'' to maximize the resulting pessimistic estimator. By the previous considerations, this keeps the pessimistic estimator from decreasing and guarantees a successful outcome.
Below, ''N''<
1. Initialize ''S'' to be the empty set.
2. While there exists a not-yet-considered vertex ''u'' with no neighbor in ''S'':
3. Add such a vertex ''u'' to ''S'' where ''u'' minimizes <math>\sum_{w\in N^{(t)}(u)\cup\{u\}} \frac{1}{d(w)+1}</math>.
4. Return ''S''.
Line 151 ⟶ 170:
1. Initialize ''S'' to be the empty set.
2. While there exists a vertex ''u'' in the graph with no neighbor in ''S'':
3. Add such a vertex ''u'' to ''S'', where ''u'' minimizes ''d''(''u'') (the initial degree of ''u'').
4. Return ''S''.
1. Initialize ''S'' to be the empty set.
2. While the remaining graph is not empty:
3. Add a vertex ''u'' to ''S'', where ''u'' has minimum degree in the ''remaining'' graph.
4. Delete ''u'' and all of
5. Return ''S''.
Line 164 ⟶ 183:
: <math>1 - \sum_{w\in N^{(t)}(u)\cup\{u\}} \frac{1}{d(w)+1},</math>
where ''N''<
For the first algorithm, the net increase is non-negative because, by the choice of ''u'',
Line 176 ⟶ 195:
: <math>\sum_{w\in N^{(t)}(u)\cup\{u\}} \frac{1}{d(w)+1} \le (d'(u)+1) \frac{1}{d'(u)+1} = 1 </math>,
where
== See also ==
Line 182 ⟶ 201:
* [[Probabilistic method]]
* [[Derandomization]]
* [[Randomized rounding]]
{{no footnotes|date=June 2012}}
== References ==
{{reflist}}
▲* {{Citation
▲| title=Ten lectures on the probabilistic method
▲| last=Spencer|first=Joel H.|authorlink=Joel Spencer
▲| url=http://books.google.com/books?id=Kz0B0KkwfVAC
▲| year=1987
▲| publisher=SIAM
▲| isbn=978-0-89871-325-1}}
▲| title= Probabilistic construction of deterministic algorithms: approximating packing integer programs
▲| first = Prabhakar | last = Raghavan | authorlink=Prabhakar Raghavan
▲| journal=[[Journal of Computer and System Sciences]]
▲| volume=37
▲| issue=2
▲| pages=130–143
▲| year = 1988
▲| doi = 10.1016/0022-0000(88)90003-7}}.
== Further reading ==
The method of conditional rounding is explained in several textbooks:
* {{Cite book |first1=Noga |last1= Alon |authorlink1=Noga Alon
| first2=Joel |last2=Spencer |authorlink2=Joel Spencer
| series=Wiley-Interscience Series in Discrete Mathematics and Optimization
| title=The probabilistic method
| url=
| year=2008
| edition=
| publisher=John Wiley and Sons
| ___location=Hoboken, NJ
| isbn=978-0-470-17020-5
| pages=250 et seq
| mr=2437651 }} (cited pages in 2nd edition, {{ISBN|9780471653981}})
* {{Cite book
| first1=Rajeev |last1=Motwani |authorlink1=Rajeev Motwani
| first2=Prabhakar |last2=Raghavan |authorlink2=Prabhakar Raghavan
| title=Randomized algorithms
|date=25 August 1995 | url=
| publisher=[[Cambridge University Press]]
| pages=
| isbn=978-0-521-47465-8}}
* {{Citation
| first=Vijay |last=Vazirani
| authorlink=Vijay Vazirani
| title=Approximation algorithms
|date=5 December 2002
| url=
| publisher=[[Springer Verlag]]
| pages=
| isbn=978-3-540-65367-7}}
<!-- |url=
<!-- book references generated by http://reftag.appspot.com -->
== External links ==
* [http://
{{DEFAULTSORT:Method Of Conditional Probabilities}}
[[Category:Approximation algorithms]]
[[Category:Probabilistic arguments]]
|