Average-case complexity

{{redirect|AvgP|other uses|avgp (disambiguation)}}
{{main|Probabilistic analysis of algorithms}}
In [[computational complexity theory]], the '''average-case complexity''' of an [[algorithm]] is the amount of some computational resource (typically time) used by the algorithm, averaged over all possible inputs. It is frequently contrasted with [[worst-case complexity]], which considers the maximal complexity of the algorithm over all possible inputs.
 
There are three primary motivations for studying average-case complexity.<ref name="gol07">{{Cite journal |last1=Goldreich |first1=Oded |last2=Vadhan |first2=Salil |date=December 2007 |title=Special Issue On Worst-case Versus Average-case Complexity Editors' Foreword |journal=Computational Complexity |language=en |volume=16 |issue=4 |pages=325–330 |doi=10.1007/s00037-007-0232-y |issn=1016-3328|doi-access=free }}</ref> First, although some problems may be intractable in the worst case, the inputs which elicit this behavior may rarely occur in practice, so the average-case complexity may be a more accurate measure of an algorithm's performance. Second, average-case complexity analysis provides tools and techniques for generating hard instances of problems, which can be utilized in areas such as [[cryptography]] and [[derandomization]]. Third, average-case complexity makes it possible to identify the most efficient algorithm in practice among algorithms of equivalent best-case complexity (for instance, [[Quicksort#Formal analysis|Quicksort]]).
For [[deterministic algorithm]]s, the '''average-case complexity''' ('''expected time complexity''') with respect to a given input distribution is the [[expected value|expected]] running time of the algorithm over that distribution.
 
Average-case analysis requires a notion of an "average" input to an algorithm, which leads to the problem of devising a [[probability distribution]] over inputs. Alternatively, a [[randomized algorithm]] can be used. The analysis of such algorithms leads to the related notion of an '''expected complexity'''.<ref name="clrs"/>{{rp|28}}
[[Leonid Levin]] presented the motivation for studying average-case complexity as follows:<ref>[http://www.cs.bu.edu/fac/lnd/research/hard.htm "Intractability Concepts for Concrete Problems", [[Leonid Levin]]]</ref>
:"Many combinatorial problems (called search or NP problems) have easy methods of checking solutions for correctness. Examples: finding factors of a long integer, or proofs of math theorems or short fast programs generating a given string. Such problems can be stated as a task to invert a given, easy to compute, function (multiplication or extraction of a theorem from its proof). In 1971 I noticed that many such problems can be proven to be as hard as the Tiling problem (which, I knew for a while, was universal, i.e. at least as hard as any search problem)...
:"A common misinterpretation of these results was that all NP-complete problems are hard, no chance for good algorithms. On this basis some such problems generated much hope in cryptography: the adversary would be helpless. Karp and others noticed that this was naive. While worst instances of NP-complete problems defeat our algorithms, such instances may be extremely rare. In fact, fast on average algorithms were found for a great many NP-complete problems. If all NP problems are easy on average, the P=?NP question becomes quite academic. Even if exponentially hard instances exist, those we could ever find might all be easy. Some problems (like factoring) seem hard for typical instances, but nothing is proven at all to support this (crucial, e.g., for cryptography) belief. These issues turned out to be subtle and it was not clear how a theory could distinguish intrinsically hard on average problems. [Levin 86], [Venkatesan, Levin STOC-88], [Impagliazzo, Levin, FOCS-90] proposed such a theory with first average case intractability results. Random (under uniform distribution) instances of some problems are now known to be as hard as random instances of any NP-problem under any samplable distribution."
 
==History and background==
 
The average-case performance of algorithms has been studied since modern notions of computational efficiency were developed in the 1950s. Much of this initial work focused on problems for which worst-case polynomial time algorithms were already known.<ref name="bog06">{{Cite journal |last1=Bogdanov |first1=Andrej |last2=Trevisan |first2=Luca |date=2006 |title=Average-Case Complexity |url=http://www.nowpublishers.com/article/Details/TCS-004 |journal=Foundations and Trends in Theoretical Computer Science |language=en |volume=2 |issue=1 |pages=1–106 |doi=10.1561/0400000004 |issn=1551-305X|url-access=subscription }}</ref> In 1973, [[Donald Knuth]]<ref name="knu73">{{cite book
| last = Knuth | first = Donald | title = [[The Art of Computer Programming]] | volume = 3 | publisher = Addison-Wesley | date = 1973
}}</ref> published Volume 3 of ''[[The Art of Computer Programming]]'', which extensively surveys the average-case performance of algorithms for problems solvable in worst-case polynomial time, such as sorting and median-finding.
 
An efficient algorithm for [[NP-complete|{{math|'''NP'''}}-complete]] problems is generally characterized as one which runs in polynomial time for all inputs; this is equivalent to requiring efficient worst-case complexity. However, an algorithm which is inefficient on a "small" number of inputs may still be efficient for "most" inputs that occur in practice. Thus, it is desirable to study the properties of these algorithms where the average-case complexity may differ from the worst-case complexity and find methods to relate the two.
 
The fundamental notions of average-case complexity were developed by [[Leonid Levin]] in 1986 when he published a one-page paper<ref name="levin86">{{Cite journal |last=Levin |first=Leonid A. |date=February 1986 |title=Average Case Complete Problems |url=http://epubs.siam.org/doi/10.1137/0215020 |journal=SIAM Journal on Computing |language=en |volume=15 |issue=1 |pages=285–286 |doi=10.1137/0215020 |issn=0097-5397|url-access=subscription }}</ref> defining average-case complexity and completeness while giving an example of a complete problem for {{math|'''distNP'''}}, the average-case analogue of [[NP (complexity)|{{math|'''NP'''}}]].
 
==Definitions==
 
===Efficient average-case complexity===
 
The first task is to precisely define what is meant by an algorithm which is efficient "on average". An initial attempt might define an efficient average-case algorithm as one which runs in expected polynomial time over all possible inputs. Such a definition has various shortcomings; in particular, it is not robust to changes in the computational model. For example, suppose algorithm {{mvar|A}} runs in time {{math|''t''<sub>''A''</sub>(''x'')}} on input {{mvar|x}} and algorithm {{mvar|B}} runs in time {{math|''t''<sub>''A''</sub>(''x'')<sup>2</sup>}} on input {{mvar|x}}; that is, {{mvar|B}} is quadratically slower than {{mvar|A}}. Intuitively, any definition of average-case efficiency should capture the idea that {{mvar|A}} is efficient-on-average if and only if {{mvar|B}} is efficient on-average. Suppose, however, that the inputs are drawn randomly from the uniform distribution of strings with length {{mvar|n}}, and that {{mvar|A}} runs in time {{math|''n''<sup>2</sup>}} on all inputs except the string {{math|1<sup>''n''</sup>}} for which {{mvar|A}} takes time {{math|2<sup>''n''</sup>}}. Then it can be easily checked that the expected running time of {{mvar|A}} is polynomial but the expected running time of {{mvar|B}} is exponential.<ref name="bog06" />
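
Concretely, under the uniform distribution on inputs of length {{mvar|n}}, the two expected running times in this example can be computed directly:

:<math>
\operatorname{E}[t_A] = \frac{2^n - 1}{2^n}\, n^2 + \frac{1}{2^n}\, 2^n \leq n^2 + 1,
\qquad
\operatorname{E}[t_B] = \frac{2^n - 1}{2^n}\, n^4 + \frac{1}{2^n}\, 2^{2n} \geq 2^n,
</math>

so {{mvar|A}} runs in expected polynomial time while {{mvar|B}} does not, even though {{mvar|B}} is only quadratically slower than {{mvar|A}} on every input.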
 
To create a more robust definition of average-case efficiency, it makes sense to allow an algorithm {{mvar|A}} to run longer than polynomial time on some inputs but the fraction of inputs on which {{mvar|A}} requires larger and larger running time becomes smaller and smaller. This intuition is captured in the following formula for average polynomial running time, which balances the polynomial trade-off between running time and fraction of inputs:
 
:<math>
\Pr_{x \in_R D_n} \left[t_A(x) \geq t \right] \leq \frac{p(n)}{t^\epsilon}
</math>
 
for some polynomial {{mvar|p}} and every {{math|''n'', ''t'' &gt; 0}}, where {{math|''t''<sub>''A''</sub>(''x'')}} denotes the running time of algorithm {{mvar|A}} on input {{mvar|x}}, and {{mvar|ε}} is a positive constant value.<ref name="wangsurvey">{{cite book |last1=Wang |first1=Jie |url=https://www.cs.uml.edu/~wang/acc-forum/avgcomp.pdf |title=Complexity Theory: Retrospective II |date=1997 |publisher=Springer Science & Business Media |editor-last1=Hemaspaandra |editor-first1=Lane A. |volume=2 |pages=295–328 |chapter=Average-case computational complexity theory |editor-last2=Selman |editor-first2=Alan L.}}</ref> Alternatively, this can be written as
 
:<math>
E_{x \in_R D_n} \left[ \frac{t_{A}(x)^{\epsilon}}{n} \right] \leq C
</math>
 
for some constants {{mvar|C}} and {{mvar|ε}}, where {{math|''n'' {{=}} {{abs|''x''}}}}.<ref name="ab09">{{cite book |last1=Arora |first1=Sanjeev |title=Computational Complexity: A Modern Approach |last2=Barak |first2=Boaz |date=2009 |publisher=Cambridge University Press |___location=Cambridge; New York |chapter=18. Average case complexity: Levin’s theory}}</ref> In other words, an algorithm {{mvar|A}} has good average-case complexity if, after running for {{math|''t''<sub>''A''</sub>(''n'')}} steps, {{mvar|A}} can solve all but a {{math|{{sfrac|''n''<sup>''c''</sup>|(''t''<sub>''A''</sub>(''n''))<sup>''ε''</sup>}}}} fraction of inputs of length {{mvar|n}}, for some {{math|''ε'', ''c'' &gt; 0}}.<ref name="bog06"/>
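
The two formulations are related; for instance, if the expectation bound above holds, then applying [[Markov's inequality]] to the non-negative random variable {{math|''t''<sub>''A''</sub>(''x'')<sup>''ε''</sup>/''n''}} recovers a tail bound of the first form with {{math|''p''(''n'') {{=}} ''Cn''}}:

:<math>
\Pr_{x \in_R D_n} \left[ t_A(x) \geq t \right] = \Pr_{x \in_R D_n} \left[ \frac{t_A(x)^{\epsilon}}{n} \geq \frac{t^{\epsilon}}{n} \right] \leq \frac{C n}{t^{\epsilon}}.
</math>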
 
===Distributional problem===
The next step is to define the "average" input to a particular problem. This is achieved by associating the inputs of each problem with a particular probability distribution. That is, an "average-case" problem consists of a language {{mvar|L}} and an associated probability distribution {{mvar|D}} which forms the pair {{math|(''L'', ''D'')}}.<ref name="ab09"/> The two most common classes of distributions which are allowed are:
 
#Polynomial-time computable distributions ({{math|'''P'''}}-computable): these are distributions for which it is possible to compute the cumulative probability of any given input {{mvar|x}}. More formally, given a probability distribution {{mvar|μ}} and a string {{math|''x'' &isin; {{mset|0, 1}}<sup>''n''</sup>}}, it is possible to compute the value <math>\mu(x) = \sum\limits_{y \in \{0, 1\}^n : y \leq x} \Pr[y]</math> in polynomial time. This implies that {{math|Pr[''x'']}} is also computable in polynomial time.
#Polynomial-time samplable distributions ({{math|'''P'''}}-samplable): these are distributions from which it is possible to draw random samples in polynomial time.
 
These two formulations, while similar, are not equivalent. If a distribution is {{math|'''P'''}}-computable it is also {{math|'''P'''}}-samplable, but the converse is not true if [[P (complexity)|{{math|'''P'''}}]] ≠ {{math|'''P'''<sup>'''#P'''</sup>}}.<ref name="ab09"/>
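
As a simple illustration of both definitions, consider the uniform distribution over strings of length {{mvar|n}}, which is both {{math|'''P'''}}-computable and {{math|'''P'''}}-samplable. The following sketch is illustrative only (it is not taken from the cited sources and assumes length-{{mvar|n}} binary strings are ordered by numeric value); it computes the cumulative probability of a string and draws a sample:

<syntaxhighlight lang="python">
import random

def cumulative(x: str) -> float:
    """mu(x) = Pr[y <= x] for y uniform over {0,1}^n; the number of strings
    not exceeding x is its numeric value plus one, so this takes polynomial time."""
    n = len(x)
    return (int(x, 2) + 1) / 2 ** n

def point_probability(x: str) -> float:
    """Pr[x], recovered as the difference of two consecutive cumulative values."""
    n = len(x)
    return cumulative(x) - int(x, 2) / 2 ** n   # equals 2 ** -n

def sample(n: int) -> str:
    """P-samplable: draw a uniform length-n string with n independent coin flips."""
    return "".join(random.choice("01") for _ in range(n))
</syntaxhighlight>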
 
===AvgP and distNP===
A distributional problem {{math|(''L'', ''D'')}} is in the complexity class {{math|'''AvgP'''}} if there is an efficient average-case algorithm for {{mvar|L}}, as defined above. The class {{math|'''AvgP'''}} is occasionally called {{math|'''distP'''}} in the literature.<ref name="ab09"/>
 
A distributional problem {{math|(''L'', ''D'')}} is in the complexity class {{math|'''distNP'''}} if {{mvar|L}} is in {{math|'''NP'''}} and {{mvar|D}} is {{math|'''P'''}}-computable. When {{mvar|L}} is in {{math|'''NP'''}} and {{mvar|D}} is {{math|'''P'''}}-samplable, {{math|(''L'', ''D'')}} belongs to {{math|'''sampNP'''}}.<ref name="ab09"/>
 
Together, {{math|'''AvgP'''}} and {{math|'''distNP'''}} define the average-case analogues of {{math|'''P'''}} and {{math|'''NP'''}}, respectively.<ref name="ab09"/>
 
==Reductions between distributional problems==
Let {{math|(''L'', ''D'')}} and {{math|(''L&prime;'', ''D&prime;'')}} be two distributional problems. {{math|(''L'', ''D'')}} average-case reduces to {{math|(''L&prime;'', ''D&prime;'')}} (written {{math|(''L'', ''D'') ≤<sub>'''AvgP'''</sub> (''L&prime;'', ''D&prime;'')}}) if there is a function {{mvar|f}} that, on any input {{mvar|x}} of length {{mvar|n}}, can be computed in time polynomial in {{mvar|n}} and satisfies:
 
#(Correctness) {{math|''x'' ∈ ''L''}} if and only if {{math|''f''(''x'') ∈ ''L&prime;''}}
#(Domination) There are polynomials {{mvar|p}} and {{mvar|m}} such that, for every {{mvar|n}} and {{mvar|y}}, <math>\sum\limits_{x: f(x) = y} D_n(x) \leq p(n)D'_{m(n)}(y)</math>
 
The domination condition enforces the notion that if problem {{math|(''L'', ''D'')}} is hard on average, then {{math|(''L&prime;'', ''D&prime;'')}} is also hard on average. Intuitively, a reduction should provide a way to solve an instance {{mvar|x}} of problem {{mvar|L}} by computing {{math|''f''(''x'')}} and feeding the output to an algorithm {{mvar|A&prime;}} which solves {{mvar|L&prime;}} in polynomial time on average. Without the domination condition this may fail: {{mvar|A&prime;}} may take super-polynomial time on a small (under {{mvar|D&prime;}}) set of inputs, while {{mvar|f}} maps a much larger (under {{mvar|D}}) set of instances of {{mvar|L}} onto that set, so that the composed algorithm no longer runs in polynomial time on average. The domination condition ensures that, via {{mvar|f}}, such strings occur only polynomially more often under {{mvar|D}} than they do under {{mvar|D&prime;}}.<ref name="wangsurvey"/>
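
To see why this suffices (ignoring the polynomial cost of computing {{mvar|f}} itself), suppose an algorithm {{mvar|A&prime;}} for {{mvar|L&prime;}} satisfies a tail bound of the form <math>\Pr_{y \in_R D'_m}[t_{A'}(y) \geq t] \leq q(m)/t^{\epsilon}</math> for some polynomial {{mvar|q}}. Then the algorithm {{math|''A''(''x'') {{=}} ''A&prime;''(''f''(''x''))}} satisfies

:<math>
\Pr_{x \in_R D_n} \left[ t_{A'}(f(x)) \geq t \right]
= \sum_{y :\, t_{A'}(y) \geq t} \; \sum_{x :\, f(x) = y} D_n(x)
\leq p(n) \sum_{y :\, t_{A'}(y) \geq t} D'_{m(n)}(y)
\leq \frac{p(n)\, q(m(n))}{t^{\epsilon}},
</math>

which is again a polynomial in {{mvar|n}} divided by {{math|''t''<sup>''ε''</sup>}}, so {{mvar|A}} is efficient on average with respect to {{mvar|D}}.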
 
===DistNP-complete problems===
The average-case analogue to {{math|'''NP'''}}-completeness is {{math|'''distNP'''}}-completeness. A distributional problem {{math|(''L&prime;'', ''D&prime;'')}} is {{math|'''distNP'''}}-complete if {{math|(''L&prime;'', ''D&prime;'')}} is in {{math|'''distNP'''}} and for every {{math|(''L'', ''D'')}} in {{math|'''distNP'''}}, {{math|(''L'', ''D'')}} is average-case reducible to {{math|(''L&prime;'', ''D&prime;'')}}.<ref name="ab09" />
 
An example of a {{math|'''distNP'''}}-complete problem is the Bounded Halting Problem, {{math|(''BH'', ''D'')}}, for a suitable {{math|'''P'''}}-computable distribution {{mvar|D}}, defined as follows:

<math>BH = \{(M, x, 1^t) : M \text{ is a non-deterministic Turing machine that accepts } x \text{ in at most } t \text{ steps}\}</math><ref name="ab09"/>
 
In his original paper, Levin showed an example of a distributional tiling problem that is average-case {{math|'''NP'''}}-complete.<ref name="levin86"/> A survey of known {{math|'''distNP'''}}-complete problems is available online.<ref name="wangsurvey"/>
 
One area of active research involves finding new {{math|'''distNP'''}}-complete problems. However, finding such problems can be complicated due to a result of Gurevich which shows that any distributional problem with a flat distribution cannot be {{math|'''distNP'''}}-complete unless [[EXP|{{math|'''EXP'''}}]] = [[NEXP|{{math|'''NEXP'''}}]].<ref name="gur87">{{Cite book |last=Gurevich |first=Yuri |chapter=Complete and incomplete randomized NP problems |date=October 1987 |title=28th Annual Symposium on Foundations of Computer Science (SFCS 1987) |pages=111–117 |doi=10.1109/SFCS.1987.14|isbn=0-8186-0807-2 }}</ref> (A flat distribution {{mvar|μ}} is one for which there exists an {{math|''ε'' &gt; 0}} such that for any {{mvar|x}}, {{math|''μ''(''x'') ≤ 2<sup>−{{abs|''x''}}<sup>''ε''</sup></sup>}}.) A result by Livne shows that all natural {{math|'''NP'''}}-complete problems have {{math|'''DistNP'''}}-complete versions.<ref name="livne06">{{Cite journal |last=Livne |first=Noam |date=December 2010 |title=All Natural NP-Complete Problems Have Average-Case Complete Versions |url=http://link.springer.com/10.1007/s00037-010-0298-9 |journal=Computational Complexity |language=en |volume=19 |issue=4 |pages=477–499 |doi=10.1007/s00037-010-0298-9 |issn=1016-3328|url-access=subscription }}</ref> However, the goal of finding a natural distributional problem that is {{math|'''DistNP'''}}-complete has not yet been achieved.<ref name="gol97">{{Citation |last=Goldreich |first=Oded |title=Notes on Levin's Theory of Average-Case Complexity |date=2011 |work=Studies in Complexity and Cryptography. Miscellanea on the Interplay between Randomness and Computation |series=Lecture Notes in Computer Science |volume=6650 |pages=233–247 |editor-last=Goldreich |editor-first=Oded |url=http://link.springer.com/10.1007/978-3-642-22670-0_21 |access-date=2025-05-21 |place=Berlin, Heidelberg |publisher=Springer Berlin Heidelberg |doi=10.1007/978-3-642-22670-0_21 |isbn=978-3-642-22669-4|url-access=subscription }}</ref>
 
==Applications==
 
===Sorting algorithms===
As mentioned above, much early work relating to average-case complexity focused on problems for which polynomial-time algorithms already existed, such as sorting. For example, many sorting algorithms which utilize randomness, such as [[Quicksort]], have a worst-case running time of {{math|O(''n''<sup>2</sup>)}} but an average-case running time of {{math|O(''n'' log(''n''))}}, where {{mvar|n}} is the length of the input to be sorted.<ref name="clrs">{{cite book | last1 = Cormen | first1 = Thomas H. | last2 = Leiserson | first2 = Charles E. | last3 = Rivest | first3 = Ronald L. | last4 = Stein | first4 = Clifford | title = Introduction to Algorithms | edition = 3rd | date = 2009 | orig-year = 1990 | publisher = MIT Press and McGraw-Hill | isbn=978-0-262-03384-8 |url=https://www.worldcat.org/title/311310321 |oclc=311310321}}</ref>
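
A minimal sketch of such a randomized variant of quicksort (illustrative only, not taken from the cited text) is:

<syntaxhighlight lang="python">
import random

def quicksort(a):
    """Randomized quicksort: expected O(n log n) comparisons over the random
    pivot choices, although the worst case remains O(n^2)."""
    if len(a) <= 1:
        return a
    pivot = random.choice(a)                 # uniformly random pivot
    less  = [x for x in a if x < pivot]
    equal = [x for x in a if x == pivot]
    more  = [x for x in a if x > pivot]
    return quicksort(less) + equal + quicksort(more)
</syntaxhighlight>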
 
===Cryptography===
For most problems, average-case complexity analysis is undertaken to find efficient algorithms for a problem that is considered difficult in the worst case. In cryptographic applications, however, the opposite is true: the worst-case complexity is irrelevant; we instead want a guarantee that every algorithm which "breaks" the cryptographic scheme is inefficient on average.<ref name="katz07">{{Cite book |last1=Katz |first1=Jonathan |title=Introduction to modern cryptography |last2=Lindell |first2=Yehuda |date=2021 |publisher=CRC Press |isbn=978-1-351-13303-6 |edition=3 |series=Chapman & Hall/CRC cryptography and network security series |___location=Boca Raton, FL}}</ref>{{pn|date=May 2025}}
 
Thus, all secure cryptographic schemes rely on the existence of [[one-way functions]].<ref name="bog06"/> Although the existence of one-way functions is still an open problem, many candidate one-way functions are based on hard problems such as [[integer factorization]] or computing the [[discrete log]]. Note that basing a candidate function on an {{math|'''NP'''}}-complete problem is not sufficient, since this would only guarantee that there is likely no efficient algorithm for solving the problem in the worst case; what we actually want is a guarantee that no efficient algorithm can solve the problem over random inputs (i.e. the average case). In fact, both the integer factorization and discrete log problems are in {{math|'''NP''' ∩ }}[[coNP|{{math|'''coNP'''}}]], and are therefore not believed to be {{math|'''NP'''}}-complete.<ref name="ab09"/> The fact that all of cryptography is predicated on the existence of average-case intractable problems in {{math|'''NP'''}} is one of the primary motivations for studying average-case complexity.
 
==Other results==
[[Yao's principle]], from a 1977 paper by [[Andrew Yao]], shows that for broad classes of computational problems, average-case complexity for a hard input distribution and a deterministic algorithm adapted to that distribution is the same thing as expected complexity for a fast randomized algorithm and its worst-case input.<ref>{{citation
| last = Yao | first = Andrew | author-link = Andrew Yao
| contribution = Probabilistic computations: Toward a unified measure of complexity
| doi = 10.1109/SFCS.1977.24
| pages = 222–227
| title = Proceedings of the 18th IEEE Symposium on Foundations of Computer Science (FOCS)
| year = 1977}}</ref>
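
In one common formulation (the "easy" direction of the principle), for every probability distribution {{mvar|D}} over the inputs of a given size and every randomized algorithm {{mvar|R}},

:<math>
\min_{A} \, \operatorname{E}_{x \sim D} \left[ c(A, x) \right] \;\leq\; \max_{x} \, \operatorname{E} \left[ c(R, x) \right],
</math>

where the minimum is over deterministic algorithms {{mvar|A}}, {{math|''c''(·, ·)}} denotes the cost measure (for example, running time), and the expectation on the right is over the internal randomness of {{mvar|R}}.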
 
In 1990, Impagliazzo and Levin showed that if there is an efficient average-case algorithm for a {{math|'''distNP'''}}-complete problem under the uniform distribution, then there is an average-case algorithm for every problem in {{math|'''NP'''}} under any polynomial-time samplable distribution.<ref name="imp90">R. Impagliazzo and L. Levin, "No Better Ways to Generate Hard NP Instances than Picking Uniformly at Random," in Proceedings of the 31st IEEE Symposium on Foundations of Computer Science, pp. 812–821, 1990.</ref> Applying this theory to natural distributional problems remains an outstanding open question.<ref name="bog06"/>
 
In 1992, Ben-David et al. showed that if all languages in {{math|'''distNP'''}} have good-on-average decision algorithms, they also have good-on-average search algorithms. Further, they show that this conclusion holds under a weaker assumption: if every language in {{math|'''NP'''}} is easy on average for decision algorithms with respect to the uniform distribution, then it is also easy on average for search algorithms with respect to the uniform distribution.<ref name="bd92">{{Cite book |last1=Ben-David |first1=S. |last2=Chor |first2=B. |last3=Goldreich |first3=O. |chapter=On the theory of average case complexity |date=1989 |title=Proceedings of the twenty-first annual ACM symposium on Theory of computing - STOC '89 |chapter-url=http://portal.acm.org/citation.cfm?doid=73007.73027 |language=en |publisher=ACM Press |pages=204–216 |doi=10.1145/73007.73027 |isbn=978-0-89791-307-2}}</ref> Thus, cryptographic one-way functions can exist only if there are {{math|'''distNP'''}} problems over the uniform distribution that are hard on average for decision algorithms.
 
In 1993, Feigenbaum and Fortnow showed that it is not possible to prove, under non-adaptive random reductions, that the existence of a good-on-average algorithm for a {{math|'''distNP'''}}-complete problem under the uniform distribution implies the existence of worst-case efficient algorithms for all problems in {{math|'''NP'''}}.<ref name="ff93">{{Cite journal |last1=Feigenbaum |first1=Joan |last2=Fortnow |first2=Lance |date=October 1993 |title=Random-Self-Reducibility of Complete Sets |url=http://epubs.siam.org/doi/10.1137/0222061 |journal=SIAM Journal on Computing |language=en |volume=22 |issue=5 |pages=994–1005 |doi=10.1137/0222061 |issn=0097-5397|url-access=subscription }}</ref> In 2003, Bogdanov and Trevisan generalized this result to arbitrary non-adaptive reductions.<ref name="bog03">{{Cite journal |last1=Bogdanov |first1=Andrej |last2=Trevisan |first2=Luca |date=January 2006 |title=On Worst-Case to Average-Case Reductions for NP Problems |url=https://epubs.siam.org/doi/10.1137/S0097539705446974 |journal=SIAM Journal on Computing |language=en |volume=36 |issue=4 |pages=1119–1159 |doi=10.1137/S0097539705446974 |issn=0097-5397|url-access=subscription }}</ref> These results show that it is unlikely that any association can be made between average-case complexity and worst-case complexity via reductions.<ref name="bog06"/>
 
==See also==
* [[Probabilistic analysis of algorithms]]
* [[NP-complete problems]]
* [[Worst-case complexity]]
* [[Amortized analysis]]
* [[Best, worst and average case]]
 
==References==
{{Reflist|30em}}
 
==Further reading==
Pedagogical presentations:
 
* {{Cite book |last=Impagliazzo |first=R. |chapter=A personal view of average-case complexity |date=1995 |title=Proceedings of Structure in Complexity Theory. Tenth Annual IEEE Conference |publisher=IEEE Comput. Soc. Press |pages=134–147 |doi=10.1109/SCT.1995.514853 |isbn=978-0-8186-7052-7}}
* {{cite book |last1=Wang |first1=Jie |url=https://www.cs.uml.edu/~wang/acc-forum/avgcomp.pdf |title=Complexity Theory: Retrospective II |date=1997 |publisher=Springer Science & Business Media |editor-last1=Hemaspaandra |editor-first1=Lane A. |volume=2 |pages=295–328 |chapter=Average-case computational complexity theory |editor-last2=Selman |editor-first2=Alan L.}}
* {{Citation |last=Goldreich |first=Oded |title=Average Case Complexity, Revisited |date=2011 |work=Studies in Complexity and Cryptography. Miscellanea on the Interplay between Randomness and Computation |series=Lecture Notes in Computer Science |volume=6650 |pages=422–450 |editor-last=Goldreich |editor-first=Oded |url=https://www.wisdom.weizmann.ac.il/~oded/COL/aver.pdf |place=Berlin, Heidelberg |publisher=Springer Berlin Heidelberg |doi=10.1007/978-3-642-22670-0_29 |isbn=978-3-642-22669-4}}
* {{cite book |last1=Arora |first1=Sanjeev |title=Computational Complexity: A Modern Approach |last2=Barak |first2=Boaz |date=2009 |publisher=Cambridge University Press |___location=Cambridge; New York |chapter=18. Average case complexity: Levin’s theory}}
 
The literature of average-case complexity includes the following work:
*{{citation
| last = Levin | first = Leonid | author-link = Leonid Levin
| doi = 10.1137/0215020
| issue = 1
| journal = SIAM Journal on Computing
| pages = 285–286
| title = Average case complete problems
| volume = 15
| year = 1986}}
*{{citation
| last = Franco | first = John
| doi = 10.1016/0020-0190(86)90051-7
| issue = 2
| journal = Information Processing Letters
| pages = 103–106
| title = On the probabilistic performance of algorithms for the satisfiability problem
| volume = 23
| year = 1986}}.
*{{citation
| last1 = Flajolet | first1 = Philippe | author1-link = Philippe Flajolet
| last2 = Vitter | first2 = J. S. | author2-link = Jeffrey Vitter
| date = August 1987
| publisher = Institut National de Recherche en Informatique et en Automatique, B.P. 105-78153 Le Chesnay Cedex France
| series = Tech. Report
| title = Average-case analysis of algorithms and data structures}}.
*{{citation
| last1 = Gurevich | first1 = Yuri | author1-link = Yuri Gurevich
| last2 = Shelah | first2 = Saharon | author2-link = Saharon Shelah
| doi = 10.1137/0216034
| issue = 3
| journal = SIAM Journal on Computing
| pages = 486–502
| title = Expected computation time for [[Hamiltonian path problem]]
| volume = 16
| year = 1987| citeseerx = 10.1.1.359.8982}}.
*{{citation
| last1 = Ben-David | first1 = Shai
| last2 = Chor | first2 = Benny | author2-link = Benny Chor
| last3 = Goldreich | first3 = Oded | author3-link = Oded Goldreich
| last4 = Luby | first4 = Michael | author4-link = Michael Luby
| contribution = On the theory of average case complexity
| pages = 204–216
| publisher = [[Association for Computing Machinery]]
| title = [[Symposium on Theory of Computing|Proc. 21st Annual Symposium on Theory of Computing]]
| year = 1989}}.
*{{citation
| last = Gurevich | first = Yuri | author-link = Yuri Gurevich
| doi = 10.1016/0022-0000(91)90007-R
| issue = 3
| journal = Journal of Computer and System Sciences
| pages = 346–398
| title = Average case completeness
| volume = 42
| year = 1991| hdl = 2027.42/29307
| hdl-access = free
}}. See also [http://research.microsoft.com/~gurevich/Opera/76.pdf 1989 draft].
*{{citation
| last1 = Selman | first1 = B.
| last2 = Mitchell | first2 = D.
| last3 = Levesque | first3 = H.
| contribution = Hard and easy distributions of SAT problems
| pages = 459–465
| title = Proc. 10th National Conference on Artificial Intelligence
| year = 1992}}.
*{{citation
| last1 = Schuler | first1 = Rainer
| last2 = Yamakami | first2 = Tomoyuki
| contribution = Structural average case complexity
| pages = 128–139
| publisher = Springer-Verlag
| series = Lecture Notes in Computer Science
| title = Proc. Foundations of Software Technology and Theoretical Computer Science
| volume = 652
| year = 1992}}.
*{{citation
| last1 = Reischuk | first1 = Rüdiger
| last2 = Schindelhauer | first2 = Christian
| contribution = Precise average case complexity
| pages = 650–661
| title = Proc. 10th Annual Symposium on Theoretical Aspects of Computer Science
| year = 1993}}.
*{{citation
| last1 = Venkatesan | first1 = R.
| last2 = Rajagopalan | first2 = S.
| contribution = Average case intractability of matrix and Diophantine problems
| pages = 632–642
| publisher = [[Association for Computing Machinery]]
| title = [[Symposium on Theory of Computing|Proc. 24th Annual Symposium on Theory of Computing]]
| year = 1992}}.
*{{citation
| last1 = Cox | first1 = Jim
| last2 = Ericson | first2 = Lars
| last3 = Mishra | first3 = Bud
| series = Technical Report TR1995-711 | publisher = New York University Computer Science Department
| title = The average case complexity of multilevel syllogistic
| url = http://www.cs.nyu.edu/csweb/Research/TechReports/TR1995-711/TR1995-711.pdf
| year = 1995}}.
 
[[Category:Computational complexity theory]]
[[Category:Analysis of algorithms]]
[[Category:Randomized algorithms]]