Directly applying the mathematical definition of matrix multiplication gives an algorithm that [[Analysis of algorithms|takes time]] on the order of {{math|''n''<sup>3</sup>}} [[Field (mathematics)|field]] operations to multiply two {{math|''n'' × ''n''}} matrices over that field ({{math|Θ(''n''<sup>3</sup>)}} in [[big O notation]]). Better asymptotic bounds on the time required to multiply matrices have been known since Strassen's algorithm appeared in the 1960s, but it is still unknown what the optimal time is (i.e., what the [[Computational complexity theory|complexity of the problem]] is). {{As of|2020|12}}, the matrix multiplication algorithm with the best asymptotic complexity runs in {{math|O(''n''<sup>2.3728596</sup>)}} time, given by Josh Alman and [[Virginia Vassilevska Williams]].<ref name="aw20">{{Citation | last1=Alman | first1=Josh | last2=Williams | first2=Virginia Vassilevska | contribution=A Refined Laser Method and Faster Matrix Multiplication | year = 2020 | arxiv=2010.05846 | title = 32nd Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2021) |url=https://www.siam.org/conferences/cm/program/accepted-papers/soda21-accepted-papers}}</ref><ref>{{Cite web|last=Hartnett|first=Kevin|title=Matrix Multiplication Inches Closer to Mythic Goal|url=https://www.quantamagazine.org/mathematicians-inch-closer-to-matrix-multiplication-goal-20210323/|access-date=2021-04-01|website=Quanta Magazine|language=en}}</ref> However, this algorithm is a [[galactic algorithm]]: because of its large constant factors, it cannot be realized practically.
 
== Algorithms ==

Surprisingly, algorithms exist that provide better running times than the straightforward one. The first to be discovered was [[Strassen algorithm|Strassen's algorithm]], devised by [[Volker Strassen]] in 1969 and often referred to as "fast matrix multiplication". It is based on a way of multiplying two {{math|2 × 2}}-matrices which requires only 7 multiplications (instead of the usual 8), at the expense of several additional addition and subtraction operations. Applying this recursively gives an algorithm whose running time satisfies the recurrence <math>T(n) = 7\,T(n/2) + O(n^2)</math> and is therefore <math>O( n^{\log_{2}7}) \approx O(n^{2.807})</math>. Strassen's algorithm is more complex than the naïve algorithm, and its [[numerical stability]] is reduced,<ref>{{Citation | last1=Miller | first1=Webb | title=Computational complexity and numerical stability | citeseerx = 10.1.1.148.9947 | year=1975 | journal=SIAM News | volume=4 | issue=2 | pages=97–107 | doi=10.1137/0204009}}</ref> but it is faster in cases where {{math|''n'' > 100}} or so<ref name="skiena"/> and appears in several libraries, such as [[Basic Linear Algebra Subprograms|BLAS]].<ref>{{cite book |last1=Press |first1=William H. |last2=Flannery |first2=Brian P. |last3=Teukolsky |first3=Saul A. |author3-link=Saul Teukolsky |last4=Vetterling |first4=William T. |title=Numerical Recipes: The Art of Scientific Computing |publisher=[[Cambridge University Press]] |edition=3rd |isbn=978-0-521-88068-8 |year=2007 |page=[https://archive.org/details/numericalrecipes00pres_033/page/n131 108]|title-link=Numerical Recipes }}</ref> It is very useful for large matrices over exact domains such as [[finite field]]s, where numerical stability is not an issue.
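
To make the recursion concrete, the following is a minimal Python sketch of Strassen's scheme (illustrative only: it assumes square matrices whose size is a power of two, and it recurses down to {{math|1 × 1}} blocks, whereas practical implementations switch to the straightforward method below some cutoff size):

<syntaxhighlight lang="python">
import numpy as np

def strassen(A, B):
    """Multiply square matrices whose size is a power of two,
    using Strassen's seven-multiplication recursion (sketch)."""
    n = A.shape[0]
    if n == 1:
        return A * B
    k = n // 2
    A11, A12, A21, A22 = A[:k, :k], A[:k, k:], A[k:, :k], A[k:, k:]
    B11, B12, B21, B22 = B[:k, :k], B[:k, k:], B[k:, :k], B[k:, k:]
    # The seven recursive products (instead of the usual eight).
    M1 = strassen(A11 + A22, B11 + B22)
    M2 = strassen(A21 + A22, B11)
    M3 = strassen(A11, B12 - B22)
    M4 = strassen(A22, B21 - B11)
    M5 = strassen(A11 + A12, B22)
    M6 = strassen(A21 - A11, B11 + B12)
    M7 = strassen(A12 - A22, B21 + B22)
    # Recombine the products into the four quadrants of A @ B.
    return np.block([[M1 + M4 - M5 + M7, M3 + M5],
                     [M2 + M4,           M1 - M2 + M3 + M6]])
</syntaxhighlight>

Each level of the recursion replaces one product of size {{mvar|n}} by seven products of size {{math|''n''/2}} plus a constant number of block additions, which is exactly the recurrence given above.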
 
The matrix multiplication [[algorithm]] that results from the definition requires, in the [[worst-case complexity|worst case]], {{tmath|n^3}} multiplications of scalars and {{tmath|(n-1)n^2}} additions for computing the product of two square {{math|''n''×''n''}} matrices. Its [[computational complexity]] is therefore {{tmath|O(n^3)}}, in a [[model of computation]] for which the scalar operations take constant time (in practice, this is the case for [[floating point]] numbers, but not for integers).
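
For illustration, here is a direct Python transcription of the definition (a sketch, not how optimized libraries compute products); the operation counts above can be read off the loops:

<syntaxhighlight lang="python">
def naive_multiply(A, B):
    """Multiply two square matrices directly from the definition.

    For n x n inputs this performs exactly n**3 scalar multiplications
    and (n - 1) * n**2 scalar additions.
    """
    n = len(A)
    C = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            s = A[i][0] * B[0][j]        # 1 multiplication
            for k in range(1, n):
                s += A[i][k] * B[k][j]   # n - 1 multiplications and additions
            C[i][j] = s
    return C
</syntaxhighlight>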
 
The [[greatest lower bound]] for the exponent of matrix multiplication is generally called {{tmath|\omega}}. It is trivially true that {{tmath|\omega \ge 2}}, because one must read the {{tmath|n^2}} elements of a matrix in order to multiply it with another matrix. Thus {{tmath|2\le \omega < 2.373}}. It is unknown whether {{tmath|\omega > 2}}. The largest known lower bound for matrix-multiplication complexity is {{math|Ω(''n''<sup>2</sup> log(''n''))}}, for a restricted kind of [[Arithmetic circuit complexity|arithmetic circuit]], and is due to [[Ran Raz]].<ref>
{{Cite journal
| last=Raz
| first=Ran
| date=January 2003
| title=On the Complexity of Matrix Product
| journal=SIAM Journal on Computing
| volume=32
| issue=5
| pages=1356–1369
| doi=10.1137/s0097539702402147
| issn=0097-5397
}}</ref>
 
[[Freivalds' algorithm]] is a simple [[Monte Carlo algorithm]] that, given matrices {{mvar|A}}, {{mvar|B}} and {{mvar|C}}, verifies in {{math|Θ(''n''<sup>2</sup>)}} time whether {{math|''AB'' {{=}} ''C''}}.
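
A minimal Python sketch of the idea (assuming exact entries such as integers, with matrices given as lists of lists): multiply by a random 0/1 vector on both sides, which costs only three matrix–vector products per round; the number of rounds below is an illustrative parameter.

<syntaxhighlight lang="python">
import random

def freivalds(A, B, C, rounds=10):
    """Monte Carlo check of whether A @ B == C.

    Each round costs three Theta(n^2) matrix-vector products; a wrong
    product is rejected with probability at least 1 - 2**(-rounds).
    """
    n = len(A)
    for _ in range(rounds):
        r = [random.randint(0, 1) for _ in range(n)]
        Br = [sum(B[i][j] * r[j] for j in range(n)) for i in range(n)]
        ABr = [sum(A[i][j] * Br[j] for j in range(n)) for i in range(n)]
        Cr = [sum(C[i][j] * r[j] for j in range(n)) for i in range(n)]
        if ABr != Cr:
            return False   # definitely AB != C
    return True            # probably AB == C
</syntaxhighlight>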
 
=== Matrix multiplication exponent ===
 
[[File:MatrixMultComplexity svg.svg|thumb|400px|right|Improvement of estimates of exponent {{math|ω}} over time for the computational complexity of matrix multiplication <math>O(n^\omega)</math>.]]
 
It is an open question in [[theoretical computer science]] how well Strassen's algorithm can be improved in terms of [[Time complexity|asymptotic complexity]]. The ''matrix multiplication exponent'', usually denoted <math>\omega</math>, is the smallest real number for which any two <math>n\times n</math> matrices over a field can be multiplied together using <math>n^{\omega + o(1)}</math> field operations. The current best bound on <math>\omega</math> is <math>\omega < 2.3728596</math>, by Josh Alman and [[Virginia Vassilevska Williams]].<ref name="aw20"/> This algorithm, like all other recent algorithms in this line of research, is a generalization of the Coppersmith–Winograd algorithm, which was given by [[Don Coppersmith]] and [[Shmuel Winograd]] in 1990, was the best matrix multiplication algorithm until 2010, and has an asymptotic complexity of {{math|''O''(''n''<sup>2.375477</sup>)}}.<ref name="coppersmith">{{Citation|doi=10.1016/S0747-7171(08)80013-2 |title=Matrix multiplication via arithmetic progressions |url=http://www.cs.umd.edu/~gasarch/TOPICS/ramsey/matrixmult.pdf |year=1990 |last1=Coppersmith |first1=Don |last2=Winograd |first2=Shmuel |journal=Journal of Symbolic Computation |volume=9|issue=3|pages=251}}</ref> The conceptual idea of these algorithms is similar to that of Strassen's algorithm: a way is devised for multiplying two {{math|''k'' × ''k''}}-matrices with fewer than {{math|''k''<sup>3</sup>}} multiplications, and this technique is applied recursively. However, the constant coefficient hidden by the [[Big O notation]] is so large that these algorithms are only worthwhile for matrices that are too large to handle on present-day computers.<ref>{{citation
| last = Iliopoulos
| first = Costas S.
| doi = 10.1137/0218045
| issue = 4
| journal = SIAM Journal on Computing
| mr = 1004789
| quote = The Coppersmith–Winograd algorithm is not practical, due to the very large hidden constant in the upper bound on the number of multiplications required.
| pages = 658–669
| title = Worst-case complexity bounds on algorithms for computing the canonical structure of finite abelian groups and the Hermite and Smith normal forms of an integer matrix
| url = http://www.williamstein.org/home/wstein/www/home/pernet/Papers/Hermite/Iliopoulos88.pdf
| volume = 18
| year = 1989
| citeseerx = 10.1.1.531.9309
| access-date = 2015-01-16
| archive-url = https://web.archive.org/web/20140305182030/http://www.williamstein.org/home/wstein/www/home/pernet/Papers/Hermite/Iliopoulos88.pdf
| archive-date = 2014-03-05
| url-status = dead
}}</ref><ref name="robinson">{{Citation | last=Robinson | first=Sara | title=Toward an Optimal Algorithm for Matrix Multiplication | url=https://archive.siam.org/pdf/news/174.pdf | date=November 2005 | journal=SIAM News | volume=38 | issue=9 | quote=Even if someone manages to prove one of the conjectures—thereby demonstrating that {{math|1=''ω'' = 2}}—the wreath product approach is unlikely to be applicable to the large matrix problems that arise in practice. [...] the input matrices must be astronomically large for the difference in time to be apparent.}}</ref>
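
To make the recursive step quantitative: if two {{math|''k'' × ''k''}}-matrices can be multiplied using {{math|''m''}} scalar multiplications by a scheme that does not rely on commutativity of the entries (so that it can be applied to matrix blocks), then recursion multiplies {{math|''n'' × ''n''}}-matrices in <math>O(n^{\log_k m})</math> operations, yielding the bound <math>\omega \le \log_k m</math>. Strassen's case {{math|1=''k'' = 2}}, {{math|1=''m'' = 7}} gives <math>\omega \le \log_2 7 \approx 2.807</math>.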
 
Since any algorithm for multiplying two {{math|''n'' × ''n''}}-matrices has to process all {{math|2''n''<sup>2</sup>}} entries, there is an asymptotic lower bound of {{math|Ω(''n''<sup>2</sup>)}} operations. Raz proved a lower bound of {{math|Ω(''n''<sup>2</sup> log(''n''))}} for bounded coefficient arithmetic circuits over the real or complex numbers.<ref>{{cite journal | last1 = Raz | first1 = Ran | author-link = Ran Raz | year = 2002| title = On the complexity of matrix product | journal = Proceedings of the Thirty-fourth Annual ACM Symposium on Theory of Computing | pages = 144 | doi = 10.1145/509907.509932 | isbn = 1581134959 | s2cid = 9582328 }}</ref>
 
 
{| class="wikitable"
|+ Progress on the exponent {{math|ω}} of matrix multiplication
! Year !! Bound on ω !! Authors
|-
| 1969 || 2.8074 || [[Volker Strassen|Strassen]]
|-
| 1981 || 2.496 || [[Don Coppersmith|Coppersmith]], [[Shmuel Winograd|Winograd]]<ref>{{cite book |doi=10.1109/SFCS.1981.27}}</ref>
|-
| 1986 || 2.479 || [[Volker Strassen|Strassen]]<ref>{{cite book |doi=10.1109/SFCS.1986.52}}</ref>
|-
| 1990 || 2.3755 || [[Don Coppersmith|Coppersmith]], [[Shmuel Winograd|Winograd]]<ref name="coppersmith"/>
|-
| 2010 || 2.3737 || Stothers
|-
| 2013 || 2.3729 || [[Virginia Vassilevska Williams|Williams]]<ref name="Williams.2014"/>
|-
| 2014 || 2.3728639 || Le Gall
|-
| 2020 || 2.3728596 || Alman, [[Virginia Vassilevska Williams|Williams]]<ref name="aw20"/>
|}
 
Rather surprisingly, the {{tmath|O(n^3)}} complexity of the straightforward algorithm is not optimal, as shown in 1969 by [[Volker Strassen]], who provided an algorithm, now called [[Strassen's algorithm]], with a complexity of <math>O( n^{\log_{2}7}) \approx O(n^{2.8074}).</math>
The exponent appearing in the complexity of matrix multiplication has been improved several times,
leading to the [[Coppersmith–Winograd algorithm]] with a complexity of {{math|''O''(''n''<sup>2.3755</sup>)}} (1990).
This algorithm was slightly improved in 2010 by Stothers to a complexity of {{math|''O''(''n''<sup>2.3737</sup>)}},
in 2013 by [[Virginia Vassilevska Williams]] to {{math|''O''(''n''<sup>2.3729</sup>)}},<ref name="Williams.2014"/>
and in 2014 by François Le Gall to {{math|''O''(''n''<sup>2.3728639</sup>)}}.
It was further refined in 2020 by Josh Alman and Virginia Vassilevska Williams to the current best complexity of {{math|''O''(''n''<sup>2.3728596</sup>)}}.
 
=== Group theory reformulation of matrix multiplication algorithms ===