Revision as of 04:41, 28 April 2024 edit Citation bot (talk \| contribs) Bots 5,861,326 edits Added date. \| Use this bot. Report bugs. \| Suggested by Abductive \| Category:Articles containing potentially dated statements from April 2024 \| #UCB_Category 814/957 ← Previous edit		Revision as of 22:33, 24 August 2024 edit undo 2a06:c701:9619:8a00:c20:4208:f8cb:a855 (talk) →Sub-cubic algorithms: added another paper related to the table Next edit →
Line 172: [[File:MatrixMultComplexity svg.svg\|thumb\|400px\|right\|Improvement of estimates of exponent {{math\|ω}} over time for the computational complexity of matrix multiplication <math>O(n^\omega)</math>.]] Algorithms exist that provide better running times than the straightforward ones. The first to be discovered was [[Strassen algorithm\|Strassen's algorithm]], devised by [[Volker Strassen]] in 1969 and often referred to as "fast matrix multiplication". It is based on a way of multiplying two {{math\|2 × 2}}-matrices which ~~requires~~require only 7 multiplications (instead of the usual 8), at the expense of several additional addition and subtraction operations. Applying this recursively gives an algorithm with a multiplicative cost of <math>O( n^{\log_{2}7}) \approx O(n^{2.807})</math>. Strassen's algorithm is more complex, and the [[numerical stability]] is reduced compared to the naïve algorithm,<ref>{{Citation \| last1=Miller \| first1=Webb \| title=Computational complexity and numerical stability \| citeseerx = 10.1.1.148.9947 \| year=1975 \| journal=SIAM News \| volume=4 \| issue=2 \| pages=97–107 \| doi=10.1137/0204009}}</ref> but it is faster in cases where {{math\|''n'' > 100}} or so<ref name="skiena"/> and appears in several libraries, such as [[Basic Linear Algebra Subprograms\|BLAS]].<ref>{{cite book \|last1=Press \|first1=William H. \|last2=Flannery \|first2=Brian P. \|last3=Teukolsky \|first3=Saul A. \|author3-link=Saul Teukolsky \|last4=Vetterling \|first4=William T. \|title=Numerical Recipes: The Art of Scientific Computing \|publisher=[[Cambridge University Press]] \|edition=3rd \|isbn=978-0-521-88068-8 \|year=2007 \|page=[https://archive.org/details/numericalrecipes00pres_033/page/n131 108]\|title-link=Numerical Recipes }}</ref> It is very useful for large matrices over exact domains such as [[finite field]]s, where numerical stability is not an issue. Since Strassen's algorithm is actually used in practical numerical software and computer algebra systems improving on the constants hidden in the [[big O notation]] has its merits. A table ~~which~~that compares key aspects of the improved version based on recursive multiplication of 2x2-block matrices via 7 block matrix multiplications follows. As usual <math>n</math> gives the dimensions of the matrix and <math>M</math> designates the memory size. {\| class="wikitable" Line 186: \|- \| 2017 \|\| Karstadt, Schwartz<ref>{{cite conference \|url=https://dl.acm.org/doi/10.1145/3087556.3087579 \|title=Matrix Multiplication, a Little Faster \|last1=Karstadt \|first1=Elaye \|last2=Schwartz \|first2=Oded \|date=July 2017 \|publisher= \|book-title=Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and Architectures \|pages=101–110 \|conference=SPAA '17 \|doi=10.1145/3087556.3087579}}</ref> \|\| 7 \|\| 12 \|\| <math>5n^{\log_2 7}-4n^2+3n^2\log_2n</math> \|\| <math>4\left(\frac{\sqrt{3}n}{\sqrt{M}}\right)^{\log_2 7}\cdot M-12n^2 +3n^2\cdot\log_2\left(\frac{\sqrt{2}n}{\sqrt{M}}\right) +5M</math> \|- \| 2023 \|\| Schwartz, Vaknin<ref>{{cite conference \|url=https://doi.org/10.1137/22M1502719 \|title=Pebbling Game and Alternative Basis for High Performance Matrix Multiplication \|last1=Schwartz \|first1=Oded \|last2=Vaknin \|first2=Noa \|date=2023 \|publisher= \|book-title=SIAM Journal on Scientific Computing \|pages=C277-C303 \|doi=10.1137/22M1502719}}</ref> \|\| 7 \|\| 12 \|\| <math>5n^{\log_2 7}-4n^2+1.5n^2\log_2n</math> \|\| <math>4\left(\frac{\sqrt{3}n}{\sqrt{M}}\right)^{\log_2 7}\cdot M-12n^2 +1.5n^2\cdot\log_2\left(\frac{\sqrt{2}n}{\sqrt{M}}\right) +5M</math> \|} It is known that a Strassen-like algorithm with a 2x2-block matrix step requires at least 7 block matrix multiplications. In 1976 Probert<ref>{{cite journal \|last=Probert \|first=Robert L. \|title=On the additive complexity of matrix multiplication \|journal=SIAM J. Comput. \|volume=5 \|issue= 2 \|pages=187–203 \|year=1976 \|doi=10.1137/0205016}}</ref> showed that such an algorithm requires at least 15 additions (including subtractions), however, a hidden assumption was that the blocks and the 2x2-~~blockmatrix~~block matrix are represented in the same basis. Karstadt and Schwartz computed in different bases and traded 3 additions for less expensive basis transformations. They also proved that one cannot go below 12 additions per step using different bases. In subsequent work Beniamini et el.<ref>{{cite arXiv\|eprint=2008.03759 \|last1=Beniamini \|first1=Gal \|last2=Cheng \|first2=Nathan \|last3=Holtz \|first3=Olga \|last4=Karstadt \|first4=Elaye \|last5=Schwartz \|first5=Oded \|title=Sparsifying the Operators of Fast Matrix Multiplication Algorithms \|date=2020 \|class=cs.DS }}</ref> applied this base change trick to more general decompositions than 2x2-~~blockmatrices~~block matrices and improved the leading constant for their run times. It is an open question in [[theoretical computer science]] how well Strassen's algorithm can be improved in terms of [[Time complexity\|asymptotic complexity]]. The ''matrix multiplication exponent'', usually denoted <math>\omega</math>, is the smallest real number for which any <math>n\times n</math> matrix over a field can be multiplied together using <math>n^{\omega + o(1)}</math> field operations. The current best bound on <math>\omega</math> is <math>\omega < 2.371552</math>, by [[Virginia Vassilevska Williams\|Williams]], Xu, Xu, and Zhou.<ref name="apr24w"/><ref name="aw20" /> This algorithm, like all other recent algorithms in this line of research, is a generalization of the Coppersmith–Winograd algorithm, which was given by [[Don Coppersmith]] and [[Shmuel Winograd]] in 1990.<ref name="coppersmith">{{Citation\|doi=10.1016/S0747-7171(08)80013-2 \|title=Matrix multiplication via arithmetic progressions \|url=http://www.cs.umd.edu/~gasarch/TOPICS/ramsey/matrixmult.pdf \|year=1990 \|last1=Coppersmith \|first1=Don \|last2=Winograd \|first2=Shmuel \|journal=Journal of Symbolic Computation \|volume=9\|issue=3\|pages=251\|doi-access=free }}</ref> The conceptual idea of these algorithms are similar to Strassen's algorithm: a way is devised for multiplying two {{math\|''k'' × ''k''}}-matrices with fewer than {{math\|''k''<sup>3</sup>}} multiplications, and this technique is applied recursively. However, the constant coefficient hidden by the [[Big O notation]] is so large that these algorithms are only worthwhile for matrices that are too large to handle on present-day computers.<ref>{{citation

Matrix multiplication algorithm: Difference between revisions