In [[theoretical computer science]], an active area of research is determining [[Analysis of algorithms|how efficiently]] [[matrix multiplication]] can be performed. [[Matrix multiplication algorithm]]s are a central subroutine in theoretical and [[numerical algorithm|numerical]] algorithms for [[numerical linear algebra]] and [[optimization]], so determining the minimal time the operation requires is of major practical relevance.
Directly applying the mathematical definition of matrix multiplication gives an algorithm that requires on the order of {{math|''n''<sup>3</sup>}} [[Field (mathematics)|field]] operations to multiply two {{math|''n'' × ''n''}} matrices over that field ({{math|Θ(''n''<sup>3</sup>)}} in [[big O notation]]). Surprisingly, algorithms exist that provide better running times than this straightforward "schoolbook algorithm". The first to be discovered was [[Strassen algorithm|Strassen's algorithm]], devised by [[Volker Strassen]] in 1969 and often referred to as "fast matrix multiplication".
{{As of|2020|12}}, the matrix multiplication algorithm with the best [[Time complexity|asymptotic complexity]] runs in {{math|O(''n''<sup>2.3728596</sup>)}} time, given by Josh Alman and [[Virginia Vassilevska Williams]],<ref name="aw20">
{{Citation
 | last1 = Alman
 | first1 = Josh
 | last2 = Williams
 | first2 = Virginia Vassilevska
 | contribution = A Refined Laser Method and Faster Matrix Multiplication
 | title = Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA 2021)
 | year = 2021
 | arxiv = 2010.05846
}}</ref><ref name="robinson">{{Citation | last=Robinson | first=Sara | title=Toward an Optimal Algorithm for Matrix Multiplication | url=https://archive.siam.org/pdf/news/174.pdf | date=November 2005 | journal=SIAM News | volume=38 | issue=9 | quote=Even if someone manages to prove one of the conjectures—thereby demonstrating that {{math|1=''ω'' = 2}}—the wreath product approach is unlikely to be applicable to the large matrix problems that arise in practice. [...] the input matrices must be astronomically large for the difference in time to be apparent.}}</ref>
== Simple algorithms ==
If ''A'', ''B'' are {{math|''n'' × ''n''}} matrices over a field, then their product ''AB'' is also an {{math|''n'' × ''n''}} matrix over that field, defined entrywise as
:<math>
(AB)_{ij} = \sum_{k = 1}^n A_{ik} B_{kj}.
</math>
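For example, for {{math|2 × 2}} matrices the definition expands each entry of the product into a sum of two products:
:<math>
\begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}
\begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix}
=
\begin{pmatrix} a_{11}b_{11} + a_{12}b_{21} & a_{11}b_{12} + a_{12}b_{22} \\ a_{21}b_{11} + a_{22}b_{21} & a_{21}b_{12} + a_{22}b_{22} \end{pmatrix}.
</math>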
=== Schoolbook algorithm ===
{{For|implementation techniques (in particular parallel and distributed algorithms)|Matrix multiplication algorithm}}
The simplest approach to computing the product of two {{math|''n'' × ''n''}} matrices ''A'' and ''B'' is to compute the arithmetic expressions coming from the definition of matrix multiplication. In pseudocode:
 input is A and B, both n by n matrices
 initialize C to be an n by n matrix of all zeros
 for i from 1 to n:
     for j from 1 to n:
         for k from 1 to n:
             C[i][j] = C[i][j] + A[i][k]*B[k][j]
 output C (as A*B)

The matrix multiplication [[algorithm]] that results from the definition requires, in the [[worst-case complexity|worst case]], {{tmath|n^3}} multiplications of scalars and {{tmath|(n-1)n^2}} additions to compute the product of two square {{math|''n'' × ''n''}} matrices. Its [[computational complexity]] is therefore {{tmath|O(n^3)}}, in a [[model of computation]] for which the scalar operations take constant time (in practice, this is the case for [[floating point]] numbers, but not necessarily for integers).
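The pseudocode translates directly into a runnable program. The following Python version is an illustrative sketch (the function name <code>schoolbook_multiply</code> and the operation counters are additions for exposition); it also tallies the scalar operations, matching the counts stated above.

<syntaxhighlight lang="python">
def schoolbook_multiply(A, B):
    """Multiply two n-by-n matrices directly from the definition,
    counting the scalar multiplications and additions performed."""
    n = len(A)
    C = [[0] * n for _ in range(n)]
    muls = adds = 0
    for i in range(n):
        for j in range(n):
            # Inner product of row i of A with column j of B.
            entry = A[i][0] * B[0][j]
            muls += 1
            for k in range(1, n):
                entry += A[i][k] * B[k][j]
                muls += 1
                adds += 1
            C[i][j] = entry
    return C, muls, adds

C, muls, adds = schoolbook_multiply([[1, 2], [3, 4]], [[5, 6], [7, 8]])
assert C == [[19, 22], [43, 50]]
assert (muls, adds) == (8, 4)  # n^3 = 8 multiplications, (n-1)n^2 = 4 additions
</syntaxhighlight>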
=== Strassen's algorithm ===
== Matrix multiplication exponent ==
Using a naive lower bound and schoolbook matrix multiplication for the upper bound, one can straightforwardly conclude that {{math|2 ≤ ω ≤ 3}}. Whether {{math|1=ω = 2}} is a major open question in [[theoretical computer science]], and there is a line of research developing matrix multiplication algorithms to get improved bounds on {{math|ω}}.
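More precisely, the matrix multiplication exponent is commonly defined as
:<math>
\omega = \inf \bigl\{ c \in \mathbb{R} : \text{two } n \times n \text{ matrices can be multiplied with } O(n^c) \text{ field operations} \bigr\},
</math>
so the trivial lower bound gives {{math|ω ≥ 2}} and the schoolbook algorithm gives {{math|ω ≤ 3}}.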
The current best bound on {{math|ω}} is {{math|ω < 2.3728596}}, due to Josh Alman and [[Virginia Vassilevska Williams]].<ref name="aw20"/> This algorithm, like all other recent algorithms in this line of research, uses the ''laser method'', a generalization of the Coppersmith–Winograd algorithm, given by [[Don Coppersmith]] and [[Shmuel Winograd]] in 1990, which remained the best matrix multiplication algorithm until 2010 and has asymptotic complexity {{math|''O''(''n''<sup>2.375477</sup>)}}.<ref name="coppersmith">{{Citation|doi=10.1016/S0747-7171(08)80013-2 |title=Matrix multiplication via arithmetic progressions |url=http://www.cs.umd.edu/~gasarch/TOPICS/ramsey/matrixmult.pdf |year=1990 |last1=Coppersmith |first1=Don |last2=Winograd |first2=Shmuel |journal=Journal of Symbolic Computation |volume=9|issue=3|pages=251}}</ref> The conceptual idea of these algorithms is similar to Strassen's algorithm: one devises a way to multiply two {{math|''k'' × ''k''}} matrices with fewer than {{math|''k''<sup>3</sup>}} multiplications, and applies this technique recursively. The laser method has inherent limitations, however, and cannot be used to show that {{math|ω < 2.3725}}.<ref>{{Cite journal|last=Ambainis|first=Andris|last2=Filmus|first2=Yuval|last3=Le Gall|first3=François|date=2015-06-14|title=Fast Matrix Multiplication: Limitations of the Coppersmith-Winograd Method|url=https://doi.org/10.1145/2746539.2746554|journal=Proceedings of the forty-seventh annual ACM symposium on Theory of Computing|series=STOC '15|___location=Portland, Oregon, USA|publisher=Association for Computing Machinery|pages=585–593|doi=10.1145/2746539.2746554|isbn=978-1-4503-3536-2}}</ref>
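To make the recursive template concrete, here is a minimal sketch using Strassen's original scheme, which multiplies {{math|2 × 2}} block matrices with 7 instead of 8 block multiplications and hence runs in {{math|''O''(''n''<sup>log<sub>2</sub> 7</sup>) ≈ ''O''(''n''<sup>2.807</sup>)}} time. The sketch assumes {{mvar|n}} is a power of two and uses NumPy for the block arithmetic.

<syntaxhighlight lang="python">
import numpy as np

def strassen(A, B):
    """Multiply two square matrices whose size is a power of two,
    using 7 recursive block multiplications per level instead of 8."""
    n = A.shape[0]
    if n == 1:
        return A * B
    h = n // 2
    A11, A12, A21, A22 = A[:h, :h], A[:h, h:], A[h:, :h], A[h:, h:]
    B11, B12, B21, B22 = B[:h, :h], B[:h, h:], B[h:, :h], B[h:, h:]
    # Strassen's 7 products of half-size matrices.
    M1 = strassen(A11 + A22, B11 + B22)
    M2 = strassen(A21 + A22, B11)
    M3 = strassen(A11, B12 - B22)
    M4 = strassen(A22, B21 - B11)
    M5 = strassen(A11 + A12, B22)
    M6 = strassen(A21 - A11, B11 + B12)
    M7 = strassen(A12 - A22, B21 + B22)
    # Reassemble the four blocks of the product.
    return np.block([[M1 + M4 - M5 + M7, M3 + M5],
                     [M2 + M4,           M1 + M3 - M6 + M7]])

A, B = np.random.rand(8, 8), np.random.rand(8, 8)
assert np.allclose(strassen(A, B), A @ B)
</syntaxhighlight>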
There is a trivial lower bound of {{tmath|\omega \ge 2}}: since any algorithm for multiplying two {{math|''n'' × ''n''}} matrices has to process all {{math|2''n''<sup>2</sup>}} entries, every matrix multiplication algorithm requires {{math|Ω(''n''<sup>2</sup>)}} operations. Thus {{tmath|2\le \omega < 2.373}}, and it is unknown whether {{tmath|\omega > 2}}. The best known lower bound for matrix-multiplication complexity is {{math|Ω(''n''<sup>2</sup> log(''n''))}}, for bounded coefficient [[Arithmetic circuit complexity|arithmetic circuits]] over the real or complex numbers, and is due to [[Ran Raz]].<ref>{{cite journal | last1 = Raz | first1 = Ran | author-link = Ran Raz | year = 2002| title = On the complexity of matrix product | journal = Proceedings of the Thirty-fourth Annual ACM Symposium on Theory of Computing | pages = 144 | doi = 10.1145/509907.509932 | isbn = 1581134959 | s2cid = 9582328 }}</ref>
==Related complexities==
Problems that have the same asymptotic complexity as matrix multiplication include computing the [[determinant]], [[matrix inversion]], and [[Gaussian elimination]] (see next section). Problems with complexity expressible in terms of <math>\omega</math> include the characteristic polynomial, eigenvalues (but not eigenvectors), [[Hermite normal form]], and [[Smith normal form]].{{citation needed|date=March 2018}}
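For instance, matrix inversion can be reduced to matrix multiplication by recursing on half-size blocks via the [[Schur complement]]. The sketch below is a simplified illustration, not the full reduction used in the literature: it assumes the dimension is a power of two and that every leading block encountered is invertible. With a constant number of half-size multiplications and inversions per level, the recursion costs {{math|''O''(''n''<sup>''ω''</sup>)}} overall when {{math|''ω'' > 2}}.

<syntaxhighlight lang="python">
import numpy as np

def block_inverse(M):
    """Invert M using only matrix multiplications, additions, and
    recursion on half-size blocks, via the Schur complement.
    Assumes the size is a power of two and that every leading
    block encountered along the way is invertible."""
    n = M.shape[0]
    if n == 1:
        return np.array([[1.0 / M[0, 0]]])
    h = n // 2
    A, B = M[:h, :h], M[:h, h:]
    C, D = M[h:, :h], M[h:, h:]
    Ai = block_inverse(A)
    S = D - C @ Ai @ B            # Schur complement of A in M
    Si = block_inverse(S)
    return np.block([[Ai + Ai @ B @ Si @ C @ Ai, -Ai @ B @ Si],
                     [-Si @ C @ Ai,              Si]])

# A strictly diagonally dominant matrix, so all blocks stay invertible.
M = np.random.rand(8, 8) + 8 * np.eye(8)
assert np.allclose(block_inverse(M) @ M, np.eye(8))
</syntaxhighlight>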
=== Rectangular matrix multiplication ===
<ref>{{Citation|last=Gall|first=Francois Le|title=Improved Rectangular Matrix Multiplication using Powers of the Coppersmith-Winograd Tensor|date=2018-01-01|url=https://epubs.siam.org/doi/10.1137/1.9781611975031.67|work=Proceedings of the 2018 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA)|pages=1029–1046|series=Proceedings|publisher=Society for Industrial and Applied Mathematics|doi=10.1137/1.9781611975031.67|access-date=2021-05-23|last2=Urrutia|first2=Florent|arxiv=1708.05622}}</ref>
===Matrix inversion, determinant and Gaussian elimination===