Biconjugate gradient stabilized method: Difference between revisions

Content deleted Content added
m Reverted edit by 186.177.184.151 (talk) to last version by Weberjoh
 
(21 intermediate revisions by 16 users not shown)
Line 1:
{{Short description|Concept in mathematics}}
{{Technical|date=May 2015}}
 
In [[numerical linear algebra]], the '''biconjugate gradient stabilized method''', often abbreviated as '''BiCGSTAB''', is an [[iterative method]] developed by [[Henk van der Vorst|H. A. van der Vorst]] for the numerical solution of nonsymmetric [[System of linear equations|linear system]]s. It is a variant of the [[biconjugate gradient method]] (BiCG) and has faster and smoother convergence than the original BiCG as well as other variants such as the [[conjugate gradient squared method]] (CGS). It is a [[Krylov subspace]] method. Unlike the original BiCG method, it doesn't require multiplication by the transpose of the system matrix.
 
==Algorithmic steps==
===Unpreconditioned BiCGSTAB===
In the following sections, {{math|1=('''<var>x</var>''','''<var>y</var>''') = '''<var>x</var>'''<sup>T</sup> '''<var>y</var>'''}} denotes the [[dot product]] of vectors. To solve a linear system {{math|'''<var>Ax</var>''' {{=}} '''<var>b</var>'''}}, BiCGSTAB starts with an initial guess {{math|'''<var>x</var>'''<sub>0</sub>}} and proceeds as follows:
 
# {{math|'''<var>r</var>'''<sub>0</sub> {{=}} '''<var>b</var>''' − '''<var>Ax</var>'''<sub>0</sub>}}
# Choose an arbitrary vector {{math|'''<var>r̂</var>'''<sub>0</sub>}} such that {{math|('''<var>r̂</var>'''<sub>0</sub>, '''<var>r</var>'''<sub>0</sub>) ≠ 0}}, e.g., {{math|'''<var>r̂</var>'''<sub>0</sub> {{=}} '''<var>r</var>'''<sub>0</sub>}} . Note that notation
# {{math|('''<var>xρ</var>''','''<varsub>y0</varsub>''') {{=}} applies for scalar product of vectors {{math|1=('''<var>x</var>''','''<varsub>y0</varsub>''') =, '''<var>x^Tr</var>''' '''<varsub>y0</varsub>''') }}
# {{math|'''<var>ρp</var>'''<sub>0</sub> {{=}} '''<var>α</var> {{=}} <var>ωr</var>'''<sub>0</sub> {{=}} 1}}
# {{math|'''<var>v</var>'''<sub>0</sub> {{=}} '''<var>p</var>'''<sub>0</sub> {{=}} '''0'''}}
# For {{math|<var>i</var> {{=}} 1, 2, 3, …}}
## {{math|'''<var>ρ<sub>i</sub>v</var>''' {{=}} ('''<var>r̂</var>'''<sub>0</sub>, '''<var>rAp</var>'''<sub><var>i</var>−1</sub>)}}
## {{math|<var>βα</var> {{=}} (<var>ρ<sub>i</sub></var>/<var>ρ</var><sub><var>i</var>−1</sub>)/('''<var>α</var>/'''<varsub>ω0</var><sub>, <var>i'''v'''</var>−1</sub>)}}
## {{math|<var>'''ph'''<sub>i</sub></var> {{=}} '''<var>rx</var>'''<sub><var>i</var>−1</sub> + <var>β</var>(α'''<var>p</var>'''<sub><var>i</var>−1</sub> − <var>ω</var><sub><var>i</var>−1</sub>'''<var>v</var>'''<sub><var>i</var>−1</sub>) }}
## {{math|<var>'''vs'''<sub>i</sub></var> {{=}} <var>'''r'''<var>Ap</var>'''<sub><var>i</var>−1</sub> − <var>α'''v'''</var>}}
## If {{math|<var>α'''h'''</var> {{=}} <var>ρ<sub>is accurate enough, i.e., if </sub></var>/('''s'''</var> is small enough, then set {{math|</var>'''x'''<sub>0i</sub>,</var> {{=}} <var>'''vh'''<sub>i</sub></var>)}} and quit
## {{math|<var>'''h'''</var> {{=}} '''<var>x</var>'''<sub><var>i</var>−1</sub> + <var>α'''p'''<sub>i</sub></var> }}
## If {{math|<var>'''h'''</var>}} is accurate enough, then set {{math|<var>'''x'''<sub>i</sub></var> {{=}} <var>'''h'''</var>}} and quit
## {{math|<var>'''s'''</var> {{=}} <var>'''r'''</var><sub><var>i</var>−1</sub> − <var>α'''v'''<sub>i</sub></var>}}
## {{math|'''<var>t</var>''' {{=}} '''<var>As</var>'''}}
## {{math|<var>ω<sub>i</sub></var> {{=}} (<var>'''t'''</var>, <var>'''s'''</var>)/(<var>'''t'''</var>, <var>'''t'''</var>)}}
## {{math|<var>'''x'''<sub>i</sub></var> {{=}} <var>'''h'''</var> + <var>ω<sub>i</sub>'''s'''</var>}}
## If {{math|<var>'''xr'''<sub>i</sub></var> {{=}} is<var>'''s'''</var> accurate enough, then quit<var>ω'''t'''</var>}}
## If {{math|<var>'''rx'''<sub>i</sub></var>}} is accurate enough, i.e., if {{=}} math|<var>'''sr'''</var> − <var>ω<sub>i</sub>'''t'''</var>}} is small enough, then quit
## {{math|<var>'''h'''ρ<sub>i</sub></var> {{=}} ('''<var>x</var>'''<sub>0<var/sub>i, '''</var>−1r</sub> + <var>α'''p'''<sub><var>i</subvar></varsub> )}}
## {{math|<var>'''s'''β</var> {{=}} (<var>'''r'''ρ<sub>i</sub></var>/<var>ρ</var><sub><var>i</var>−1</sub>)(<var>α'''v'''<sub/var>i/</subvar>ω</var>)}}
## {{math|<var>α'''p'''<sub>i</sub></var> {{=}} '''<var>ρr</var>'''<sub><var>i</subvar></sub> + <var>β</var>('''<var>p</var>'''<sub>0<var>i</var>−1</sub>, <var>'''vω</var>'''<sub>i</subvar>v</var>''')}}
In some cases, choosing the vector {{math|'''<var>r̂</var>'''<sub>0</sub>}} randomly improves numerical stability.<ref>{{Cite journal |last=Schoutrop |first=Chris |last2=Boonkkamp |first2=Jan ten Thije |last3=Dijk |first3=Jan van |date=July 2022 |title=Reliability Investigation of BiCGStab and IDR Solvers for the Advection-Diffusion-Reaction Equation |url=https://doi.org/10.4208/cicp.OA-2021-0182 |journal=Communications in Computational Physics |language=en |volume=32 |issue=1 |pages=156–188 |doi=10.4208/cicp.oa-2021-0182 |issn=1815-2406}}</ref>
 
===Preconditioned BiCGSTAB===
Line 31 ⟶ 33:
# {{math|'''<var>r</var>'''<sub>0</sub> {{=}} '''<var>b</var>''' − '''<var>Ax</var>'''<sub>0</sub>}}
# Choose an arbitrary vector {{math|'''<var>r̂</var>'''<sub>0</sub>}} such that {{math|('''<var>r̂</var>'''<sub>0</sub>, '''<var>r</var>'''<sub>0</sub>) ≠ 0}}, e.g., {{math|'''<var>r̂</var>'''<sub>0</sub> {{=}} '''<var>r</var>'''<sub>0</sub>}}
# {{math|<var>ρ</var><sub>0</sub> {{=}} ('''<var>α</var>'''<sub>0</sub>, {{=}} '''<var>ωr</var>'''<sub>0</sub>) {{=}} 1}}
# {{math|'''<var>vp</var>'''<sub>0</sub> {{=}} '''<var>pr</var>'''<sub>0</sub> {{=}} '''0'''}}
# For {{math|<var>i</var> {{=}} 1, 2, 3, …}}
## {{math|'''<var>ρ<sub>i</sub>y</var>''' {{=}} ({{SubSup|'''<var>K</var>'''|2|−1}}{{SubSup|'''<subvar>0K</subvar>, '''|1|−1}}'''<var>rp</var>'''<sub><var>i</var>−1</sub>)}}
## {{math|<var>β'''v'''</var> {{=}} ('''<var>ρ<sub>i</sub>Ay</var>/<var>ρ</var><sub><var>i</var>−1</sub>)(<var>α</var>/<var>ω</var><sub><var>i</var>−1</sub>)'''}}
## {{math|<var>'''p'''<sub>i</sub>α</var> {{=}} '''<var>rρ</var>'''<sub><var>i</var>−1</sub> + <var>β</var>('''<var>p</var>'''<sub><var>i</var>−10</sub>, <var>ω</var><sub><var>i</var>−1</sub>'''<var>v</var>'''<sub><var>i</var>−1</sub>)}}
## {{math|'''<var>y</var>''' {{=}} '''<var>K</var>'''<sup>−1</sup>'''<var>p</var>'''<sub><var>i</var></sub>}}
## {{math|<var>'''v'''<sub>i</sub></var> {{=}} '''<var>Ay</var>'''}}
## {{math|<var>α</var> {{=}} <var>ρ<sub>i</sub></var>/('''<var>r̂</var>'''<sub>0</sub>, <var>'''v'''<sub>i</sub></var>)}}
## {{math|<var>'''h'''</var> {{=}} '''<var>x</var>'''<sub><var>i</var>−1</sub> + <var>α'''y'''</var> }}
## {{math|'''<var>vs</var>'''<sub>0</sub> {{=}} '''<var>pr</var>'''<sub>0<var>i</var>−1</sub> {{=}} <var>α'''0v'''</var>}}
## If {{math|<var>'''h'''</var>}} is accurate enough then {{math|<var>'''x'''<sub>i</sub></var> {{=}} <var>'''h'''</var>}} and quit
## {{math|'''<var>sz</var>''' {{=}} {{SubSup|'''<var>rK</var>'''<sub>|2|−1}}{{SubSup|'''<var>iK</var>−1</sub> − <var>α'''v|1|−1}}'''<sub>i</subvar>s</var>'''}}
## {{math|'''<var>z</var>''' {{=}} '''<var>K</var>'''<sup>−1</sup>'''<var>s</var>'''}}
## {{math|'''<var>t</var>''' {{=}} '''<var>Az</var>'''}}
## {{math|<var>ω<sub>i</sub></var> {{=}} ({{SubSup|'''<var>K</var>'''|1|−1}}'''<var>t</var>''', {{SubSup|'''<var>K</var>'''|1|−1}}'''<var>s</var>''')/({{SubSup|'''<var>K</var>'''|1|−1}}'''<var>t</var>''', {{SubSup|'''<var>K</var>'''|1|−1}}'''<var>t</var>''')}}
## {{math|<var>'''x'''<sub>i</sub></var> {{=}} <var>'''h'''</var> + <var>ω<sub>i</sub>'''z'''</var>}}
## If {{math|<var>'''h'''</var>}} is accurate enough, then set {{math|<var>'''xr'''<sub>i</sub></var> {{=}} '''<var>s</var>'''h − <var>ω'''t'''</var>}} and quit
## If {{math|<var>'''x'''<sub>i</sub></var>}} is accurate enough then quit
## {{math|<var>'''r'''ρ<sub>i</sub></var> {{=}} ('''<var>s</var>''' − <var>ω<sub>i0</sub>, '''t<var>r</var>'''<sub><var>i</var></sub>)}}
## {{math|'''<var>yβ</var>''' {{=}} '''(<var>Kρ</varsub>'''i<sup/sub>−1</supvar>'''/<var>pρ</var>'''<sub><var>i</var>−1</sub>)(<var>α</var>/<var>ω</var>)}}
## {{math|<var>'''p'''<sub>i</sub></var> {{=}} '''<var>r</var>'''<sub><var>i</var></sub> + <var>β</var>('''<var>p</var>'''<sub><var>i</var>−1</sub> − <var>ω</var>'''<var>v</var>''')}}
 
This formulation is equivalent to applying unpreconditioned BiCGSTAB to the explicitly preconditioned system
Line 90 ⟶ 92:
:{{math|'''<var>r̃</var>'''<sub><var>i</var></sub> {{=}} <var>Q<sub>i</sub></var>('''<var>A</var>''')<var>P<sub>i</sub></var>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>}}
 
where {{math|<var>Q<sub>i</sub></var>('''<var>A</var>''') {{=}} ('''<var>I</var>''' − <var>ω</var><sub>1</sub>'''<var>A</var>''')('''<var>I</var>''' − <var>ω</var><sub>2</sub>'''<var>A</var>''')⋯('''<var>I</var>''' − <var>ω<sub>i</sub>'''A'''</var>)}} with suitable constants {{math|<var>ω<sub>j</sub></var>}} instead of {{math|<var>'''r'''<sub>i</sub></var> {{=}} <var>P<sub>i</sub></var>('''<var>A</var>''')<var>'''r'''<sub>0</sub></var>}} in the hope that {{math|<var>Q<sub>i</sub></var>('''<var>A</var>''')}} will enable faster and smoother convergence in {{math|<var>'''r̃'''<sub>i</sub></var>}} than {{math|<var>'''r'''<sub>i</sub></var>}}.
 
It follows from the recurrence relations for {{math|<var>P<sub>i</sub></var>('''<var>A</var>''')}} and {{math|<var>T<sub>i</sub></var>('''<var>A</var>''')}} and the definition of {{math|<var>Q<sub>i</sub></var>('''<var>A</var>''')}} that
Line 98 ⟶ 100:
which entails the necessity of a recurrence relation for {{math|<var>Q<sub>i</sub></var>('''<var>A</var>''')<var>T<sub>i</sub></var>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>}}. This can also be derived from the BiCG relations:
 
:{{math|<var>Q<sub>i</sub></var>('''<var>A</var>''')<var>T<sub>i</sub></var>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub> {{=}} <var>Q<sub>i</sub></var>('''<var>A</var>''')<var>P<sub>i</sub></var>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub> + <var>β</var><sub><var>i</var>+1</sub>('''<var>I</var>''' − <var>ω<sub>i</sub>'''A'''</var>)<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>''')<var>PT</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>}}.
 
Similarly to defining {{math|<var>'''r̃'''<sub>i</sub></var>}}, BiCGSTAB defines
Line 132 ⟶ 134:
:{{math|<var>ρ̃</var><sub><var>i</var></sub> {{=}} (<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>) {{=}} ('''<var>r̂</var>'''<sub>0</sub>, <var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>''')<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>) {{=}} ('''<var>r̂</var>'''<sub>0</sub>, '''<var>r</var>'''<sub><var>i</var>−1</sub>)}}.
 
Due to biorthogonality, {{math|'''<var>r</var>'''<sub><var>i</var>−1</sub> {{=}} <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>}} is orthogonal to {{math|<var>U</var><sub><var>i</var>−2</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>}} where {{math|<var>U</var><sub><var>i</var>−2</sub>('''<var>A</var>'''<sup>T</sup>)}} is any polynomial of degree {{math|<var>i</var> − 2}} in {{math|'''<var>A</var>'''<sup>T</sup>}}. Hence, only the highest-order terms of {{math|<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} and {{math|<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} matter in the [[dot productsproduct]]s {{math|(<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>)}} and {{math| (<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>)}}. The leading coefficients of {{math|<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} and {{math|<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} are {{math|(−1)<sup><var>i</var>−1</sup><var>α</var><sub>1</sub><var>α</var><sub>2</sub>⋯<var>α</var><sub><var>i</var>−1</sub>}} and {{math|(−1)<sup><var>i</var>−1</sup><var>ω</var><sub>1</sub><var>ω</var><sub>2</sub>⋯<var>ω</var><sub><var>i</var>−1</sub>}}, respectively. It follows that
 
:{{math|<var>ρ<sub>i</sub></var> {{=}} (<var>α</var><sub>1</sub>/<var>ω</var><sub>1</sub>)(<var>α</var><sub>2</sub>/<var>ω</var><sub>2</sub>)⋯(<var>α</var><sub><var>i</var>−1</sub>/<var>ω</var><sub><var>i</var>−1</sub>)<var>ρ̃</var><sub><var>i</var></sub>}},
Line 144 ⟶ 146:
:{{math|<var>α<sub>i</sub></var> {{=}} <var>ρ<sub>i</sub></var>/('''<var>p̂</var>'''<sub><var>i</var></sub>, <var>'''Ap'''<sub>i</sub></var>) {{=}} (<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>)/(<var>T</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>'''A'''T</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>)}}.
 
Similarly to the case above, only the highest-order terms of {{math|<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} and {{math|<var>T</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} matter in the [[dot productsproduct]]s thanks to biorthogonality and biconjugacy. It happens that {{math|<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} and {{math|<var>T</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} have the same leading coefficient. Thus, they can be replaced simultaneously with {{math|<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} in the formula, which leads to
 
:{{math|<var>α<sub>i</sub></var> {{=}} (<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>)/(<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>'''A'''T</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>) {{=}} <var>ρ̃</var><sub><var>i</var></sub>/('''<var>r̂</var>'''<sub>0</sub>, '''<var>A</var>'''<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>''')<var>T</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>) {{=}} <var>ρ̃</var><sub><var>i</var></sub>/('''<var>r̂</var>'''<sub>0</sub>, '''<var>Ap̃</var>'''<sub><var>i</var></sub>)}}.
Line 167 ⟶ 169:
 
==References==
{{reflist}}
* {{Cite journal | doi = 10.1137/0913035 | title = Bi-CGSTAB: A Fast and Smoothly Converging Variant of Bi-CG for the Solution of Nonsymmetric Linear Systems | year = 1992 | last1 = Van der Vorst | first1 = H. A. | journal = [[SIAM Journal on Scientific Computing|SIAM J. Sci. Stat. Comput.]] | volume = 13 | issue = 2 | pages = 631–644 | hdl = 10338.dmlcz/104566 | hdl-access = free }}
* {{cite book
Line 180 ⟶ 183:
|publisher = SIAM
|isbn = 978-0-89871-534-7
|doi = 10.2277/0898715342
}}
* {{note|bicgstab2}}{{Cite journal| doi = 10.1137/0914062| title = Variants of BICGSTAB for Matrices with Complex Spectrum| year = 1993| last1 = Gutknecht | first1 = M. H.| journal = [[SIAM Journal on Scientific Computing|SIAM J. Sci. Comput.]]| volume = 14| issue = 5| pages = 1020–1033 }}
* {{note|bicgstab(l)}}{{cite journal
| last1 = Sleijpen
| first1 = G. L. G.
| last2 = Fokkema
| first2 = D. R.
| date = November 1993
| title = BiCGstab(''l'') for linear equations involving unsymmetric matrices with complex spectrum
| journal = [[Electronic Transactions on Numerical Analysis]]
Line 194 ⟶ 198:
| ___location = Kent, OH
| issn = 1068-9613
| url = http://www.emis.ams.orgde/journals/ETNA/vol.1.1993/pp11-32.dir/pp11-32.pdf
}}