Content deleted Content added
m →BiCG in polynomial form: l Recurrence relation |
m Reverted edit by 186.177.184.151 (talk) to last version by Weberjoh |
||
(41 intermediate revisions by 29 users not shown) | |||
Line 1:
{{Short description|Concept in mathematics}}
In [[numerical linear algebra]], the '''biconjugate gradient stabilized method''', often abbreviated as '''BiCGSTAB''', is an [[iterative method]] developed by [[Henk van der Vorst|H. A. van der Vorst]] for the numerical solution of nonsymmetric [[System of linear equations|linear system]]s. It is a variant of the [[biconjugate gradient method]] (BiCG) and has faster and smoother convergence than the original BiCG as well as other variants such as the [[conjugate gradient squared method]] (CGS). It is a [[Krylov subspace]] method.▼
{{Technical|date=May 2015}}
▲In [[numerical linear algebra]], the '''biconjugate gradient stabilized method''', often abbreviated as '''BiCGSTAB''', is an [[iterative method]] developed by [[Henk van der Vorst|H. A. van der Vorst]] for the numerical solution of nonsymmetric [[System of linear equations|linear system]]s. It is a variant of the [[biconjugate gradient method]] (BiCG) and has faster and smoother convergence than the original BiCG as well as other variants such as the [[conjugate gradient squared method]] (CGS). It is a [[Krylov subspace]] method. Unlike the original BiCG method, it doesn't require multiplication by the transpose of the system matrix.
==Algorithmic steps==
===Unpreconditioned BiCGSTAB===
In the following sections, {{math|1=('''<var>x</var>''','''<var>y</var>''') = '''<var>x</var>'''<sup>T</sup> '''<var>y</var>'''}} denotes the [[dot product]] of vectors. To solve a linear system {{math|'''<var>Ax</var>''' {{=}} '''<var>b</var>'''}}, BiCGSTAB starts with an initial guess {{math|'''<var>x</var>'''<sub>0</sub>}} and proceeds as follows:
# {{math|'''<var>r</var>'''<sub>0</sub> {{=}} '''<var>b</var>''' − '''<var>Ax</var>'''<sub>0</sub>}}
# Choose an arbitrary vector {{math|'''<var>r̂</var>'''<sub>0</sub>}} such that {{math|('''<var>r̂</var>'''<sub>0</sub>, '''<var>r</var>'''<sub>0</sub>) ≠ 0}}, e.g., {{math|'''<var>r̂</var>'''<sub>0</sub> {{=}} '''<var>r</var>'''<sub>0</sub>}}
# {{math|<var>ρ</var><sub>0</sub> {{=}} ('''<var>
# {{math|'''<var>
# For {{math|<var>i</var> {{=}} 1, 2, 3, …}}
## {{math|'''<var>
## {{math|<var>
## {{math|<var>'''
## {{math|<var> ## If {{math|<var>'''
## {{math|<var>α</var> {{=}} <var>ρ<sub>i</sub></var>/('''<var>r̂</var>'''<sub>0</sub>, <var>'''v'''<sub>i</sub></var>)}}▼
## {{math|'''<var>s</var>''' {{=}} '''<var>r</var>'''<sub><var>i<var>−1</sub> − <var>α'''v'''<sub>i</sub></var>}}▼
## {{math|'''<var>t</var>''' {{=}} '''<var>As</var>'''}}
## {{math|<var>ω
## {{math|<var>'''x'''<sub>i</sub></var> {{=}}
##
## If {{math|<var>'''
## {{math|<var>ρ<sub>i</sub></var> {{=}} ('''<var>r̂</var>'''<sub>0</sub>, '''<var>r</var>'''<sub><var>i</var></sub>)}}
▲## {{math|<var>
## {{math|<var>'''p'''<sub>i</sub></var> {{=}} '''<var>r</var>'''<sub><var>i</var></sub> + <var>β</var>('''<var>p</var>'''<sub><var>i</var>−1</sub> − <var>ω</var>'''<var>v</var>''')}}
In some cases, choosing the vector {{math|'''<var>r̂</var>'''<sub>0</sub>}} randomly improves numerical stability.<ref>{{Cite journal |last=Schoutrop |first=Chris |last2=Boonkkamp |first2=Jan ten Thije |last3=Dijk |first3=Jan van |date=July 2022 |title=Reliability Investigation of BiCGStab and IDR Solvers for the Advection-Diffusion-Reaction Equation |url=https://doi.org/10.4208/cicp.OA-2021-0182 |journal=Communications in Computational Physics |language=en |volume=32 |issue=1 |pages=156–188 |doi=10.4208/cicp.oa-2021-0182 |issn=1815-2406}}</ref>
===Preconditioned BiCGSTAB===
Line 27 ⟶ 33:
# {{math|'''<var>r</var>'''<sub>0</sub> {{=}} '''<var>b</var>''' − '''<var>Ax</var>'''<sub>0</sub>}}
# Choose an arbitrary vector {{math|'''<var>r̂</var>'''<sub>0</sub>}} such that {{math|('''<var>r̂</var>'''<sub>0</sub>, '''<var>r</var>'''<sub>0</sub>) ≠ 0}}, e.g., {{math|'''<var>r̂</var>'''<sub>0</sub> {{=}} '''<var>r</var>'''<sub>0</sub>}}
# {{math|<var>ρ</var><sub>0</sub> {{=}} ('''<var>
# {{math|'''<var>
# For {{math|<var>i</var> {{=}} 1, 2, 3, …}}
## {{math|'''<var>
## {{math|<var>
## {{math|<var>
## {{math|<var>'''h'''
## {{math|
## If {{math|<var>
## {{math|'''<var>
▲## {{math|'''<var>z</var>''' {{=}} '''<var>K</var>'''<sup>−1</sup>'''<var>s</var>'''}}
## {{math|'''<var>t</var>''' {{=}} '''<var>Az</var>'''}}
## {{math|<var>ω
## {{math|<var>'''x'''<sub>i</sub></var> {{=}}
## {{math|<var>'''r'''<sub>i</sub></var> {{=}} '''<var>s</var>''' − <var>ω'''t'''</var>}}
## If {{math|<var>'''x'''<sub>i</sub></var>}} is accurate enough then quit
## {{math|<var>
▲## {{math|
## {{math|<var>'''p'''<sub>i</sub></var> {{=}} '''<var>r</var>'''<sub><var>i</var></sub> + <var>β</var>('''<var>p</var>'''<sub><var>i</var>−1</sub> − <var>ω</var>'''<var>v</var>''')}}
This formulation is equivalent to applying unpreconditioned BiCGSTAB to the explicitly preconditioned system
Line 77 ⟶ 85:
:{{math|<var>P<sub>i</sub></var>('''<var>A</var>''') {{=}} <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''') − <var>α<sub>i</sub>'''A'''T</var><sub><var>i</var>−1</sub>('''<var>A</var>''')}},
:{{math|<var>T<sub>i</sub></var>('''<var>A</var>''') {{=}} <var>P<sub>i</sub></var>('''<var>A</var>''')
===Derivation of BiCGSTAB from BiCG===
Line 84 ⟶ 92:
:{{math|'''<var>r̃</var>'''<sub><var>i</var></sub> {{=}} <var>Q<sub>i</sub></var>('''<var>A</var>''')<var>P<sub>i</sub></var>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>}}
where {{math|<var>Q<sub>i</sub></var>('''<var>A</var>''') {{=}} ('''<var>I</var>''' − <var>ω</var><sub>1</sub>'''<var>A</var>''')('''<var>I</var>''' − <var>ω</var><sub>2</sub>'''<var>A</var>''')⋯('''<var>I</var>''' − <var>ω<sub>i</sub>'''A'''</var>)}} with suitable constants {{math|<var>ω<sub>j</sub></var>}} instead of {{math|<var>'''r'''<sub>i</sub></var> {{=}} <var>P<sub>i</sub></var>('''<var>A</var>''')<var>'''r'''<sub>0</sub></var>}} in the hope that {{math|<var>Q<sub>i</sub></var>('''<var>A</var>''')}} will enable faster and smoother convergence in {{math|<var>'''r̃'''<sub>i</sub></var>}} than {{math|<var>'''r'''<sub>i</sub></var>}}.
It follows from the recurrence relations for {{math|<var>P<sub>i</sub></var>('''<var>A</var>''')}} and {{math|<var>T<sub>i</sub></var>('''<var>A</var>''')}} and the definition of {{math|<var>Q<sub>i</sub></var>('''<var>A</var>''')}} that
Line 92 ⟶ 100:
which entails the necessity of a recurrence relation for {{math|<var>Q<sub>i</sub></var>('''<var>A</var>''')<var>T<sub>i</sub></var>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>}}. This can also be derived from the BiCG relations:
:{{math|<var>Q<sub>i</sub></var>('''<var>A</var>''')<var>T<sub>i</sub></var>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub> {{=}} <var>Q<sub>i</sub></var>('''<var>A</var>''')<var>P<sub>i</sub></var>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub> + <var>β</var><sub><var>i</var>+1</sub>('''<var>I</var>''' − <var>ω<sub>i</sub>'''A'''</var>)<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>''')<var>
Similarly to defining {{math|<var>'''r̃'''<sub>i</sub></var>}}, BiCGSTAB defines
Line 126 ⟶ 134:
:{{math|<var>ρ̃</var><sub><var>i</var></sub> {{=}} (<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>) {{=}} ('''<var>r̂</var>'''<sub>0</sub>, <var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>''')<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>) {{=}} ('''<var>r̂</var>'''<sub>0</sub>, '''<var>r</var>'''<sub><var>i</var>−1</sub>)}}.
Due to biorthogonality, {{math|'''<var>r</var>'''<sub><var>i</var>−1</sub> {{=}} <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>}} is orthogonal to {{math|<var>U</var><sub><var>i</var>−2</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>}} where {{math|<var>U</var><sub><var>i</var>−2</sub>('''<var>A</var>'''<sup>T</sup>)}} is any polynomial of degree {{math|<var>i</var> − 2}} in {{math|'''<var>A</var>'''<sup>T</sup>}}. Hence, only the highest-order terms of {{math|<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} and {{math|<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} matter in the [[dot
:{{math|<var>ρ<sub>i</sub></var> {{=}} (<var>α</var><sub>1</sub>/<var>ω</var><sub>1</sub>)(<var>α</var><sub>2</sub>/<var>ω</var><sub>2</sub>)⋯(<var>α</var><sub><var>i</var>−1</sub>/<var>ω</var><sub><var>i</var>−1</sub>)<var>ρ̃</var><sub><var>i</var></sub>}},
Line 138 ⟶ 146:
:{{math|<var>α<sub>i</sub></var> {{=}} <var>ρ<sub>i</sub></var>/('''<var>p̂</var>'''<sub><var>i</var></sub>, <var>'''Ap'''<sub>i</sub></var>) {{=}} (<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>)/(<var>T</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>'''A'''T</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>)}}.
Similarly to the case above, only the highest-order terms of {{math|<var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} and {{math|<var>T</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)}} matter in the [[dot
:{{math|<var>α<sub>i</sub></var> {{=}} (<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>P</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>)/(<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>'''<sup>T</sup>)'''<var>r̂</var>'''<sub>0</sub>, <var>'''A'''T</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>) {{=}} <var>ρ̃</var><sub><var>i</var></sub>/('''<var>r̂</var>'''<sub>0</sub>, '''<var>A</var>'''<var>Q</var><sub><var>i</var>−1</sub>('''<var>A</var>''')<var>T</var><sub><var>i</var>−1</sub>('''<var>A</var>''')'''<var>r</var>'''<sub>0</sub>) {{=}} <var>ρ̃</var><sub><var>i</var></sub>/('''<var>r̂</var>'''<sub>0</sub>, '''<var>Ap̃</var>'''<sub><var>i</var></sub>)}}.
Line 151 ⟶ 159:
==Generalization==
BiCGSTAB can be viewed as a combination of BiCG and [[GMRES]] where each BiCG step is followed by a GMRES({{math|1}}) (i.e., GMRES restarted at each step) step to repair the irregular convergence behavior of CGS, as an improvement of which BiCGSTAB was developed. However, due to the use of degree-one minimum residual polynomials, such repair may not be effective if the matrix {{math|'''<var>A</var>'''}} has large complex eigenpairs. In such cases, BiCGSTAB is likely to stagnate, as confirmed by numerical experiments.
One may expect that higher-degree minimum residual polynomials may better handle this situation. This gives rise to algorithms including BiCGSTAB2{{ref|bicgstab2}} and the more general BiCGSTAB({{math|<var>l</var>}}){{ref|bicgstab(l)}}. In BiCGSTAB({{math|<var>l</var>}}), a GMRES({{math|<var>l</var>}}) step follows every {{math|<var>l</var>}} BiCG steps. BiCGSTAB2 is equivalent to BiCGSTAB({{math|<var>l</var>}}) with {{math|<var>l</var> {{=}} 2}}.
Line 161 ⟶ 169:
==References==
{{reflist}}
* {{Cite journal | doi = 10.1137/0913035 | title = Bi-CGSTAB: A Fast and Smoothly Converging Variant of Bi-CG for the Solution of Nonsymmetric Linear Systems | year = 1992 | last1 = Van der Vorst | first1 = H. A. | journal = [[SIAM Journal on Scientific Computing|SIAM J. Sci. Stat. Comput.]] | volume = 13 | issue = 2 | pages = 631–644 | hdl = 10338.dmlcz/104566 | hdl-access = free }}
* {{cite book
|last = Saad
|first = Y.
|title = Iterative Methods for Sparse Linear Systems
|chapter-url = https://archive.org/details/iterativemethods0000saad
|chapter-url-access = registration
|edition = 2nd
|chapter = §7.4.2 BICGSTAB
|pages = [https://archive.org/details/iterativemethods0000saad/page/231 231–234]
|year = 2003
|publisher = SIAM
|isbn = 978-0-89871-534-7
}}
* {{note|bicgstab2}}{{Cite journal| doi = 10.1137/0914062| title = Variants of BICGSTAB for Matrices with Complex Spectrum| year = 1993| last1 = Gutknecht | first1 = M. H.| journal = [[SIAM Journal on Scientific Computing|SIAM J. Sci. Comput.]]| volume = 14| issue = 5| pages = 1020–1033 }}
* {{note|bicgstab(l)}}{{cite journal
| last1 = Sleijpen
| first1 = G. L. G. | last2 = Fokkema
| first2 = D. R. | date = November 1993
| title = BiCGstab(''l'') for linear equations involving unsymmetric matrices with complex spectrum
| journal = [[Electronic Transactions on Numerical Analysis]]
| volume = 1
| pages = 11–32
Line 186 ⟶ 198:
| ___location = Kent, OH
| issn = 1068-9613
| url = http://www.emis.
}}
|