Talk:Lanczos algorithm


Old comments that did not have a section of their own

[1] - an unfocused variety of Lanczos algorithm —Preceding unsigned comment added by 134.219.166.104 (talk · contribs) 21:23, 1 September 2005

This doesn't have much but it does have a reference to a book: [mathworld on Lanczos Algorithm] —Preceding unsigned comment added by RJFJR (talk · contribs) 23:36, 25 September 2005

I don't believe this is the Lanczos algorithm at all. It is the power method. —Preceding unsigned comment added by 130.126.55.123 (talk · contribs) 01:04, 5 August 2006

I don't know if the algorithm is correct, but it's certainly different than the power method, and presented pretty clearly. I think it's gotten me on the right track at least... Thanks. --Jjdonald (talk) 22:22, 17 December 2007 (UTC)

It is not easy to say it's wrong or correct, since quite some information is missing in order to apply it: (a) how to choose v[1], (b) how to choose m, (c) how to recognize the eigenvalues of A among those of T_mm. Unfortunately, this vagueness is by no means eliminated by the Numerical stability section. — MFH:Talk 21:57, 12 September 2008 (UTC)
It is certainly not completely correct: there's at least something faulty with the indices. — MFH:Talk 19:59, 8 December 2011 (UTC)

It should state that "it applies to Hermitian matrices" at the start of the article and not somewhere in the middle. limweizhong (talk) 09:54, 11 November 2008 (UTC)

There is a paper about Non-Symmetric Lanczos' algorithm (compared to Arnoldi) by Jane Cullum. — MFH:Talk 20:07, 8 December 2011 (UTC)

In Latent Semantic Indexing, for...

I really think that this sentence does not belong in the first paragraph! Please, someone who understands anything about it should create a separate section and explain what this is about! Alain Michaud (talk) 16:52, 19 February 2010 (UTC)

Block Lanczos algorithm

I suppose that Peter Montgomery's 1995 paper was very good, but I do not see the need to inform everyone about its existence. This topic is much too advanced to be discussed at the top of the page. Please move this (second paragraph) towards the end of the page.

Alain Michaud (talk) 16:50, 19 February 2010 (UTC)

Extracting information from tridiagonal matrix

So Lanczos gives you a tridiagonal matrix. I think a link would be helpful which explains how to extract low eigenvalues/eigenvectors from this matrix. —Preceding unsigned comment added by 209.6.144.249 (talk) 06:30, 2 March 2008 (UTC)

Agree - or largest eigenvalues: anyway, the article starts by saying that it's for calculating eigenvalues, but then stops with the tridiag. matrix.
B.t.w., the algorithm calculates up to v[m+1], I think this could be avoided. (also, "unrolling" the 1st part of the m=1 case as initialization should make it possible to avoid using v[0].) — MFH:Talk 03:09, 11 September 2008 (UTC)
PS: also, it should be said what 'm' is...
  • Seconded — the article spends a lot of ink on the Lanczos iteration (but could do a better job at explaining it) for producing a tridiagonal matrix, says various other algorithms can be used for calculating eigenvalues and eigenvectors of that tridiagonal matrix, but is almost silent on how the two are related. As far as I can tell, early steps of the iteration tend to put most of the weight on the extreme eigenvalues (largest and smallest both, regardless of their absolute values), meaning those are fairly accurately reproduced in the tridiagonal matrix, and the algorithm proceeds towards less extreme eigenvalues the longer it is run; it's 'tend' because the initial weight distribution depends on the initial vector, which is chosen at random. What is not clear from mere thought experiments is how concentrated the distribution is … 130.243.68.202 (talk) 14:22, 2 May 2017 (UTC)

Rewrite of algorithm

Thinking some more about this, I find it desirable to modify the way the algorithm is stated — partly to address the case $\beta_j = 0$, and partly to do something about the "changes to" remark at the end, which is confusing in that no variable assigned in the algorithm is ever changed. It turns out the remark is boilerplate text in the {{algorithm-end}} template, and there is no option to omit it. Since Wikipedia:WikiProject_Computer_science/Manual_of_style#Algorithms_and_data_structures does not recommend using that template, and instead recommends using (numbered) lists with explicit input and output headings, a rewrite from scratch seems in order.

Input a Hermitian matrix $A$ of size $n \times n$, and optionally a number of iterations $m$ (as default, let $m = n$)
  • Strictly speaking, the algorithm does not need access to the explicit matrix, but only a function $v \mapsto Av$ that computes the product of the matrix by an arbitrary vector. This function is called at most $m$ times.
Output an $n \times m$ matrix $V$ with orthonormal columns and a tridiagonal real symmetric matrix $T = V^* A V$ of size $m \times m$. If $m = n$ then $V$ is unitary and $A = V T V^*$.
Warning The Lanczos iteration is prone to numerical instability. When executed in non-exact arithmetic, additional measures (as outlined further down) should be taken to ensure validity of the results.
  1. Let $v_1 \in \mathbb{C}^n$ be an arbitrary vector with Euclidean norm $1$.
  2. Abbreviated initial iteration step:
    1. Let $w_1' = A v_1$.
    2. Let $\alpha_1 = w_1'^* v_1$.
    3. Let $w_1 = w_1' - \alpha_1 v_1$.
  3. For $j = 2, \dots, m$ do:
    1. Let $\beta_j = \| w_{j-1} \|$ (also Euclidean norm).
    2. If $\beta_j \neq 0$ then let $v_j = w_{j-1} / \beta_j$,
      else pick as $v_j$ an arbitrary vector with Euclidean norm $1$ that is orthogonal to all of $v_1, \dots, v_{j-1}$.
    3. Let $w_j' = A v_j$.
    4. Let $\alpha_j = w_j'^* v_j$.
    5. Let $w_j = w_j' - \alpha_j v_j - \beta_j v_{j-1}$.
  4. Let $V$ be the matrix with columns $v_1, \dots, v_m$. Let $T = \begin{pmatrix} \alpha_1 & \beta_2 & & & \\ \beta_2 & \alpha_2 & \beta_3 & & \\ & \beta_3 & \alpha_3 & \ddots & \\ & & \ddots & \ddots & \beta_m \\ & & & \beta_m & \alpha_m \end{pmatrix}$.
Note $A v_j = \beta_{j+1} v_{j+1} + \alpha_j v_j + \beta_j v_{j-1}$ for $1 < j < m$.
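
A minimal NumPy sketch of the procedure above, for concreteness (the matvec-style interface follows the note under Input; the random choice of $v_1$ and all names are illustrative assumptions of this sketch, and no extra stabilisation is done, so the Warning applies in full):

```python
import numpy as np

def lanczos(matvec, n, m, rng=None):
    """Raw Lanczos iteration: matvec computes v -> A v for a Hermitian A.

    Returns V (n x m, orthonormal columns in exact arithmetic) and the
    real symmetric tridiagonal T = V* A V.
    """
    rng = np.random.default_rng() if rng is None else rng
    V = np.zeros((n, m), dtype=complex)
    alpha = np.zeros(m)          # diagonal of T
    beta = np.zeros(m)           # beta[j] (0-based) holds beta_{j+1}

    # Step 1: arbitrary vector of Euclidean norm 1.
    v = rng.standard_normal(n) + 1j * rng.standard_normal(n)
    v /= np.linalg.norm(v)
    V[:, 0] = v

    # Step 2: abbreviated initial iteration step.
    w = matvec(v)
    alpha[0] = np.vdot(w, v).real    # w1'* v1, real since A is Hermitian
    w = w - alpha[0] * v

    # Step 3: j = 2, ..., m in the 1-based notation above.
    for j in range(1, m):
        beta[j] = np.linalg.norm(w)
        if beta[j] != 0:
            v_next = w / beta[j]
        else:
            # Invariant subspace reached: pick a unit vector orthogonal
            # to all previous Lanczos vectors (one Gram-Schmidt pass).
            v_next = rng.standard_normal(n) + 1j * rng.standard_normal(n)
            v_next -= V[:, :j] @ (V[:, :j].conj().T @ v_next)
            v_next /= np.linalg.norm(v_next)
        w = matvec(v_next)
        alpha[j] = np.vdot(w, v_next).real
        w = w - alpha[j] * v_next - beta[j] * v
        v = v_next
        V[:, j] = v

    # Step 4: assemble the tridiagonal matrix T.
    T = np.diag(alpha) + np.diag(beta[1:], 1) + np.diag(beta[1:], -1)
    return V, T
```

In exact arithmetic the columns of $V$ are orthonormal and $V^* A V = T$; in floating point they gradually lose orthogonality, which is what the stabilisation schemes mentioned below address.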

There are in principle four ways to write the iteration procedure. Paige and other works show that the above order of operations is the most numerically stable.[1][2] In practice the initial vector $v_1$ may be taken as another argument of the procedure, with $\beta_j = 0$ and indicators of numerical errors being included as additional loop termination conditions.

Not counting the matrix–vector multiplication, each iteration does $O(n)$ arithmetical operations. If $d$ is the average number of nonzero elements in a row of $A$, then the matrix–vector multiplication can be done in $O(dn)$ arithmetical operations. Total complexity is thus $O(dmn)$, or $O(dn^2)$ if $m = n$; the Lanczos algorithm can be really fast for sparse matrices. Schemes for improving numerical stability are typically judged against this high performance.
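
To put hypothetical numbers on these counts (the figures below are invented purely for illustration): with $n = 10^6$, an average of $d = 10$ nonzero elements per row, and $m = 100$ iterations,

$$dmn = 10 \cdot 100 \cdot 10^6 = 10^9$$

arithmetical operations suffice, whereas dense tridiagonalisation of such a matrix (e.g. by Householder reflections) would cost on the order of $n^3 = 10^{18}$ operations.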

The vectors $v_j$ are called Lanczos vectors. The vector $w_j'$ is not used after $w_j$ is computed, and the vector $w_j$ is not used after $v_{j+1}$ is computed. Hence one may use the same storage for all three. Likewise, if only the tridiagonal matrix $T$ is sought, then the raw iteration does not need $v_{j-1}$ after having computed $w_j$, although some schemes for improving the numerical stability would need it later on. Sometimes the subsequent Lanczos vectors are recomputed from $v_1$ when needed. 130.243.68.122 (talk) 15:20, 26 May 2017 (UTC)
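
A sketch of the memory-lean variant described here (assuming only the coefficients of $T$ are wanted; the early exit at $\beta_j = 0$ is a simplification in this sketch, where the full algorithm above would instead restart with a fresh orthogonal vector):

```python
import numpy as np

def lanczos_coefficients(matvec, v1, m):
    """Keep only the current and previous Lanczos vectors plus w."""
    v_prev = np.zeros_like(v1)
    v = v1 / np.linalg.norm(v1)
    alpha = np.zeros(m)
    beta = np.zeros(m)
    w = matvec(v)
    alpha[0] = np.vdot(w, v).real
    w = w - alpha[0] * v
    for j in range(1, m):
        beta[j] = np.linalg.norm(w)
        if beta[j] == 0:
            return alpha[:j], beta[:j]   # invariant subspace: stop early
        v_prev, v = v, w / beta[j]       # w's storage becomes the new v
        w = matvec(v)
        alpha[j] = np.vdot(w, v).real
        w = w - alpha[j] * v - beta[j] * v_prev
    return alpha, beta
```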

Application to the eigenproblem

The Lanczos algorithm is most often brought up in the context of finding the eigenvalues and eigenvectors of a matrix, but whereas an ordinary diagonalization of a matrix would make eigenvectors and eigenvalues apparent from inspection, the same is not true for the tridiagonalization performed by the Lanczos algorithm; nontrivial additional steps are needed to compute even a single eigenvalue or eigenvector. Nonetheless, applying the Lanczos algorithm is often a significant step forward in computing the eigendecomposition. First observe that when $T$ is $n \times n$, it is similar to $A$: if $\lambda$ is an eigenvalue of $T$ then it is also an eigenvalue of $A$, and if $Tx = \lambda x$ ($x$ is an eigenvector of $T$) then $y = Vx$ is the corresponding eigenvector of $A$ (since $Ay = AVx = VTV^*Vx = VTx = \lambda Vx = \lambda y$). Thus the Lanczos algorithm transforms the eigendecomposition problem for $A$ into the eigendecomposition problem for $T$.
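
A brief sketch of how this similarity is exploited in practice ($V$ and $T$ are assumed to come from a Lanczos run as above; the explicit matrix $A$ is assumed available here only for the residual check):

```python
import numpy as np

def ritz_pairs(A, V, T):
    """Map eigenpairs of T to approximate eigenpairs of A."""
    theta, S = np.linalg.eigh(T)   # eigenpairs of the small tridiagonal T
    Y = V @ S                      # columns approximate eigenvectors of A
    # Residual norms ||A y_i - theta_i y_i||; small values indicate
    # converged pairs (typically the extreme ones when m < n, and all
    # of them when m = n in exact arithmetic).
    resid = np.linalg.norm(A @ Y - Y * theta, axis=0)
    return theta, Y, resid
```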

  1. For tridiagonal matrices, there exist a number of specialised algorithms, often with better computational complexity than general-purpose algorithms. For example, if $T$ is an $m \times m$ tridiagonal symmetric matrix then:
    • The continuant recursion allows computing the characteristic polynomial in $O(m^2)$ operations, and evaluating it at a point in $O(m)$ operations (a sketch follows after this list).
    • The divide-and-conquer eigenvalue algorithm can be used to compute the entire eigendecomposition of $T$ in $O(m^2)$ operations.
    • The Fast Multipole Method[3] can compute all eigenvalues in just $O(m \log m)$ operations.
  2. Some general eigendecomposition algorithms, notably the QR algorithm, are known to converge faster for tridiagonal matrices than for general matrices. Asymptotic complexity of tridiagonal QR is $O(m^2)$, just as for the divide-and-conquer algorithm (though the constant factor may be different); since the eigenvectors together have $m^2$ elements, this is asymptotically optimal.
  3. Even algorithms whose convergence rates are unaffected by unitary transformations, such as the power method and inverse iteration, may enjoy low-level performance benefits from being applied to the tridiagonal matrix $T$ rather than the original matrix $A$. Since $T$ is very sparse with all nonzero elements in highly predictable positions, it permits compact storage with excellent performance vis-à-vis caching. Likewise, $T$ is a real matrix with all eigenvectors and eigenvalues real, whereas $A$ in general may have complex elements and eigenvectors, so real arithmetic is sufficient for finding the eigenvectors and eigenvalues of $T$.
  4. If $n$ is very large, then reducing $m$ so that $T$ is of a manageable size will still allow finding the more extreme eigenvalues and eigenvectors of $A$; in the $m \ll n$ region, the Lanczos algorithm can be viewed as a lossy compression scheme for Hermitian matrices that emphasises preserving the extreme eigenvalues.
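
To illustrate the continuant recursion from the first bullet above (a sketch; the indexing convention, with diagonal alpha of length $m$ and off-diagonal beta of length $m - 1$, is an assumption of this example):

```python
import numpy as np
from scipy.linalg import eigh_tridiagonal

def char_poly_at(alpha, beta, lam):
    """Evaluate det(T - lam I) for the symmetric tridiagonal T using
    the continuant (three-term) recursion
        p_0 = 1,  p_1 = alpha_1 - lam,
        p_j = (alpha_j - lam) p_{j-1} - beta_j^2 p_{j-2},
    which costs O(m) per evaluation point.
    """
    p_prev, p = 1.0, alpha[0] - lam
    for j in range(1, len(alpha)):
        p_prev, p = p, (alpha[j] - lam) * p - beta[j - 1] ** 2 * p_prev
    return p

# SciPy also ships a specialised direct solver for the same matrix:
# theta, S = eigh_tridiagonal(alpha, beta)
```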

The combination of good performance for sparse matrices and the ability to compute several (without computing all) eigenvalues are the main reasons for choosing to use the Lanczos algorithm.

Application to tridiagonalization

Though the eigenproblem is often the motivation for applying the Lanczos algorithm, the operation the algorithm primarily performs is tridiagonalization of a matrix. For that, the numerically stable Householder transformations have been favoured ever since the 1950s, and during the 1960s the Lanczos algorithm was disregarded. Interest in it was rejuvenated by the Kaniel–Paige theory and the development of methods to prevent numerical instability, but the Lanczos algorithm remains the alternative algorithm that one tries only if Householder is not satisfactory.[4]

Aspects in which the two algorithms differ include:

  • Lanczos takes advantage of $A$ being a sparse matrix, whereas Householder does not, and will generate fill-in.
  • Lanczos works throughout with the original matrix $A$ (and has no problem with it being known only implicitly), whereas raw Householder rather wants to modify the matrix during the computation (although that can be avoided).
  • Each iteration of the Lanczos algorithm produces another column of the final transformation matrix $V$, whereas an iteration of Householder rather produces another factor in a unitary factorisation $Q_1 Q_2 \cdots Q_n$ of $V$. Each factor is however determined by a single vector, so the storage requirements are the same for both algorithms, and $V = Q_1 Q_2 \cdots Q_n$ can be computed in $O(n^3)$ time.
  • Householder is numerically stable, whereas raw Lanczos is not.

130.243.68.122 (talk) 14:46, 26 May 2017 (UTC)

Define variables

It would be nice if variables were defined before (or just after) being used. For example, at the beginning, a couple of variables are used without being defined, and it's confusing for the non-expert public.

Felipebm (talk) 13:34, 17 May 2011 (UTC)

problematic matrix decomposition

In the section "Power method for finding eigenvalues", the matrix A is represented as $A = U D U^*$, which is true only for normal matrices. For the general case, the SVD decomposition should be used, i.e. $A = U \Sigma V^*$ where U and V are some orthogonal matrices. — Preceding unsigned comment added by 89.139.52.157 (talk) 12:14, 24 April 2016 (UTC)

It's not stated explicitly at that point, but presumably $A$ is already taken to be Hermitian (as it needs to be for the Lanczos algorithm to work), which means it has an eigendecomposition of the form stated. Instead using the SVD decomposition in this argument won't work, because the entire point is that $U^* U = I$, so that the product $A^k = U D^k U^*$ telescopes! Possibly it would be clearer to just use $A = U D U^{-1}$, i.e., hold off on requiring orthogonality — the reason being that the paragraph in question is about the plain power method, which applies in greater generality. 130.243.68.202 (talk) 13:01, 2 May 2017 (UTC)
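
Spelling out the telescoping referred to here: with $A = U D U^*$ and $U^* U = I$,

$$A^k = (U D U^*)(U D U^*) \cdots (U D U^*) = U D (U^* U) D (U^* U) \cdots D U^* = U D^k U^*,$$

whereas with an SVD $A = U \Sigma V^*$ the inner factors $V^* U$ do not cancel, so no comparable closed form for $A^k$ arises. The same cancellation works for $A = U D U^{-1}$, using $U^{-1} U = I$.
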
  1. ^ Cullum; Willoughby. Lanczos Algorithms for Large Symmetric Eigenvalue Computations. Vol. 1. ISBN 0-8176-3058-9.
  2. ^ Yousef Saad. Numerical Methods for Large Eigenvalue Problems. ISBN 0-470-21820-7.
  3. ^ Coakley, Ed S.; Rokhlin, Vladimir (2013). "A fast divide-and-conquer algorithm for computing the spectra of real symmetric tridiagonal matrices". Applied and Computational Harmonic Analysis. 34: 379–414. doi:10.1016/j.acha.2012.06.003.
  4. ^ Golub, Gene H.; Van Loan, Charles F. (1996). Matrix Computations (3rd ed.). Baltimore: Johns Hopkins Univ. Press. ISBN 0-8018-5413-X.