The workflow of the blocked-Davidson iterative matrix diagonalization scheme implemented in VASP is as follows:[1][2]
- Take a subset (block) of
orbitals out of the total set of NBANDS orbitals:
.
- Extend the subspace spanned by
by adding the preconditioned residual vectors of
:
data:image/s3,"s3://crabby-images/eaf61/eaf6111da2b24962dde6d5080b409fb480ee7e90" alt="{\displaystyle
\left \{ \psi^1_k \, / \, g^1_k = \left (1- \sum_{n=1}^{N_{\rm bands}} | \psi_n \rangle \langle\psi_n | {\bf S} \right) {\bf K} \left ({\bf H} - \epsilon_{\rm app} {\bf S} \right ) \psi^1_k \, | \, k=1,..,n_1 \right \}.
}"
- Rayleigh-Ritz optimization ("subspace rotation") within the
-dimensional space spanned by
, to determine the
lowest eigenvectors:
data:image/s3,"s3://crabby-images/a53b9/a53b9c8f797683c0333262066f9c4a5b5522e432" alt="{\displaystyle {\rm diag}\{\psi^1/g^1\} \Rightarrow \{ \psi^2_k| k=1,..,n_1\}}"
- Extend the subspace with the residuals of
:
data:image/s3,"s3://crabby-images/f46eb/f46eb2ede0759690c8261762961881ac30e00ed3" alt="{\displaystyle
\left \{ \psi^2_k \,/ \, g^1_k \, / \, g^2_k = \left (1- \sum_{n=1}^{N_{\rm bands}} | \psi_n \rangle \langle\psi_n | {\bf S} \right ) {\bf K} \left ({\bf H} - \epsilon_{\rm app} {\bf S} \right) \psi^2_k \, | \, k=1,..,n_1 \right \}.
}"
- Rayleigh-Ritz optimization ("subspace rotation") within the
-dimensional space spanned by
:
data:image/s3,"s3://crabby-images/be5f9/be5f9a702ee9b4e9c9247cdecdb741569b2f68ea" alt="{\displaystyle {\rm diag}\{\psi^1/g^1/g^2\} \Rightarrow \{ \psi^3_k| k=1,..,n_1\}}"
- If need be the subspace may be extended by repetition of this cycle of adding residual vectors and Rayleigh-Ritz optimization of the resulting subspace:
data:image/s3,"s3://crabby-images/e360a/e360ad4eeb8d766e89ddc5ef099f65bd7494c749" alt="{\displaystyle {\rm diag}\{\psi^1/g^1/g^2/../g^{d-1}\}\Rightarrow \{ \psi^d_k| k=1,..,n_1\}}"
- Per default VASP will not iterate deeper than
, though it may break off even sooner when certain criteria that measure the convergence of the orbitals have been met.
- When the iteration is finished, store the optimized block of orbitals back into the set:
.
- Move on to the next block
.
- When LDIAG=.TRUE. (default), a Rayleigh-Ritz optimization in the complete subspace
is performed after all orbitals have been optimized.
The blocksize
used in the blocked-Davidson algorithm can be set by means of the NSIM tag.
In principle
NSIM, but for technical reasons it needs to be dividable by an integer N:
data:image/s3,"s3://crabby-images/40134/40134f43204b8fac396aa5769b37d622b3aa4c61" alt="{\displaystyle
n_1 = {\rm int}\left(\frac{2*{\rm NSIM} + N - 1}{N}\right) N
}"
where
is the "number of band groups per k-point group":
data:image/s3,"s3://crabby-images/434de/434de6a00b82f0bcc0c20223fee52f7a39d5d0a3" alt="{\displaystyle
N = \frac{{\rm \#\; of\; MPI\; ranks}}{{\rm IMAGES}*{\rm KPAR}*{\rm NCORE}}
}"
(see the section on parallelization basics).
As mentioned before, the optimization of a block of orbitals is stopped when either the maximum iteration depth (NRMM), or a certain convergence threshold has been reached. The latter may be fine-tuned by means of the EBREAK, DEPER, and WEIMIN tags. Note: we do not recommend you to do so! Rather rely on the defaults instead.
The blocked-Davidson algorithm is approximately a factor of 1.5-2 slower than the RMM-DIIS, but more robust.
References