Direct optimization of the orbitals: Difference between revisions
Vaspmaster (talk | contribs) (Created blank page) |
Vaspmaster (talk | contribs) No edit summary |
||
(29 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
With "direct optimization of the orbitals" we denote a category of electronic minimization algorithms that use the gradient of the free energy with respect to the orbitals to move towards the ground state of the system: the orbitals are changed such that the total energy is lowered, using, ''e.g.'' the conjugate gradient approximation, or damped molecular dynamics. | |||
In direct optimization methods, the orthogonality constraints are directly incorporated into the functional that is optimized, through Lagrange multipliers. | |||
So in fact, instead of minimizing the [[Electronic_minimization#KohnShamFreeEnergy | Kohn-Sham free energy functional]] ''F'', one minimizes the following Langrangian: | |||
:<math> | |||
\bar{F} = F - \sum_{mn} \gamma_{mn} \left( \langle \psi_m |S| \psi_n \rangle - \delta_{mn} \right) - | |||
\mu \left( \sum_n f_n - N_{\rm el} \right) | |||
</math> | |||
The gradient of this Langrangian with respect to an orbital <math>\psi_n</math>, is given by: | |||
:<math> | |||
| g_n \rangle = f_n \Big(1-\sum^N_{m=1} \hat{S} \vert \psi_m \rangle \langle \psi_m \vert\Big) \hat{H} \vert \psi_n \rangle + | |||
\sum^N_{m=1} \frac{1}{2} {\bf H}_{nm} (f_n - f_m) \hat{S} \vert \psi_m \rangle | |||
</math> | |||
where <math>\{ f_i | i=1,..,N \}</math> are the partial occupancies, and | |||
:<math> | |||
{\bf H}_{nm}=\langle \psi_m \vert \hat{H} \vert \psi_n \rangle | |||
</math> | |||
is the Hamiltonian expressed within the subspace spanned by the current orbitals <math>\{ \psi_i | i=1,..,N \}</math>.{{cite|kresse:cms:1996}} | |||
The structure of the gradient may be understood as follows: the first part on the right-hand side describes the change of the free energy with respect to changes in the orbitals that are outside (orthogonal) the subspace spanned by the current set of orbitals, | |||
whereas the second part describes the changes of the free energy due to a unitary transformation between the orbitals within this subspace. | |||
To derive a search direction, ''i.e.'', actual change in the orbitals from the gradient these aforementioned parts are treated separately. | |||
The search direction related to the out-of-subspace part of the gradient is: | |||
:<math> | |||
\vert p_n \rangle = f_n K \Big(1-\sum^N_{m=1} \hat{S} \vert \psi_m \rangle \langle \psi_m \vert\Big) \hat{H} \vert \psi_n \rangle | |||
</math> | |||
where <math>K</math> is a [[preconditioning]] function. | |||
The search direction associated with the subspace rotational part of the gradient may be constructed using Loewdin perturbation theory: | |||
:<math> | |||
U_{nm} = \delta_{nm} - \Delta \frac{H_{nm}}{H_{mm}-H_{nn}} | |||
</math> | |||
where <math>\Delta</math> denotes the stepsize.{{cite|gillan:jpc:89}} | |||
Note that taking a step along this search direction amounts to a rotation of the orbitals (''rotation'' on account of <math>U</math> being unitary): | |||
:<math> | |||
\vert \psi_n \rangle = \sum^N_{m=1} U_{nm} \vert \psi_m \rangle | |||
</math> | |||
Per default, however, {{VASP}} constructs a search direction for the subspace rotational part of the gradient in the manner proposed by Freysoldt ''et al.''{{cite|freysoldt:prb:2009}} | |||
Changes in the partial occupancies are computed in accordance with the aforementioned work as well. | |||
When the three contributions to the "search direction" (''out-of-subspace'', ''subspace rotational'', and ''change in the partial occupancies'') have been determined, they are used to update the orbitals and partial occupancies, either by steepest descent, by means of the conjugate-gradient approximation, or using a damped molecular dynamics scheme. | |||
After every change of the orbitals and partial occupancies, the total energy and electronic density are recomputed. | |||
Per default, the electronic density is constructed directly from the orbitals and partial occupancies at each step along the way, without any density mixing. | |||
Optionally, though, density mixing may be used to stabilise these optimisation procedures when charge sloshing occurs. | |||
The direct optimization of the orbitals stops when the change of the total energy drops below {{TAG|EDIFF}}. | |||
Note that, when starting from scratch ({{TAG|ISTART}} = 0), the direct optimization procedures in {{VASP}} always begin with several ({{TAG|NELMDL}}) [[Self-consistency cycle|self-consistency cycles]] where the density is kept fixed at the initial approximation (overlapping atomic charge densities), using the blocked-Davidson algorithm to optimize the orbitals. | |||
This ensures that the orbitals, that are initialised with random numbers, have converged to reasonable starting point for the subsequent direct optimization. | |||
== References == | |||
<references/> | |||
---- | |||
[[Category:Electronic minimization]][[Category:Theory]] |
Latest revision as of 12:02, 15 July 2024
With "direct optimization of the orbitals" we denote a category of electronic minimization algorithms that use the gradient of the free energy with respect to the orbitals to move towards the ground state of the system: the orbitals are changed such that the total energy is lowered, using, e.g. the conjugate gradient approximation, or damped molecular dynamics.
In direct optimization methods, the orthogonality constraints are directly incorporated into the functional that is optimized, through Lagrange multipliers. So in fact, instead of minimizing the Kohn-Sham free energy functional F, one minimizes the following Langrangian:
The gradient of this Langrangian with respect to an orbital , is given by:
where are the partial occupancies, and
is the Hamiltonian expressed within the subspace spanned by the current orbitals .[1]
The structure of the gradient may be understood as follows: the first part on the right-hand side describes the change of the free energy with respect to changes in the orbitals that are outside (orthogonal) the subspace spanned by the current set of orbitals, whereas the second part describes the changes of the free energy due to a unitary transformation between the orbitals within this subspace.
To derive a search direction, i.e., actual change in the orbitals from the gradient these aforementioned parts are treated separately. The search direction related to the out-of-subspace part of the gradient is:
where is a preconditioning function.
The search direction associated with the subspace rotational part of the gradient may be constructed using Loewdin perturbation theory:
where denotes the stepsize.[2] Note that taking a step along this search direction amounts to a rotation of the orbitals (rotation on account of being unitary):
Per default, however, VASP constructs a search direction for the subspace rotational part of the gradient in the manner proposed by Freysoldt et al.[3]
Changes in the partial occupancies are computed in accordance with the aforementioned work as well.
When the three contributions to the "search direction" (out-of-subspace, subspace rotational, and change in the partial occupancies) have been determined, they are used to update the orbitals and partial occupancies, either by steepest descent, by means of the conjugate-gradient approximation, or using a damped molecular dynamics scheme.
After every change of the orbitals and partial occupancies, the total energy and electronic density are recomputed. Per default, the electronic density is constructed directly from the orbitals and partial occupancies at each step along the way, without any density mixing. Optionally, though, density mixing may be used to stabilise these optimisation procedures when charge sloshing occurs.
The direct optimization of the orbitals stops when the change of the total energy drops below EDIFF.
Note that, when starting from scratch (ISTART = 0), the direct optimization procedures in VASP always begin with several (NELMDL) self-consistency cycles where the density is kept fixed at the initial approximation (overlapping atomic charge densities), using the blocked-Davidson algorithm to optimize the orbitals. This ensures that the orbitals, that are initialised with random numbers, have converged to reasonable starting point for the subsequent direct optimization.