Category:Bethe-Salpeter equations: Difference between revisions

From VASP Wiki
No edit summary
No edit summary
(43 intermediate revisions by 2 users not shown)
Line 1: Line 1:
The formalism of the Bethe-Salpeter equation (BSE) allows for calculating the polarizability with the electron-hole interaction and constitutes the state of the art for calculating absorption spectra in solids.
== Theory ==
== Theory ==
=== Bethe-Salpeter equation ===
In the BSE, the excitation energies correspond to the eigenvalues <math>\omega_\lambda</math> of the following linear problem{{cite|sander:prb:15}}
::<math>
\left(\begin{array}{cc}
\mathbf{A} & \mathbf{B} \\
\mathbf{B}^* & \mathbf{A}^*
\end{array}\right)\left(\begin{array}{l}
\mathbf{X}_\lambda \\
\mathbf{Y}_\lambda
\end{array}\right)=\omega_\lambda\left(\begin{array}{cc}
\mathbf{1} & \mathbf{0} \\
\mathbf{0} & -\mathbf{1}
\end{array}\right)\left(\begin{array}{l}
\mathbf{X}_\lambda \\
\mathbf{Y}_\lambda
\end{array}\right)~.
</math>
The matrices <math>A</math> and <math>A^*</math> describe the resonant and anti-resonant transitions between the occupied <math>v,v'</math> and unoccupied <math>c,c'</math> states
::<math>
A_{vc}^{v'c'} = (\varepsilon_v-\varepsilon_c)\delta_{vv'}\delta_{cc'} + \langle cv'|V|vc'\rangle - \langle cv'|W|c'v\rangle.
</math>
The energies and orbitals of these states are usually obtained in a <math>G_0W_0</math> calculation, but DFT and Hybrid functional calculations can be used as well.
The electron-electron interaction and electron-hole interaction are described via the bare Coulomb <math>V</math> and the screened potential <math>W</math>.
The coupling between resonant and anti-resonant terms is described via terms <math>B</math> and <math>B^*</math>
::<math>
B_{vc}^{v'c'} = \langle vv'|V|cc'\rangle - \langle vv'|W|c'c\rangle.
</math>
Due to the presence of this coupling, the Bethe-Salpeter Hamiltonian is non-Hermitian.
=== Tamm-Dancoff approximation ===
A common approximation to the BSE is the Tamm-Dancoff approximation (TDA), which neglects the coupling between resonant and anti-resonant terms, i.e., <math>B</math> and <math>B^*</math>.
Hence, the TDA reduces the BSE to a Hermitian problem
::<math>
AX_\lambda=\omega_\lambda X_\lambda~.
</math>
In reciprocal space, the matrix <math>A</math> is written as 
::<math>
A_{vc\mathbf{k}}^{v'c'\mathbf{k}'} = (\varepsilon_v-\varepsilon_c)\delta_{vv'}\delta_{cc'}\delta_{\mathbf{kk}'}+
\frac{2}{\Omega}\sum_{\mathbf{G}\neq0}\bar{V}_\mathbf{G}(\mathbf{q})\langle c\mathbf{k}|e^{i\mathbf{Gr}}|v\mathbf{k}\rangle\langle v'\mathbf{k}'|e^{-i\mathbf{Gr}}|c'\mathbf{k}'\rangle
  -\frac{2}{\Omega}\sum_{\mathbf{G,G}'}W_{\mathbf{G,G}'}(\mathbf{q},\omega)\delta_{\mathbf{q,k-k}'}
\langle c\mathbf{k}|e^{i(\mathbf{q+G})}|c'\mathbf{k}'\rangle
\langle v'\mathbf{k}'|e^{-i(\mathbf{q+G})}|v\mathbf{k}\rangle,
</math>
where <math>\Omega</math> is the cell volume, <math>\bar{V}</math> is the bare Coulomb potential without the long-range part
::<math>
\bar{V}_{\mathbf{G}}(\mathbf{q})=\begin{cases}
    0 & \text { if } G=0 \\
    V_{\mathbf{G}}(\mathbf{q})=\frac{4 \pi}{|\mathbf{q}+\mathbf{G}|^2} & \text { else }
\end{cases}~,
</math>
and the screened Coulomb potential
<math>
W_{\mathbf{G}, \mathbf{G}^{\prime}}(\mathbf{q}, \omega)=\frac{4 \pi \epsilon_{\mathbf{G}, \mathbf{G}^{\prime}}^{-1}(\mathbf{q}, \omega)}{|\mathbf{q}+\mathbf{G}|\left|\mathbf{q}+\mathbf{G}^{\prime}\right|}.
</math>
Here, the dielectric function <math>\epsilon_\mathbf{G,G'}(\mathbf{q})</math> describes the screening in <math>W</math> within the random-phase approximation (RPA)
::<math>
\epsilon_{\mathbf{G}, \mathbf{G}^{\prime}}^{-1}(\mathbf{q}, \omega)=\delta_{\mathbf{G}, \mathbf{G}^{\prime}}+\frac{4 \pi}{|\mathbf{q}+\mathbf{G}|^2} \chi_{\mathbf{G}, \mathbf{G}^{\prime}}^{\mathrm{RPA}}(\mathbf{q}, \omega).
</math>
Although the dielectric function is frequency-dependent, the static approximation <math>W_{\mathbf{G}, \mathbf{G}^{\prime}}(\mathbf{q}, \omega=0)</math> is considered a standard for practical BSE calculations.
== Scaling ==
The scaling of the BSE equation strongly limits its application for large systems. The main limiting factor is the diagonalization of the BSE Hamiltonian. The rank of the Hamiltonian is
:<math>N_{\rm rank} = N_k\times N_c\times N_v</math>,
where <math>N_k</math> is the number of k-points in the Brillouin zone and <math>N_c</math> and <math>N_v</math> are the number of conduction and valence bands, respectively.  The diagonalization of the matrix scales cubically with the matrix rank, i.e.,  <math>N_{\rm rank}^3</math>.
Despite the fact that this matrix diagonalization is usually the bottleneck for bigger systems, the construction of the BSE Hamiltonian also scales unfavorably and can play a dominant role in big systems, i.e.,
:<math>N_k\times N_q\times (N_v\times N_v\times N_G\times N_c\times N_c)</math>,
where <math>N_q</math> is the number of q-points and <math>N_G</math> number of G-vectors.
== Exact diagonalization ==
The diagonalization of the BSE Hamiltonian can be perform using various eigensolvers provided in ScaLAPACK, ELPA, and cuSolver libraries. The advantage of this approach is that the eigenvectors can be directly obtained and used for the analysis of the excitons.
Using the eigenvalues <math>\omega_\lambda</math> and eigenvectors <math>X_\lambda</math> of the BSE Hamiltonian, the macroscopic dielectric which accounts for the excitonic effects can be found
::<math>
\epsilon_M(\mathbf{q},\omega)=
1+2\lim_{\mathbf{q}\rightarrow 0}v(q)\sum_{\lambda}
\left|\sum_{c,v,k}\langle c\mathbf{k}|e^{i\mathbf{qr}}|v\mathbf{k}\rangle X_\lambda^{cv\mathbf{k}}\right|^2
\times
\left(\frac{1}{\omega_\lambda - \omega - i\delta}\right)~.
</math>
The following features are currently supported:
* [[BSE calculations#Bethe-Salpeter equation calculation|Calculating the dielectric function and eigenvectors]]
* [[BSE calculations#Calculations beyond Tamm-Dancoff approximation|Calculations beyond Tamm-Dancoff approximation]]
* [[BSE calculations#Calculations at finite wavevectors|Calculations of <math>\varepsilon(\mathbf{q},\omega)</math> for <math>\mathbf{q}\neq0</math>]]
* [[Plot BSE fatbands|Fatband plot]]
== Time evolution ==
Alternatively, it is possible to use the time-evolution algorithm which applies a short Dirac delta pulse of electric field and then follows the evolution of the dipole moments.
The dielectric function is found via a Fourier transform {{cite|sander:jcp:2017}}
::<math>
\epsilon_M(\omega)=1-\frac{4\pi}{\Omega}\int_0^{\infty} \mathrm{d} t
\sum_{c,v,\mathbf{k}}\left(\langle\mu_{cv\mathbf{k}}| \xi_{cv\mathbf{k}}(t)\rangle+c . c .\right) e^{-i(\omega-i \delta) t}
</math>,
where <math>\mu</math> and <math>\xi(t)</math> are the dipole moments.
The solution found this way is strictly equivalent to the same solution as the exact diagonalization and can be used for obtaining the absorption spectrum, but does not yield the eigenvectors, which can be limiting for the analysis of the excitons. The advantage of this approach is the quadratic scaling with the size of the BSE Hamiltonian <math>N_{rank}^2</math>.
The time-evolution algorithm can be selected by setting {{TAG|IBSE}} = 1 in a BSE calculation. The required number of steps in the time-evolution calculation depends on the broadening {{TAG|CSHIFT}} and the maximum energy {{TAG|OMEGAMAX}}. The precision can be selected via tag {{TAG|BSEPREC}}.
{{NB|mind|The required number of steps does not depend on the size of the Hamiltonian|}}
<!--{{NB|warning|The eigenvectors of the BSE Hamiltonian are not available in the time-evolution algorithm and the optical transitions and not written into {{FILE|vasprun.xml}}|}}-->
The following features are currently supported:
* Calculating the dielectric function
* Calculations beyond the Tamm-Dancoff approximation
==Lanczos algorithm==
The expression for the dielectric function can be re-written as a continued fraction
::<math>
\epsilon_{\alpha\beta}(\omega) = \delta_{\alpha\beta} - \frac{4\pi}{\Omega}\cfrac{|u_0|^2}{(\omega - a_1 + \mathrm i\eta) - \cfrac{b_1^2}{(\omega -a_2 + \mathrm i\eta)
    - \cfrac{b_2^2}{...}}},
</math>
where <math>|u_0\rangle</math> is an initial guess vector computed from the dipole moments, <math>|u_0\rangle = \sum_{cv\mathbf{k}} \langle c\mathbf{k}|r_\alpha|v\mathbf{k}\rangle \langle v\mathbf{k}|r_\beta|c\mathbf{k}\rangle</math>. The <math>a</math> and <math>b</math> coefficients are evaluated iteratively, with the iterative algorithm stopping once the difference between <math>\epsilon(\omega)</math> from two consecutive iterations is below a certain threshold, set by {{TAG|LANCZOSTHR}}.
Using the dipole moments as the starting point means that the iterative algorithm is sensitive only to optically active transitions, i.e. <math>v\to c</math> transitions with non-zero dipole moment. As such, the algorithm will ignore optically inactive transitions and can reach convergence faster than other methods for larger matrices.
The following features are currently supported:
* Calculating the dielectric function
* Calculating the eigenvalues of bright excitonic states
<!--
expression with the u_0 vector explicitly written
<math\delta_{\alpha\beta} - \frac{4\pi}{\Omega}\sum_{cv\mathbf{k}} \langle|c\mathbf{k}|r_\alpha|v\mathbf{k}\rangle
    \langle | v\mathbf{k}|r_\beta|c\mathbf{k}\rangle
    \cfrac{1}{(\omega - a_1 + \mathrm i\eta) - \cfrac{b_1^2}{(\omega -a_2 + \mathrm i\eta) - \cfrac{b_2^2}{...}}}
</math>
-->
== Performing BSE calculations on GPU ==
As of VASP 6.5, the BSE/TDHF calculations with {{TAGO|IBSE|1}} or {{TAGO|IBSE|2}} can be fully run on NVIDIA GPUs.
To be able to offload the BSE calculations to GPUs one has to compile VASP with the [https://docs.nvidia.com/cuda/cusolvermp cuSOLVERMp] and [https://docs.nvidia.com/cuda/cublasmp cuBLASMp] libraries provided with NVHPC-SDK 24.7 or newer.
To be able to use these libraries VASP has to be compiled with HPC-X (MPI shipped with NVHPC-SDK), which can be loaded via
module load nvhpc-hpcx-cuda12/24.7
To enable these libraries in VASP, make sure to include the following lines in your <code>makefile.include</code>
CPP_OPTIONS+= -DCUSOLVERMP -DCUBLASMP
LLIBS      += -cudalib=cusolvermp,cublasmp -lnvhpcwrapcal
To be able to perform the BSE calculation on GPUs, VASP needs to store the full BSE Hamiltonian in the GPU memory, which is often the limiting factor. The memory required to store the BSE Hamiltonian can be estimated as <math>N_{\rm rank}^2\times 16\cdot 10^{-9}</math> in Gb for {{TAGO|ANTIRES|0}}. In the case of exact diagonalization {{TAGO|IBSE|2}}, the eigensolver requires an additional scratch space.


== How to ==
== How to ==


* [[BSE calculations]]
* Practical guide for solving the Bethe-Salpeter equation via diagonalization [[BSE calculations]]
* Practical guide for solving the Casida equation via diagonalization [[TDDFT calculations]]


== References ==
== References ==


[[Category:VASP|Bethe-Salpeter equation]][[Category:Many-body perturbation theory]]
[[Category:VASP|Bethe-Salpeter equation]][[Category:Many-body perturbation theory]]

Revision as of 09:50, 18 December 2024

The formalism of the Bethe-Salpeter equation (BSE) allows for calculating the polarizability with the electron-hole interaction and constitutes the state of the art for calculating absorption spectra in solids.

Theory

Bethe-Salpeter equation

In the BSE, the excitation energies correspond to the eigenvalues of the following linear problem[1]


The matrices and describe the resonant and anti-resonant transitions between the occupied and unoccupied states

The energies and orbitals of these states are usually obtained in a calculation, but DFT and Hybrid functional calculations can be used as well. The electron-electron interaction and electron-hole interaction are described via the bare Coulomb and the screened potential .

The coupling between resonant and anti-resonant terms is described via terms and

Due to the presence of this coupling, the Bethe-Salpeter Hamiltonian is non-Hermitian.

Tamm-Dancoff approximation

A common approximation to the BSE is the Tamm-Dancoff approximation (TDA), which neglects the coupling between resonant and anti-resonant terms, i.e., and . Hence, the TDA reduces the BSE to a Hermitian problem

In reciprocal space, the matrix is written as

where is the cell volume, is the bare Coulomb potential without the long-range part

and the screened Coulomb potential

Here, the dielectric function describes the screening in within the random-phase approximation (RPA)

Although the dielectric function is frequency-dependent, the static approximation is considered a standard for practical BSE calculations.


Scaling

The scaling of the BSE equation strongly limits its application for large systems. The main limiting factor is the diagonalization of the BSE Hamiltonian. The rank of the Hamiltonian is

,

where is the number of k-points in the Brillouin zone and and are the number of conduction and valence bands, respectively. The diagonalization of the matrix scales cubically with the matrix rank, i.e., .

Despite the fact that this matrix diagonalization is usually the bottleneck for bigger systems, the construction of the BSE Hamiltonian also scales unfavorably and can play a dominant role in big systems, i.e.,

,

where is the number of q-points and number of G-vectors.

Exact diagonalization

The diagonalization of the BSE Hamiltonian can be perform using various eigensolvers provided in ScaLAPACK, ELPA, and cuSolver libraries. The advantage of this approach is that the eigenvectors can be directly obtained and used for the analysis of the excitons. Using the eigenvalues and eigenvectors of the BSE Hamiltonian, the macroscopic dielectric which accounts for the excitonic effects can be found

The following features are currently supported:

Time evolution

Alternatively, it is possible to use the time-evolution algorithm which applies a short Dirac delta pulse of electric field and then follows the evolution of the dipole moments. The dielectric function is found via a Fourier transform [2]

,

where and are the dipole moments.

The solution found this way is strictly equivalent to the same solution as the exact diagonalization and can be used for obtaining the absorption spectrum, but does not yield the eigenvectors, which can be limiting for the analysis of the excitons. The advantage of this approach is the quadratic scaling with the size of the BSE Hamiltonian .

The time-evolution algorithm can be selected by setting IBSE = 1 in a BSE calculation. The required number of steps in the time-evolution calculation depends on the broadening CSHIFT and the maximum energy OMEGAMAX. The precision can be selected via tag BSEPREC.

Mind: The required number of steps does not depend on the size of the Hamiltonian

The following features are currently supported:

  • Calculating the dielectric function
  • Calculations beyond the Tamm-Dancoff approximation


Lanczos algorithm

The expression for the dielectric function can be re-written as a continued fraction

where is an initial guess vector computed from the dipole moments, . The and coefficients are evaluated iteratively, with the iterative algorithm stopping once the difference between from two consecutive iterations is below a certain threshold, set by LANCZOSTHR.

Using the dipole moments as the starting point means that the iterative algorithm is sensitive only to optically active transitions, i.e. transitions with non-zero dipole moment. As such, the algorithm will ignore optically inactive transitions and can reach convergence faster than other methods for larger matrices.

The following features are currently supported:

  • Calculating the dielectric function
  • Calculating the eigenvalues of bright excitonic states

Performing BSE calculations on GPU

As of VASP 6.5, the BSE/TDHF calculations with IBSE = 1 or IBSE = 2 can be fully run on NVIDIA GPUs. To be able to offload the BSE calculations to GPUs one has to compile VASP with the cuSOLVERMp and cuBLASMp libraries provided with NVHPC-SDK 24.7 or newer. To be able to use these libraries VASP has to be compiled with HPC-X (MPI shipped with NVHPC-SDK), which can be loaded via

module load nvhpc-hpcx-cuda12/24.7

To enable these libraries in VASP, make sure to include the following lines in your makefile.include

CPP_OPTIONS+= -DCUSOLVERMP -DCUBLASMP
LLIBS      += -cudalib=cusolvermp,cublasmp -lnvhpcwrapcal

To be able to perform the BSE calculation on GPUs, VASP needs to store the full BSE Hamiltonian in the GPU memory, which is often the limiting factor. The memory required to store the BSE Hamiltonian can be estimated as in Gb for ANTIRES = 0. In the case of exact diagonalization IBSE = 2, the eigensolver requires an additional scratch space.

How to

  • Practical guide for solving the Bethe-Salpeter equation via diagonalization BSE calculations
  • Practical guide for solving the Casida equation via diagonalization TDDFT calculations

References