ML EPS REG: Difference between revisions

From VASP Wiki

Revision as of 17:17, 23 March 2022

ML_EPS_REG = [real]
Default: ML_EPS_REG = 1E-14 

Description: Initial value for the threshold of the eigenvalues of the covariance matrix in the evidence approximation.


This threshold determines which eigenvalues of the covariance matrix are used in the optimization of the regularization parameters <math>\sigma_{v}^{2}</math> and <math>\sigma_{w}^{2}</math>, which are determined by the following equations

.

All eigenvalues satisfying <math>\lambda_{i} / \lambda_{\mathrm{max}}</math> > ML_EPS_REG are included in the above equations, whereas the other eigenvalues are disregarded.
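
The selection criterion can be illustrated with a minimal Python sketch. This is not VASP code: the function name `select_eigenvalues` and the sample eigenvalues are hypothetical and serve only to show how a relative threshold separates included from disregarded eigenvalues.

```python
import numpy as np

def select_eigenvalues(eigenvalues, ml_eps_reg=1e-14):
    """Return the eigenvalues whose ratio to the largest one exceeds ml_eps_reg.

    Hypothetical helper for illustration; not part of VASP.
    """
    lam = np.asarray(eigenvalues, dtype=float)
    lam_max = lam.max()
    # Keep lambda_i only if lambda_i / lambda_max > ML_EPS_REG.
    mask = lam / lam_max > ml_eps_reg
    return lam[mask]

# Made-up eigenvalues spanning many orders of magnitude.
lam = np.array([2.0, 1.0e-3, 1.0e-13, 1.0e-14, 1.0e-20])

kept_default = select_eigenvalues(lam)          # threshold 1E-14
kept_lower = select_eigenvalues(lam, 1e-15)     # threshold 1E-15
print(len(kept_default), len(kept_lower))
```

In this made-up spectrum, lowering the threshold from 1E-14 to 1E-15 admits one additional small eigenvalue into the sums.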

The smaller the value of ML_EPS_REG, the smaller the error of the fit can potentially become. At the same time, the danger of overfitting increases. We determined empirically that the default of ML_EPS_REG=1E-14 is fairly safe. Users who need higher accuracy of the force field can try ML_EPS_REG=1E-15 (lower values are not recommended), but should very carefully monitor the convergence of the calculations as described below.

If at any point during the iterations the approximation of the square of the quadratic norm of errors (eighth column of REGR/REGRF in ML_LOGFILE) becomes too large (more than 1.2 times larger than in previous iterations), ML_EPS_REG is automatically doubled. Whenever the regression does not converge within 10 steps, ML_EPS_REG is increased by a factor of 4. The 10-step counter is reset whenever ML_EPS_REG is changed. The maximum allowed iteration depth is 50 (the iteration number is the second entry of REGR/REGRF in ML_LOGFILE). When this value is reached, no force field is created, and there is most likely something seriously wrong in the calculation.

The seventh entry of REGR/REGRF in the ML_LOGFILE shows the ratio of the regularization (<math>\sigma_{v}^{2}/ \sigma_{w}^{2}</math>) and the largest eigenvalue. Usually this number has many varying digits. If it becomes a "well rounded" number (e.g. 1.00000000E-14), this indicates that the cap for the current ML_EPS_REG has been reached. That means that regularization becomes crucial.
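
A simple way to flag such "well rounded" values when inspecting the numbers by script is sketched below. The helper `is_well_rounded` and its cutoff of two significant digits are assumptions made for illustration; the function takes a value that has already been read from the log and does not parse ML_LOGFILE itself.

```python
def is_well_rounded(value, max_digits=2):
    """Heuristic: True if value has at most max_digits significant digits
    when printed with eight decimals in exponent notation.

    Hypothetical helper; the cutoff max_digits=2 is an arbitrary choice.
    """
    mantissa = f"{value:.8E}".split("E")[0]      # e.g. "1.00000000"
    digits = mantissa.replace(".", "").replace("-", "").rstrip("0")
    return len(digits) <= max_digits

print(is_well_rounded(1.0e-14))          # value sitting exactly at a cap
print(is_well_rounded(3.48215907e-13))   # value with many varying digits
```

A value flagged this way is only a hint that the ratio has hit the current cap; the surrounding REGR/REGRF entries should still be checked by eye.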

Related Tags and Sections

ML_LMLFF, ML_IALGO_LINREG, ML_IREG, ML_SIGV0, ML_SIGW0

Examples that use this tag