ML EPS REG: Difference between revisions

From VASP Wiki
No edit summary
No edit summary
Line 19: Line 19:
All eigenvalues satisfying <math>\lambda_{i} / \lambda_{\mathrm{max}} </math> > {{TAG|ML_EPS_REG}} are contributing by the above equations.
All eigenvalues satisfying <math>\lambda_{i} / \lambda_{\mathrm{max}} </math> > {{TAG|ML_EPS_REG}} are contributing by the above equations.


The smaller the value of {{TAG|ML_EPS_REG}} the the smaller the error of the fit. But at the same time the effects of overfitting increase. We determined empirically that the default value of 1E-14 is a safe value in most cases. If a user desires higher accuracy of the force field, the user can try to set {{TAG|ML_EPS_REG}}=1E-15 or lower, but the user should very carefully monitor the convergence of the calculations as described below.  
The smaller the value of {{TAG|ML_EPS_REG}} the smaller the error of the fit. But at the same time the effects of over fitting increase. We determined empirically that the default of {{TAG|ML_EPS_REG}}1E-14 is safe in most cases. If a user desires higher accuracy of the force field, the user can try to set {{TAG|ML_EPS_REG}}=1E-15 or lower, but the user should very carefully monitor the convergence of the calculations as described below.  


If at any point in the iterations of the Evidence Approximation  of the square of the quadratic norm of errors (eigth column of <code>REGR/REGRF</code> in [[ML_LOGFILE]]) gets too big (more than 1.2 times larger than before) then {{TAG|ML_EPS_REG}} is doubled. Whenever the regression doesn't converge within 10 steps {{TAG|ML_EPS_REG}} is increased by a factor of 4. The counter for the 10 strucures is reset whenever {{TAG|ML_EPS_REG}} is changed. The maximum allowed iteration steps are 50, but when this value is reached no force field is created and there is most likely something seriously wrong in the calculation.
If at any point in the iterations of the Evidence Approximation  of the square of the quadratic norm of errors (eigth column of <code>REGR/REGRF</code> in [[ML_LOGFILE]]) gets too big (more than 1.2 times larger than before) then {{TAG|ML_EPS_REG}} is doubled. Whenever the regression doesn't converge within 10 steps {{TAG|ML_EPS_REG}} is increased by a factor of 4. The counter for the 10 strucures is reset whenever {{TAG|ML_EPS_REG}} is changed. The maximum allowed iteration steps are 50 (the number of each step is the second entry of <code>REGR/REGRF</code> in [[ML_LOGFILE]]), but when this value is reached no force field is created and there is most likely something seriously wrong in the calculation.


The seventh entry of <code>REGR/REGRF</code> in the [[ML_LOGFILE]] shows the ratio of the regularization (<math>\sigma_{v}^{2}/ \sigma_{w}^{2}</math>) and the largest eigenvalue. Usually this number is a number with many varying digits. If this number becomes a "well rounded" number (e.g. 1.00000000E-14), it is an indication that the cap for the current {{TAG|ML_EPS_REG}} is reached. That means that regularization becomes crucial.  
The seventh entry of <code>REGR/REGRF</code> in the [[ML_LOGFILE]] shows the ratio of the regularization (<math>\sigma_{v}^{2}/ \sigma_{w}^{2}</math>) and the largest eigenvalue. Usually this number is a number with many varying digits. If this number becomes a "well rounded" number (e.g. 1.00000000E-14), it is an indication that the cap for the current {{TAG|ML_EPS_REG}} is reached. That means that regularization becomes crucial.  

Revision as of 10:59, 11 January 2022

ML_EPS_REG = [real]
Default: ML_EPS_REG = 1E-14 

Description: Initial value for the threshold of the eigenvalues of the covariance matrix in the evidence approximation.


This threshold is used to determine which eigenvalues of the covariance matrix are used in the optimization of the regularization parameters and determined by the following equations

.

All eigenvalues satisfying > ML_EPS_REG are contributing by the above equations.

The smaller the value of ML_EPS_REG the smaller the error of the fit. But at the same time the effects of over fitting increase. We determined empirically that the default of ML_EPS_REG1E-14 is safe in most cases. If a user desires higher accuracy of the force field, the user can try to set ML_EPS_REG=1E-15 or lower, but the user should very carefully monitor the convergence of the calculations as described below.

If at any point in the iterations of the Evidence Approximation of the square of the quadratic norm of errors (eigth column of REGR/REGRF in ML_LOGFILE) gets too big (more than 1.2 times larger than before) then ML_EPS_REG is doubled. Whenever the regression doesn't converge within 10 steps ML_EPS_REG is increased by a factor of 4. The counter for the 10 strucures is reset whenever ML_EPS_REG is changed. The maximum allowed iteration steps are 50 (the number of each step is the second entry of REGR/REGRF in ML_LOGFILE), but when this value is reached no force field is created and there is most likely something seriously wrong in the calculation.

The seventh entry of REGR/REGRF in the ML_LOGFILE shows the ratio of the regularization () and the largest eigenvalue. Usually this number is a number with many varying digits. If this number becomes a "well rounded" number (e.g. 1.00000000E-14), it is an indication that the cap for the current ML_EPS_REG is reached. That means that regularization becomes crucial.


Related Tags and Sections

ML_LMLFF, ML_IALGO_LINREG, ML_IREG, ML_SIGV0, ML_SIGW0

Examples that use this tag