Professional Documents
Culture Documents
villard-et-al-2023-plane-waves-versus-correlation-consistent-basis-sets-a-comparison-of-mp2-non-covalent-interaction
villard-et-al-2023-plane-waves-versus-correlation-consistent-basis-sets-a-comparison-of-mp2-non-covalent-interaction
villard-et-al-2023-plane-waves-versus-correlation-consistent-basis-sets-a-comparison-of-mp2-non-covalent-interaction
pubs.acs.org/JCTC Article
calculation of interaction energies is sometimes viewed more as was observed between H2O total energies coming from
an estimate. Such an estimate can be considered neither an different MP2-R12 approximations, and a 1 kcal/mol differ-
upper nor a lower bound for the actual BSSE.8,11,15 ence was identified between different R12 basis sets.26
An additional complication with GTO bases is that their Deviations of the order of 0.1 kcal/mol were also noticed
intrinsic construction is not necessarily systematic as their size when it came to interaction energies.28 Nevertheless, MP2-
increases, which possibly leads to difficulties in converging (CCSD(T))-R12/F12 calculations have so far been the only
properties in a smooth and monotonic way, which is a known CBS references available to assess the reliability of GTO
problem for Hartree−Fock (HF) or correlated methods such extrapolations from the (aug-)cc-pVXZ bases.21,28,30
as, e.g., second-order Møller−Plesset perturbation (MP2)16 or In this work, we follow a different route to obtain converged
coupled cluster (CC)17 energies. For this reason, Dunning’s values in the CBS limit by evaluating the MP2 correlation
cc-pVXZ correlation-consistent polarized bases18 (where X = energy in a converged plane wave (PW) basis set. PWs have
D,T,Q,5,6 is the cardinal number), as well as their augmented the advantage of forming an orthogonal basis, the complete-
aug-cc-pVXZ analogues containing diffuse functions,19,20 are ness of which is established regularly and monotonically with
among the most popular for post-HF approaches because of the increase in a single parameter, the kinetic energy cutoff,
their meticulous design, which makes it possible to gradually regardless of the level of theory employed. Since the basis
recover a maximum of electron correlation by increasing the functions are fixed in space rather than being located at atomic
basis’ cardinality. centers, there is no BSSE from the outset, and the BSIE is
The development of such basis sets subsequently led to the systematically and progressively reduced to reach the ultimate
proposal of numerous and mostly empirical extrapolation intrinsic level of precision achievable by the quantum chemical
schemes to estimate values in the complete basis set (CBS) method itself. In contrast, PWs generally describe only the
limit,21−23 some of which are detailed later in Section 2.3.1. valence electrons explicitly in order to reduce the number of
Although the BSSE vanishes in the CBS limit, these basis functions and limit calculation costs, with pseudopoten-
extrapolation procedures are not intended to directly correct tials replacing the effects of the core electrons in accommodat-
for it, but rather attempt to eliminate the error due to the finite ing the variations of the wave function near the nuclei that
basis that we call here the basis set incompleteness error would require the inclusion of rapidly varying basis functions,
(BSIE).6,14,24 While most extrapolation schemes are of a fully i.e., high energy cutoffs leading to computationally unfeasible
empirical nature, some are based on theoretical motivations basis set sizes. Even if pseudopotentials are employed, a PW
about the leading behavior of approximate atomic wave calculation can necessitate a number of basis functions of up to
functions, so their applicability to molecular systems requires a few hundred times that of GTOs (typically of the order of
that the correlation energy be dominated by the electron− 105 PWs) for a similar level of convergence with respect to the
electron (Coulomb) cusp and that assumptions are trans- basis set limit, thus requiring a highly optimized parallel
ferable to polyatomic systems.1,22,25 In addition, it is generally implementation.31,32
assumed that extrapolations are applicable from one correlated In principle, whether obtained with GTOs or PWs, CBS-
method to another. For example, the usual X−3 scheme converged energies must be identical. However, fundamental
introduced by Helgaker1,26 relies on the finding that the differences exist between those two types of basis functions
principal expansion of the helium configuration-interaction and have never been thoroughly compared. We hence attempt
(CI) energy holds for energies obtained with correlation- to fill this gap with the present work. More specifically, due to
consistent bases because both converge according to the the CP correction and extrapolation schemes required for
principal quantum number n for this two-electron atom. GTOs or due to some peculiarities of PWs in the treatment of
Transferred to molecules, this one would then assume that isolated systems (where the interaction between periodic
lower-order terms as well as chemical bonding effects are replicas intrinsic to PWs has to be explicitly removed), it is
negligible and that the expression applies equally to all system fundamental to clarify how the results obtained with these two
sizes and quantum chemical methods.1 different approaches may differ in practice. To answer this
As an alternative to cope with the BSIE, explicitly correlated question, noncovalent interaction energies of dimer systems
methods1,22,27,28 (e.g., MP2-R12/F12) are particularly suited provide a sensitive test case because the description of weak
for fast convergence of correlation energies due to their dispersion interactions often requires an accuracy of the order
correlating many-electron basis functions that depend explicitly of 0.05−0.5 kcal/mol.1,33,34 Such systems thus challenge the
on the interelectronic coordinates r12. R12 or F12 refers to ability of a basis set to best capture the short-range
whether the explicit two-electron functions (geminals) are components of the correlation energy around the Coulomb
given by linear or Gaussian-type expressions, respectively. cusp while at the same time incorporating the long-range
Apart from the wave function Ansatz, the R12/F12 methods features of the intermolecular interactions. A priori, the
are similar to their standard counterparts and thus converge in delocalized and balanced nature of PWs seems more
theory to values very close to the CBS limit. However, such appropriate for such a treatment of, e.g., hydrogen-bonded
calculations are neither free of BSSE for smaller bases nor of and van der Waals complexes, but at the cost of a much larger
linear dependencies for larger molecules, and they can suffer number of basis functions. On the other hand, polarization and
from numerical instabilities.15,28 In addition, while recovering delocalization of the electronic wave function necessitate larger
most of the correlation energy for a much smaller number of and/or augmented GTO bases for better coverage of real
basis functions, they may still have difficulty incorporating the space.22 If these effects are accounted for, the presence of
very last bits required to reach very high accuracy (≤0.1 kcal/ diffuse functions causes the basis set to be more prone to the
mol) due to the choice of the geminal and integral BSSE, which also makes noncovalent interactions a problem of
approximations, such as the resolution of identity or neglect choice for examining the effect of the CP correction.
of terms, that are necessary to maintain reasonable calculation In the following, we first give some general information on
costs.1,22,26,28,29 For instance, a deviation of about 0.5 kcal/mol how to obtain MP2 interaction energies in the CBS limit with
9212 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
PWs (Section 2.2) as well as with correlation-consistent GTOs considering the Γ-point sampling of the Brillouin zone only;
(Section 2.3); thereafter, specific computational details are orbitals are therefore expanded in reciprocal space as
reported (Section 3). We then demonstrate how to efficiently Gmax
and accurately converge MP2 relative energies in PW basis sets
|i = (G)|G
as implemented in the CPMD software35 (Section 4.1) and i
G=0 (4)
present the results of applying this approach to the calculation
of noncovalent interaction energies of 20 systems from the S22 with the reciprocal space (G) coefficients i(G) that are the
benchmark set,33,34 which we then compare to their (aug-)cc- Fourier components of the molecular orbitals and a PW ⟨r|G⟩
pVXZ analogues of different sizes X = D,T,Q,5 (Section 4.2). = Ω−1/2 eiG·r in the simulation supercell of volume Ω.
In the following, we will call our test subset S22* for brevity’s Computationally, the infinite basis is truncated by restricting
sake. We then searched among 13 GTO extrapolation schemes the maximum norm Gϕmax of G vectors to respect
reported in the literature in order to identify those that agree
the best with PWs in the CBS limit (Section 4.3) and 1 2 1
G (Gmax )2 :=Ecut
investigate the capability of new, different extrapolation laws 2 2 (5)
(Section 4.4). We find that the CBS limits reached with the The wave function cutoff energy Eϕcut,
as well as the volume
best GTO extrapolations and PWs show no significant Ω (both user-defined and system-dependent), act as
difference, i.e., they do not deviate by more than 0.2 kcal/ parameters to ensure the convergence of the energy with
mol for all systems studied herein. However, it is observed that respect to the basis size. The number NG of basis functions is
some residual deviations increase with system size (Section dictated by the following estimate37
4.5). Finally, we conclude with some general recommendations
concerning the choice of correlation-consistent basis sets for 1
NG (Ecut)3/2
the calculation of correlated energies in the CBS limit (Section 2 2 (6)
5).
In reciprocal space, the two-electron integrals (eq 3) can be
2. METHODS evaluated with linear scaling with respect to NG since the
Coulomb operator takes the diagonal form
2.1. Second-Order Møller−Plesset Perturbation
Gmax
ia
Theory. In Møller−Plesset perturbation theory, the dynamic 1
correlation energy is estimated as a series of perturbative terms ij|ab = (G) ia (G) jb ( G)
originating from Rayleigh−Schrödinger perturbation theory G=0 (7)
around a zero-order Hamiltonian given by the sum of Fock where Φ(G) is the generalized Coulomb potential that
operators.16 By taking the ground state Slater determinant that depends on the dimensionality and boundary conditions of
solves the HF problem as the unperturbed wave function, the the system studied, which we describe in more detail in Section
total electronic energy E is approximated at second order by 2.2.2. The overlap pair densities appearing in eq 7 are obtained
the sum of the HF energy and the second-order Møller− from the Fourier transforms
Plesset (MP2c) correlation contribution
iG·r
(G) = [ i* a](G) = dr i*(r) a(r)e
E E MP2 = EHF + EcMP2 (1) ia (8)
As a consequence of Brillouin’s theorem, single excitations of which are replaced by fast Fourier transforms (FFTs) in the
the HF reference do not couple to the HF ground state case of the discrete representation of the ϕi. In principle,
determinant, and only doubly excited determinants contribute because the charge density ρ depends on the square of the
to the MP2 correlation (MP2c) energy, leading to an orbitals, the maximum radius for the density expansion in the
expression that includes double sums over occupied and reciprocal space should be as high as 2Gϕmax, meaning that a
virtual molecular orbitals. For the spin-restricted case, where density cutoff energy Eρcut = 4Eϕcut (cf. eq 5) is needed to
two opposite-spin electrons occupy the same spatial orbital, the maintain a consistent resolution between orbitals and densities
expression reads2,36 in reciprocal space.37 For example, a factor of 4 is used here by
Nocc Nocc Nvir Nvir
default in the calculation of the exchange and Coulomb
ij|ab (2 ab|ij ab|ji ) integrals involved in the zero-order HF calculation. For the
EcMP2 =
i + j a b MP2 correlation energy (eq 2), whatever the ratio between Ecutia
i j a b (2)
and Eϕcut, the canonical scaling in the PW basis set remains
where i and j denote (valence-only) spatially occupied orbitals 2 2
quintic and behaves like (Nocc Nvir NGia) if the pair densities
ϕi,j, and a and b denote their virtual counterparts that are all ρia(G) are precalculated and stored in memory. Consequently,
eigenstates of the Fock operator with respective eigenvalues the number NGia of G vectors entering the expansion of ρia has
εi,j,a,b. The numerator in eq 2 accounts for Coulomb-type
a drastic effect on the overall performance of the MP2 energy
interactions between occupied-virtual pairs of orbitals in the
evaluation, as it also affects the prefactor and memory
evaluation of two-electron integrals
requirements. For this reason, it is imperative to study below
*(r) *(r ) (r) (r ) to what extent a reduction in Ecutia alters the accuracy of the
i j a b
ij|ab = dr dr MP2 energy (Section 4.1).
|r r| (3)
2.2.1. Extrapolation of PWs to the Basis Set Limit. The
The evaluation of this term in atom-centered basis sets is convergence of post-HF energies with respect to the number of
straightforward and documented elsewhere.1,2 GTO basis functions is markedly slower than it is for HF or
2.2. PW Basis Set. In this work, we address the calculation DFT. This is attributed to the need for large atom-centered
of the MP2 energy for isolated systems in the PW basis set, bases to fully accommodate the asymptotic behavior of the
9213 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
wave function around the electron−electron cusp.1 For PWs Despite this, MP2 computations still involve the storage of
associated with effective pseudopotentials, we observe that the Noccnmax NGia values of the pair densities (eq 8) for calculating
value of Eϕcut that converges relative HF energies is in general the integrals over NGia ∼104 to 105 integrands (eq 7) as well as
close to that required for recovering most of the MP2c
the contribution of ∼109 to 1011 ijab sums (eq 9) so that only a
contribution, and both require a fairly large NG. In contrast to
parallel approach can handle such intense RAM and CPU
atom-centered bases, the sluggish convergence of the MP2
requirements in a reasonable time. The PW/pseudopotential
energy in PWs is rather reflected in the progression of the sum
MP2 method used herein has been developed and
Nocc n
ij|ab (2 ab|ij ab|ji ) implemented in the CPMD program,35,47 for which we give
Ec,MP2
n = the pseudocode of the parallel implementation in Supporting
i,j a,b i + j a b (9) Information (Algorithm 1).
2.2.2. Treatment of the Coulomb Potential for Isolated
with respect to the contribution of an additional virtual orbital
Systems. The PW basis set offers the possibility to evaluate the
n (≤Nvir). In theory, the effective Hilbert space defined by the
nonlocal and cumbersome Coulomb potential
numerical basis has the size NG = Nocc + Nvir, and the virtual
r|v12|r = 1/|r r | in reciprocal space, with its Fourier
space is the algebraic consequence of the basis set being larger
than the number of electrons in the system. As an illustration, transform (sampling the Γ-point only) being37
the MP2 calculation with the largest basis set considered in this 4
work has NG = 408 126 and Nocc = 37, which makes the entire (G) = G|v12|G = [ r|v12|r ] = G, G
G2 (12)
Fock matrix diagonalization and the direct evaluation of eq 2
simply intractable (Nvir = 408 089, ∼1013 summands, ∼6 × but the long-range nature of the Coulomb interactions in direct
1012ρia(G) points to be stored with double precision in 45 TB space poses problems for the evaluation of multicenter
of RAM). For comparison, the same calculation with the GTO integrals such as those appearing in HF48−53 or MP2
aug-cc-pV5Z basis set requires only 2945 basis functions. correlation energy.39 Indeed, discrete sums of the type of eq
Therefore, the enormous size of the basis set coupled with the 7 with Φ(G) = Φ̃(G) are facing a singularity in G = 0, which is
steep scaling of the methods constitute the main challenges only properly integrable in the thermodynamic limit (Ω → ∞,
when carrying out correlated calculations with PWs. ∑G → Ω/(2π)3∫ dG) and is a consequence of the finite
Fortunately, as it was established by numerical38−42 and simulation cell imposed by the numerical computation. Simply
analytical43 considerations, the PW correlation energy can be ignoring the problematic component makes the convergence of
extrapolated to the CBS limit with respect to the virtual the integrals very slow and requires either many replicas of the
orbitals. Relying on the model of the homogeneous electron unit cell (large supercell) or a much finer and more careful
gas (HEG) in a finite cell, Shepherd et al. showed that the sampling of the eventual k-point mesh. Therefore, schemes
MP2 correlation energy in the large basis set limit (Eϕcut → ∞) have been suggested in order to screen the G = 0 divergence
behaves like43 and obtain faster convergence.48,49,52
In the context of hybrid functionals, be it for isolated or
Ec,MP2 Ec,MP2
E
(Ecut) 3/2
NG 1 (10) periodic systems, Broqvist et al. (BAP)53 proposed to use an
cut
auxiliary function f(G), which acts as a singularity correction
By noticing that the eigenstates of the HEG Fock matrix are by transforming the summand of eq 7 regular function with
nothing else than pure PWs |G⟩, any HF orbital of a many-
electron system can be interpreted as the results of a unitary BAP(G)
transformation of the HEG HF problem, so that the same l
o 4
o
o for G 0
extrapolation law generalizes to single-reference quantum o
o G2
chemical methods of solids and molecules (in the limit of a o
o
=m
o
complete and sufficiently large basis).43 In another interpre- o
o
o
o = dGf (G) f (G) for G=0
tation, one can assume that the virtual states of very high o
o (2 )3
(continuum) energy lose their molecular character and n G 0
Coulomb potential greatly improves the convergence of the actions after an Ewald-type splitting of the Coulomb
total energy as well as the HOMO−LUMO gap with respect to potential.59,60 Very few analyses of the BAP or the MT
the supercell size, as opposed to simply neglecting the G = 0 treatment have been performed on HF calculations with full
component. Hence, we focus on the behavior of this correction exact exchange, and to the best of our knowledge, none has
when applied to the MP2 correlation energy. been done on the MP2 correlation energy. It is therefore
An alternative treatment specific to isolated molecules is the crucial to understand how these act on such energy
effective decoupling of the Coulomb interactions between the contributions to ensure the convergence and accuracy of PW
system and its unphysical periodic replicas. For this purpose, results in what follows.
special Poisson solvers54−57 provide an expression of the 2.3. Correlation-Consistent GTO Basis Sets. In the
potential induced by the cluster charge density when modeled realm of GTOs, the cc-pVXZ18,20 and aug-cc-pVXZ19,20 basis
in an infinitely replicated periodic setup. The method of families of Dunning and co-workers have been designed to
Martyna and Tuckerman (MT)56 assumes that the density recover most of the correlation energy due to the valence
vanishes far enough from the boundaries of the box so that the electrons. More precisely, these are called correlation-
potential can be seen as having the same periodicity as the consistent since the basis functions that are added at each
simulation domain D. In this first/nearest image picture, the level of cardinality X = D, T, Q,... contribute similar amounts of
potential in the MT method converges toward the isolated energy, independently of their type (s, p, d, ...), and in a
system limit when the supercell is sufficiently expanded. consistent manner even in the presence of polarization or
Separating its action at short and long distances with the help diffuse functions. All of these are optimized so as to maximize
of the parameter α (1/r = [erfc(αr) + erf(αr)]/r), the latter their contributions to the atomic correlation energy. For
can be recast as example, to balance the set, s and p functions are added when
the polarization space is extended so that the correlation
l
o 4 erf( r ) iG·r energy error due to the s and p functions does not exceed the
o
o
o
o G2 + D( ) dr r e for G 0
o
o
error from the polarization space. The exponents of such s and
o
o ´ÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖlong
ÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÖÆ
o
o p functions are optimized with respect to atomic HF energies,
MT(G) = m
o while the correlating polarization functions come from valence
o
o 4
o
2 2
o
o e G /4 for G=0 energy minimization at the atomic CISD level of theory. The
o
o 2
G 2
o
o G 0 ´ÖÖÖÖÖÖÖÖÖÖÖÖÖÖlong ÖÖÖÖÖÖÖÖÖÖÖÖÖÖÆ aug-cc-pVXZ bases are derived from the original cc-pVXZ,
o
n with the addition of one diffuse function per angular
(16) momentum present in the set, whose (smaller) exponents
are determined by minimizing the atomic CISD energy of
where the new terms that add up to (G)=4 /G2 come from anions. Although first intended for a better description of
the difference between the Fourier transform long (G) and the electron affinities, the aug-cc-pVXZ basis family has been
long shown to progressively enhance the convergence of other
Fourier series components (G) of the long distance part,
molecular properties such as proton affinities,61 dipoles, and
which acts as a screen of the interactions between the isolated polarizabilities,62−64 or energies of weakly bound sys-
system and its infinite periodic images. Note that the tems.11,63,65−67
singularity of Φ̃(0) would be exactly canceled by that of The main advantage of the (aug-)cc-pVXZ basis sets is their
long
(0), and both were ignored in eq 16, but the nonsingular ability to converge results toward the basis set limit in a
long (semi)systematic manner at the cost of increasing the number
difference limG 0[ (G) (G)] = 2 must be included.
of contracted basis functions, Nb. For first-row atoms, this
It has been shown in practice that for αL ∼ 7, where L is the latter increases with the cardinal number X as26
smallest size of the parallelepiped box, long (G) can be
1 i 3y
efficiently evaluated via a FFT that converges rapidly with Nbcc pVXZ = (X + 1)jjjX + zzz(X +2)
respect to the Cartesian grid. In the framework of PW/ 3 k 2 { (17)
pseudopotential DFT, the evaluation of Coulomb-like integrals
with the MT Poisson solver provides accurate energies, Nbaug cc pVXZ = Nbcc pVXZ + (X +1)2 (18)
provided that the integration domain spans about twice the
size of the electron density. Thus, as noted by MT, increasing such that, for a computer time associated with MP2 that scales
the size of the supercell becomes analogous to converging the as N5Nb4, where N is the number of atoms, improving the
energy according to the largest-width diffuse function included correlation energy with a larger basis grows as N5X12. Q, 5, or 6
in a Gaussian basis set. zeta calculations may therefore be prohibitively expensive for
BAP (eq 13) and MT (eq 16) schemes look very similar for larger systems of interest.68−70
G = 0, where the sum over G ≠ 0 vectors actually corresponds This is in contrast to PWs, for which the basis size does not
to the electrostatic energy of a Gaussian charge distribution depend explicitly on the number of atoms N but only on the
interacting with a compensating uniform background in a volume of the supercell and the cutoff energy so that the MP2
periodic setup.53 The first term in eq 15 accounts for the energy scales as Nocc 2 2
nmax (Ecutia )3/2 (Section 2.2). Assuming
electrostatic self-energy of an isolated Gaussian charge in the that a sufficiently high energy cutoff may be chosen to faithfully
supercell, so intuitively, χ corrects the singularity using the describe pair densities over a wide range of systems and that
difference between the electrostatic energy of an isolated probe nmax increases less than linearly with N (as we have
charge and its periodically repeated analogues in a compensat- observed47), the PW basis set then becomes more favorable
ing background.53,58 For MT, instead, the Gaussian charge in the limit of large systems,2 provided that Ω does not
distribution is used to construct the screening function increase significantly for the correct convergence of Coulomb
long long
(G) (G) of the long-range electrostatic inter- interactions in a periodic setup (Section 2.2.2).
9215 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
2.3.1. Extrapolations of GTOs to the Basis Set Limit. On the other hand, one possible theoretical motivation
Although the correlation-consistent basis sets provide gradual holds its origin in the partial-wave expansion of a two-electron
and monotonic progress, their associated computational cost atom:1,22,23 By treating the Hamiltonian of the bare nucleus of
grows faster than the rate of convergence. As a rule of thumb,22 the helium atom as zero-order and the electron interaction as a
it is globally said that an improvement of the energy accuracy perturbation, Schwartz studied the convergence of the
by a factor of 10 necessitates a computational effort increased correlated atomic wave function on a one-electron basis.22,78
by a factor of 104, and the convergence of the correlation He established that the partial wave increments δEl(2) to the
energy, be it MP2, CCSD, CCSD(T), ..., remains so slow that second-order energy E(2) = l = 0 El(2) follow an asymptotic
basis set limit estimates can only be reached by extrapola-
formula in the limit of large l, which behaves like
tion.21−23,71 As a consequence, expressions based on the
comparison with very large basis calculations or, most i 1y 4
i 1y 6
commonly, with explicitly correlated methods (e.g., MP2- El(2) = A jjjl + zzz + Bjjjl + zzz + (l 8)
R12/F12 and CCSD(T)-R12/F12) have been suggested, but k 2{ k 2{ (33)
the computational overhead for obtaining such accurate with l being the degree of Legendre polynomials entering the
references has often restricted the size of the validation partial wave expansion of the first-order wave function.
systems to a few dozen electrons. Interestingly, the l-th component in the partial wave expansion
Even though nonexhaustive, we report in Scheme 1 some corresponds to a one-electron atomic function with angular
formulas found in the literature to estimate correlated energies momentum l.79 Translated to many-electron atoms, this
implies that δEl(2) is equivalent to the energy increase due to
Scheme 1. Extrapolation Expressions Tested in This Work the addition of a saturated shell of basis functions of angular
for the (aug-)cc-pVXZ Basis Sets momentum l to the basis set that expands the first-order wave
function. For standard electronic structure methods (e.g.,
MPn79 or CI80,81) and many-electron atoms, similar forms
were derived with odd terms that may also arise, where one
makes the general assumption that the increment of the
correlation energy can be expanded as
m
i 1y
El(2) = A mjjjl + zzz
m =4 k 2{ (34)
Am m +1
= (L +1)
m= 4
m 1 (36)
Furthermore, by comparing the MP2-R12 references of Ne, which greatly accelerates the energy convergence compared to
HF, H2O, and N2, Wilson and Dunning found that the ansatz individual extrapolations. This allows us to set smaller nmax
virtual orbitals to be diagonalized and processed in the MP2c
A B
Ec,MP2 MP2
X = Ec, + + +1
double summation, which drastically reduces the computa-
(X + D) (X + D) tional requirements. No significant difference (>0.001 kcal/
C mol) was observed if the extrapolation was done as a function
+ of n−1 or ε−3/2 , the latter eigenvalues corresponding to the
(X + D) + 2 (37) n
dimer being therefore used. Extrapolation points are spaced by
was giving the best match for (α = 3, B = 0, and D = 0) and (α an increment of 100 virtual orbitals, and better accuracy is
= 4, C = 0, and D = 1), consequently proposing Wilson35 (26) obtained with a linear fit according to
and Wilson45 (27) for extrapolating correlated energies.30
Alternatively, Helgaker et al. put forward the use of eq Ec,MP2
n ·
3/2
n = 3/2
n + (38)
Helgaker (29),15,26,29,84 which corresponds to the leading
order of eq 36, and the identification of L ∼ X − 1 based on a where α = ΔEMP2 c recovers the PW CBS MP2c energy by
ensuring that nmax is chosen large enough in order for eq 38,
better agreement with R12 results.26 This allows the
respectively eq 11, to be valid. For the systems studied, nmax is
extrapolation of the correlation energy from a two-point linear
between 10 000 and 20 000. To account for the sensitivity of
fit and was later motivated by analyzing the energy increments
the extrapolated value with respect to the fitting range, the
of the principal expansion of the ground-state helium atom,
yielding in this case a convergence with respect to the principal results from all possible intervals ending at nmax are calculated,
quantum number n, which is a priori more in line with the and the final ΔEMP2
c value is averaged among the intervals that
progressive construction of the (aug-)cc-pVXZ bases.1,22 Since respect eq 38.
the number of basis functions increases cubically with X (eq The cutoff energy Eϕcut of the wave function has been set to
17), Helgaker (29) is equivalent to a convergence of the 150 Ry, and the density cutoff is set to the usual Eρcut = 4Eϕcut for
correlation energy as a function of 1/Nb. Interestingly, this is all systems and supercell sizes. No change larger than ∼0.01
analogous to PWs, for which the correlation energy converges kcal/mol was observed on the extrapolated MP2 interaction
to 1/NG (eq 10). A different rate of convergence was observed energies at larger cutoffs (Table S1 in the Supporting
for the HF energy, so Helgaker argued for its separate Information). The effects of the cutoff energy for the MP2c
treatment, according to Helgaker (28).26,84 pair densities, the supercell dimensions, and the decoupling
Finally, Varandas investigated the universality of the B between periodic images are discussed below in Section 4.1.
parameter in Helgaker (29) and found that eq Varandas34 3.2. Correlation-Consistent GTO Basis Sets. The (aug-
(31) minimizes the difference with a set of CCSD(T)/MP2 )cc-pVXZ calculations were performed with Orca 5.0.3,88,89 or
energies of small molecules and different basis sets.70 However, for the larger systems and augmented bases, with Turbomole
the match was better if considering a fourth-order term with V7.190 after checking that both programs gave identical results.
the general form of Varandas34 (30) or by exploiting an The HF wave function and energies are obtained for all-
empirical interdependence between the parameters that leads electron calculations, while the frozen-core approximation is
to eq Var.34-fit (32), which has the advantage of requiring only used for the MP2 correlation energy, i.e., occupied orbitals
two points for extrapolation. corresponding to core electrons are omitted in the MP2c
Subsequently, we will examine which of these extrapolations evaluation. The convergence threshold for the SCF wave
provide better or worse agreement between the GTO and PW function was set to VeryTightSCF for Orca, respectively, and
MP2 interaction energies in the CBS limit. to 10−7 au for the energy gradient in Turbomole.
The CP correction scheme is used to correct for the BSSE.5,6
3. COMPUTATIONAL DETAILS Therefore, uncorrected and BSSE-corrected interaction en-
ergies (HF or MP2c) are calculated as follows
The geometries of the test systems were taken from the paper
defining the S22 data set.33 Like in the original work on the Euncorr. = E{ABAB} E{AA} EB{B} (39)
S22 data set and its revised version,34 deformation energies are
neglected, monomer structures are kept identical in the dimer ECP corr. = E{ABAB} E{AAB} EB{AB} (40)
configuration, and no monomer relaxation is carried out. To
save computational resources, we opted to exclude the two 1
adenine−thymine dimers from our test set, resulting in a set of E half CP = ( Euncorr.+ ECP corr.)
2 (41)
20 structures that we refer to as S22*.
3.1. PW Basis Set. A development version of CPMD 4.3 where E{A}
A designates the energy of monomer A calculated in
has been used for all MP2 calculations with PWs35 in its basis {A}, and E{AB}
A its corrected energy calculated in the
combination with hard norm-conserving Goedecker−Teter− full {AB} basis that includes ghost functions located on system
Hutter (GTH)85 pseudopotentials specifically parametrized for B. Since it was noticed that ΔEuncorr. and ΔECP‑corr. may
HF.86 The HF wave function has been optimized with either converge to the basis set limit from opposite sides, it is
DIIS87 or preconditioned conjugate gradient optimization up sometimes assumed that the average ΔEhalf‑CP energy provides
to a maximum residual component of the gradient on occupied faster convergence,15,91−93 a strategy that we also examine.
orbitals lower than 10−7 au, respectively, 10−5 au for the nmax
virtual orbitals obtained via subsequent Davidson diagonaliza- 4. RESULTS AND DISCUSSION
tion.46 4.1. Converging Accurate MP2 Energies with PWs. In
The MP2c contribution to the interaction energy of the AB this section, we discuss a number of technical details that are
complex is extrapolated according to eq 11, i.e., the essential for making calculations of the MP2 interaction
extrapolation is performed on ΔEMP2 MP2 MP2 MP2
c,n = EAB,c,n − EA,c,n − EB,c,n , energies with PWs tractable. As already mentioned, the leading
9217 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
c o m p u t a t i o n a l e ff o r t f o r t h i s t a s k s c a l e s a s
2 2
(Nocc nmax (Ecutia )3/2 ), for which nmax is reduced by the joint
extrapolation of relative energies in the virtual space.
Moreover, Ω and Ecutia define the number Noccnmax NGia of
ρia(G) pair density values (eq 6) to be stored for the two-
electron integrals (eq 7) and therefore have a strong influence
on the memory requirements.
The effect of Ecutia on the correlation energy is reported in
Table 1, which shows that no difference greater than 0.01 kcal/
Ecutia
S22 system rx ry rz Ω [Å3] [Ry] ΔEMP2
c σMP2
c
Figure 2. Box plots of the differences ΔEGTO − ΔEPW between the GTO and PW interaction energies of the S22* test systems. Separate HF and
MP2c contributions to the total MP2 energies are given. Signed differences are given in (a) while (b) reports absolute values. Medians are shown as
horizontal black lines, and yellow lines stand for the mean signed deviation (MSD) in (a) and the mean absolute error (MAE) in (b), respectively.
Dots represent outliers that are located further than 1.5 times the interquartile range from the first and third quartiles (i.e., the limits of the
rectangular boxes).
would impose a too large box size and intractable computa- support the fact that the use of pseudopotentials does not
tional cost. cause any spurious differences between the PW (pseudopo-
Within these settings, we have shown how various factors tential) and GTO (all-electron) results.
can push the limits of MP2 calculations with PWs. The first The MP2c correlation and, hence, the MP2 energies are
factor consists of truncating the virtual space thanks to an more sensitive to the basis set size and slower to converge than
analytical extrapolation (eq 11); the second relates to the HF, with, e.g., MP2 deviations of about 0.7−0.8 kcal/mol for
reduced number of PWs necessary to expand the pair densities; the T zeta bases. This is because the MP2c energy is more
and finally, the last refers to the choice of an efficient Coulomb prone to the BSIE, which is noticeably exacerbated for the
operator for treating isolated systems and correlation energies. smaller cc-pVDZ and cc-pVTZ bases once CP corrections are
Thanks to these findings, it has been made possible to access applied. Overall, D zeta basis sets do not provide a satisfactory
the MP2 interaction energies of systems with up to ∼100 level of convergence, with deviations that might surpass the
electrons that are listed in Table S2 of the Supporting order of 2 kcal/mol, whatever the augmentation or the BSSE
Information. Convergence was achieved by progressively correction. At small D and T cardinalities, the best match with
expanding an orthorhombic cell (rx,y,z = 1.2, 1.4, 1.6, 1.8, 2.0, PWs is observed when considering only half of the CP
and 2.2 when possible) with the MT Poisson solver. The HF correction with the (aug-)cc-pVDZ or cc-pVTZ bases,
components were retained when no variations larger than 0.01 respectively (Table 2), which confirms that the accuracy at
kcal/mol were measured, while MP2c contributions were such levels is mainly due to a fortuitous cancellation of the
extrapolated first via eq 38 and then via eq 43. Standard BSSE, which lowers the energies, and the BSIE, which
deviations due to this extrapolation procedure are also increases them. Although not formally recommended, this
reported in Table S2 and do not exceed 0.05 kcal/mol for can be exploited for a crude first estimate in the case of a
energies spanning a range of −0.50 to −20.19 kcal/mol. limited computational budget, as provided, for example, by the
4.2. HF/MP2 Energies in PWs versus GTO Bases. cc-pVTZ/half-CP combination (with a potential ∼1 kcal/mol
Figure 2 displays the statistics of the differences between the error). For the same reason, the match between nonaugmented
GTO and PW contributions to the MP2 interaction energy. Q/5 bases and PWs is better with only a half-CP correction.
For HF, the CP uncorrected or half-corrected energies For the augmented bases, however, the same conclusions as for
converge from below and confirm that the BSSE tends to HF hold: the MP2c and MP2 energies are generally too low
overbind dimer systems at the HF level. Once the BSSE is without a full BSSE correction and approach the CBS PW
removed by the CP correction, HF energies converge faster values from above when corrected. Thanks to this, the overall
and from above, which is the expected behavior of a gradual best agreement between GTOs and PWs is measured on the
aug-cc-pV5Z basis with CP correction, which shows a MAE of
decrease of the (sole) BSIE as the size of the basis set
only 0.06 kcal/mol.
increases.15 The augmented basis converges slightly faster than
As a result, the gradual convergence of GTOs toward PW
its standard counterpart, certainly due to its larger size and
values validates both the MP2 implementation in CPMD and
spatial extent for the same cardinal number. When CP-
the previously proposed extrapolation procedure to compute
corrected, the HF energies already converged within less than
PW interaction energies in the CBS limit (Section 4.1).
0.2 kcal/mol for the triple (T) zeta bases. Overall, at each
Furthermore, our results highlight the importance of diffuse
cardinal number, the CP-corrected results obtained with the basis functions needed to incorporate the long-range
augmented basis sets provide the best agreement with PWs, as components of the electron correlation in weakly bound
also reported in Table 2. The remarkably small deviations at systems,a and in this respect, they provide further confirmation
the HF level between the Q/5 zeta all-electron GTOs and PWs of the use of (aug-)cc-pVXZ bases for converging binding/
interaction energies with correlated methods, although they
Table 2. Best Agreement between GTO and PW Interaction were originally designed for the treatment of anions and
Energies of the S22* Test Set at Each Cardinal Numbera electron affinities.19,20,64
level size set BSSE corr. MAE max dev. 4.3. HF/MP2 Energies in PWs Versus Extrapolated
HF D non-aug CP 0.16 0.69
GTO Bases. We are now interested in exploring whether the
aug CP 0.07 0.48
different extrapolations of Scheme 1 for consecutive X =
T non-aug CP 0.06 0.15
D,T,Q,5 cardinal numbers improve or deteriorate the GTO
aug CP 0.04 0.12 CBS estimates compared to PW energies. When uncorrected
Q non-aug CP 0.02 −0.08 for the BSSE, HF, and MP2 interaction energies, they converge
aug CP 0.02 0.05 nonsystematically and sometimes nonmonotonically because
5 non-aug CP 0.02 0.04 of the varying balance between BSSE and BSIE (as illustrated
aug CP 0.02 0.05 in Figure S2 or ref 15). For this reason, GTO extrapolations
MP2 D non-aug half-CP 0.97 2.71 performed on relative energies yield results that are very similar
aug half-CP 0.68 −1.88 to or worse than those for absolute total energies. Thus, in
T non-aug half-CP 0.24 0.70 what follows, the energy of each subsystem will be extrapolated
aug CP 0.30 0.82 individually.
Q non-aug half-CP 0.11 0.31 Truhlar (21) and Helgaker (28) treat the HF contribution
aug CP 0.11 0.23 separately,b and Figure 3 shows how they perform in this
5 non-aug half-CP 0.07 −0.20 regard. Both contain three parameters and require at least three
aug CP 0.06 0.16 data points for extrapolation. The power expression of Truhlar
with D, T, and Q (DTQ) data points generally worsens the
a deviations from PWs compared with simple Q energies. When
Mean absolute errors (MAE) and maximum deviations are in kcal/
mol. including the 5 points (TQ5 and DTQ5), the results are also
9220 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
better for the augmented sets. Note that the extrapolations that suggested to use the Helgaker (28)(29) scheme on aug-cc-
include the (aug-)cc-pV5Z data point do not further improve pVXZ CP-corrected energies in order to obtain the most
the interaction energies (Table 3), with an average MAE (max accurate estimates in the CBS limit. If not, then the Martin4
dev) against PWs of 0.06 (0.17) kcal/mol that is comparable to (23) expression is recommended.
the 0.07 (0.18) kcal/mol for the nonextrapolated results 4.4. Other GTO Extrapolations. Motivated by the best
(Table 2). This remaining difference is discussed in Section agreements found so far, as well as the general expression of eq
4.5. 36, we have tested all possible extrapolations in the form of
To summarize, our results demonstrate that the extrapolated
EXMP2 = E MP2 + A(X + a) (44)
GTO energies are always closer to PW reference values when
the CP correction is used to tackle the BSSE with sets that are EXMP2 = E MP2 + A(X + a) + B(X + b) (45)
augmented by diffuse functions (Table 3). For nonaugmented
bases, the interplay between the BSIE and some residual BSSE 1 1
with α, β = 3, 4, 5, 6 and a , b = 1, 2 ,0, 2 ,1 to investigate
can make the Helgaker and Martin4 two-point extrapolations whether different schemes could universally improve the
perform fortuitously better in conjunction with the half-CP remaining deviations reported in Table 3. For the first eq 44,
correction. The empirical Peterson scheme surprisingly no combination gives overall better results, whether for the
provides interaction energies that are very close to the PW augmented or nonaugmented bases, reinforcing the recom-
results, comparable to Helgaker or Martin4, but the latter two mendation of Helgaker (29) for DT points only and Martin4
appear to be more robust candidates because of their (23) for TQ or Q5 pairs. If three points are available, however,
theoretical foundations and the fact that they depend on two new expressions stand out as providing very similar or
only two parameters. To confirm this, the same analysis has lower deviations than those of Helgaker and Martin4. We call
been carried out with the omission of four outlier systems (the them, from now on, Rovibi34 and Rovibi45 defined by
two uracil, benzene−water, and T-shaped indole benzene
3 4
i 1 yz i 1y
EXMP2 = E MP2 + AjjjX + BjjjX + zzz
dimers for which the aug-cc-pV5Z/CP interaction energies are
zz Rovibi34
already lower than the CBS PW values). For this smaller test k 2{ k 2{
set, Feller occasionally beats Peterson, but Helgaker and (46)
Martin4 perform consistently better too. Therefore, if the
extrapolation sequence includes the D zeta level, then it is EXMP2 = E MP2 + AX 4
+ B(X + 1) 5
Rovibi45 (47)
9222 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
whose results are also reported in Figure 4 and Table 4 for Such laws indicate that the −3 and −4 orders are indeed
comparison. good leading candidates and that terms of higher orders may
also be significant. If some rational explanation were to be
Table 4. Best Agreement between Rovibi Extrapolations and found, Rovibi34 (46) is compatible with contributions
PW Interaction Energies for the S22* Test Seta resulting from the principal expansion proposed by Hel-
gaker1,22 (power −3) and those of the highest angular
points set scheme BSSE corr. MAE max dev. worth? momentum L present in the basis set from the partial wave
MP2 expansions put forward by Carroll,80 Hill,81 and Kutzelnigg.79
DTQ non-aug Rovibi34 CP 0.07 0.18 √ On the other hand, Rovibi45 (47) suggests that an additional
aug Rovibi34 CP 0.06 −0.15 √ order to Martin4 improves the extrapolation and that the
TQ5 non-aug Rovibi34 CP 0.06 0.16 √ (minus) third order does not dominate. For both, the X-shifts
aug Rovibi34 CP 0.06 −0.15 ≃ reflect not only the balance between the orders but also
DTQ5 non-aug Rovibi34 CP 0.06 −0.15 √ between the basis functions that have L = X − 1 for H and L =
aug Rovibi34 CP 0.06 −0.15 ≃ X for C, N, and O atoms in the systems studied here. Based on
DTQ non-aug Rovibi45 CP 0.07 0.22 √ these considerations and because the leading order of Martin4
aug Rovibi45 CP 0.05 −0.15 √ (−4) was motivated empirically by comparison with
TQ5 non-aug Rovibi45 CP 0.05 −0.16 √
experimental atomization energies of small molecules,25,82,83
aug Rovibi45 CP 0.05 −0.15 ≃
the Rovibi34 scheme seems more formally justified.
DTQ5 non-aug Rovibi45 CP 0.06 0.15 √
Up to this point, only the relative energies extrapolated to
aug Rovibi45 CP 0.05 −0.15 ≃
a
the CBS limit have been compared to PW results, but the
Mean absolute errors (MAE) and maximum deviations are in kcal/ quality of the GTO extrapolations can also be assessed by how
mol. Worth indicates that the extrapolated energies are statistically faithfully they reproduce the single data points. Averaged on all
closer to the CBS PW values than the corresponding direct results
obtained at the highest extrapolation point (cf. Table 2). systems, the fitting curves of Rovibi34 and Rovibi45 show a
MAE relative to the data points of no more than 0.5 kcal/mol
Figure 5. Deviations between the aug-cc-pVXZ and PW MP2 interaction energies as a function of the number of electrons Ne in the dimer system.
(a) For plain basis set results and (b) for energies extrapolated to the CBS limit with the best schemes of Tables 3 and 4.
9223 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
(Table S3), while the latter lies between 3.8 and 7.6 kcal/mol residual completeness mismatch between GTOs and PWs in
for Helgaker and Martin4. As a reminder, these errors refer to the limit of large systems, originating rather from an
the total (absolute) energies of the dimers and monomers that incomplete or linearly dependent (overcomplete) GTO
were extrapolated individually. Hence, Rovibi34 and Rovibi45 description of the wave function to capture the long-range
not only provide interaction energies close to the PWs in the (polarization and dispersion) contributions to the correlation
CBS limit but are also capable of interpolating total energies energy. This encourages further comparison of atom-centered
well. In this respect, Rovibi34 performs best with a MAE of bases against other basis sets, either explicitly correlated, PWs,
0.15−0.3 kcal/mol, compared to 0.3−0.5 kcal/mol for or purely numerical, in the calculation of the correlation
Rovibi45. We also stress that the double exponential scheme energies of large systems.
of Peterson, although purely empirical, shows even smaller
fitting MAEs of ∼0.1 kcal/mol (Table S3) and provides 5. CONCLUSIONS AND OUTLOOK
deviations against PWs that are similar to those of Rovibi34 The main motivation for this work was to analyze the effects of
(Figure 4). Consequently, our results do not disprove it as a the basis sets on correlated energies. To this end, the MP2
good extrapolation law. However, based not only on agreement interaction energies of 20 complexes belonging to the S22 test
with PWs and the ability to interpolate GTO energies but also set were computed in the most common Gaussian-type
on theoretical indications, it appears that Rovibi34 (46) is correlation-consistent bases as well as in PWs, for which we
likely the best global choice when resorting to three-/four- implemented the MP2 method in the CPMD plane-wave
point extrapolations (DTQ, TQ5, and DTQ5) to the CBS pseudopotential package. Although more computationally
limit. demanding at such system scales, PW calculations have been
4.5. System Size Dependency. Finally, whatever the made accessible on conventional computing architectures
effort in order to match GTOs with PW results, via CP through an extrapolation protocol involving both the virtual
correction, basis set augmentation, or extrapolation, one space orbitals and the supercell volume, ultimately requiring no
notices that the best MP2 errors do not reach MAEs lower more than a few days per molecule on several high-memory
than 0.05 kcal/mol with maximum differences of at least 0.15 compute nodes.
kcal/mol (Tables 2−4). While this might at first be taken as an By comparing atom-centered interaction energies with PW
indication that interaction energies are essentially converged at results, we established that both basis set types provide
the aug-cc-pV5Z/CP level within an 0.05 kcal/mol numerical consistent values, especially when the CP correction eliminates
accuracy, further analysis instead reveals that residual the BSSE from the former. Indeed, (aug-)cc-pVXZ relative
discrepancies occur because GTO results tend to further energies generally converge toward PW values, free of BSSE,
deviate from PWs as the system size increases. Figure 5a shows for progressive enlargements of X = D,T,Q,5, and differ by less
indeed that GTO interaction energies that are not corrected than 1 kcal/mol for X ≥ T. However, the slower convergence
for the BSSE are lower than those from PWs and that the BSSE of the cc-pVXZ bases makes their agreement with PWs
tends to become larger with an increasing number of electrons occasionally better, with only half of the CP correction due to a
in the system. Interestingly, assuming that the CP correction fortuitous error cancellation between the BSSE and their BSIE.
removes the majority of the BSSE, only the BSIE remains and, Overall, the aug-cc-pV5Z basis set with the CP correction
in turn, increases with the system size. Hence, although the size provides the closest interaction energies within 0.16 kcal/mol
of a GTO basis grows with the number of atoms, the to the fully converged PW results. This demonstrates the
incomplete coverage (lack of completeness) of this expansion benefits of diffuse functions in the description of long-range
leads to a smaller and smaller correction of the BSIE with an interactions as occurring in weakly bound systems, although
increasing system size. In other words, for a given GTO basis their faster energy convergence may slow down for stronger
set, its capacity to capture the (correlation) energy decreases as (covalent) interactions.71
the size of the system increases. Such a size inconsistency is Hence, based on the agreement with PW results at the CBS
even more pronounced for the smaller cc-pVXZ bases (Figure limit, theoretical foundations, and interpolation capabilities, we
S9a). In the limits of large bases and systems, however, the can confidently make the following recommendations for the
occurrence of linear dependencies can further interfere with extrapolation of (aug-)cc-pV[D,T,Q,5]Z correlated energy
this behavior. sequences to the CBS limit:
Once extrapolated, the GTO CBS estimates follow a similar • Use the CP correction for interaction/binding energies.
trend with larger (absolute) differences attributed to larger • Resort to the aug-cc-pVXZ bases if long-range effects are
numbers of electrons Ne (Figures 5b and S9b), thus sizable.
questioning the agreement between GTOs and PWs in the • Extrapolate total energies separately according to
limit of (very) large systems. Note that such a difference 1 3 1 4
applies to all promising extrapolations found in this work • (D)TQ5 points and A X ( 2 ) (
+B X+ 2 )
(Figures S10 and S11). The reasons can be multiple and arise 1 3 1 4
from a combination of the BSSE, the accuracy of the • DTQ points and A X ( 2 ) +B X+ ( 2 )
extrapolation scheme, and the intrinsic nature of the basis 4
functions. Nevertheless, as observed earlier (Section 4.2), the ( 12 )
• Q5 points and B X +
CP correction seems adequate to eliminate the BSSE so that 1 4
• TQ points and B(X + 2 )
the last two factors will dominate, which are directly linked to
the BSIE. Let us recall that the initial motivation behind the • DT points and A X−3
extrapolation schemes is to cope with the electron−electron where the choice of the extrapolation points (and basis
cusp that hampers the basis set convergence.1,15 Therefore, augmentation) is left to the practitioner since these will depend
although not firmly established by our results, the fact that on the computational budget and the problem/software at
extrapolated energies deviate from PWs could indicate a hand. However, note that in principle, the higher the point in
9224 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
the sequence, the more accurate the extrapolated value is, and Complete contact information is available at:
the DT scheme should rather be taken as a first rough estimate. https://pubs.acs.org/10.1021/acs.jctc.3c00952
In practice, the universal application of such a procedure across
correlated wave function methods (MP2, CCSD, CCSD(T), Notes
...) has been widely accepted.15,22,23,25,26,29,70,76,83 The authors declare no competing financial interest.
Finally, getting the electron correlation in the CBS limit The development of this work is intended to be available in a
relies on the saturation of the one-electron basis, whose basis future release of CPMD.35 Data, extrapolation, and analysis
functions should adequately span short distances and be scripts are provided on Zenodo at https://doi.org/10.5281/
flexible enough to incorporate long-range components of the zenodo.7838778.
wave function. The latter are directly linked to the delocalized
nature of the virtual states that contribute to the correlation
energy and therefore necessitate balanced and complete space
coverage. In that sense, PWs are capable of capturing high-
■ ACKNOWLEDGMENTS
U.R. acknowledges funding from the Swiss National Science
lying (continuum-like) states as well as localized occupied Foundation via individual grant no. 200020_185092 and the
states, at the cost of a sufficiently large cutoff energy.47 When NCCR MUST.
compared to PWs, we noticed that the ability of the
correlation-consistent bases to cope with the BSIE decreases
as the number of electrons increases, therefore questioning the
■
a
ADDITIONAL NOTES
For stronger (e.g., covalent) interactions, however, diffuse
capability of localized basis sets to recover most of the augmentation was sometimes found to hamper the basis-set
correlation energy as the system size increases. PW, explicitly convergence of correlation energies.71
correlated, or numerical basis calculations on larger systems b
Note that Martinα (25) was also proposed for HF energies
would confirm (or refute) this statement, but their computa- and yields identical conclusions to Truhlar (21) (not
tional overhead seems to compromise their application for the reported).
time being. It is therefore not excluded that, with the
improvement of wave function-based methods, the precision
of correlated energies becomes comparable to the basis set
errors, such that the nature of the basis or its extrapolation to
■ REFERENCES
(1) Helgaker, T.; Jørgensen, P.; Olsen, J. Molecular Electronic-
the CBS limit ultimately becomes dominant and leads to Structure Theory, 1st ed.; John Wiley & Sons: Chichester, 2000.
noticeable deviations that exceed chemical accuracy (1 kcal/ (2) Jensen, F. How Large is the Elephant in the Density Functional
mol) for larger molecules.93 Theory Room? J. Phys. Chem. A 2017, 121, 6104−6107.
(3) Hohenberg, P.; Kohn, W. Inhomogeneous Electron Gas. Phys.
■
*
ASSOCIATED CONTENT
sı Supporting Information
Rev. 1964, 136, B864−B871.
(4) Kohn, W.; Sham, L. J. Self-consistent equations including
exchange and correlation effects. Phys. Rev. 1965, 140, A1133−A1138.
The Supporting Information is available free of charge at (5) Boys, S. F.; Bernardi, F. The calculation of small molecular
https://pubs.acs.org/doi/10.1021/acs.jctc.3c00952. interactions by the differences of separate total energies. Some
procedures with reduced errors. Mol. Phys. 1970, 19, 553−566.
Implementation details of the PW MP2 correlation (6) van Duijneveldt, F. B.; van Duijneveldt-van de Rijdt, J. G. C. M.;
energy in CPMD; energy convergence against the PW van Lenthe, J. H. State of the Art in Counterpoise Theory. Chem. Rev.
wave function energy cutoff; energy convergence versus 1994, 94, 1873−1885.
the box volume for different treatments of the (7) Daudey, J. P.; Claverie, P.; Malrieu, J. P. Perturbative ab initio
electrostatic couplings in the PW periodic setup; PW calculations of intermolecular energies. I. Method. Int. J. Quantum
and (aug-)cc-pV5Z S22* interaction energies; additional Chem. 1974, 8, 1−15.
deviations between GTO and PW interaction energies (8) Schwenke, D. W.; Truhlar, D. G. Systematic study of basis set
superposition errors in the calculated interaction energy of two HF
against cardinal numbers and the number of electrons in molecules. J. Chem. Phys. 1985, 82, 2418−2426.
the complexes; and figures of merit for the interpolation (9) Loushin, S. K.; Liu, S. Y.; Dykstra, C. E. Improved counterpoise
quality of various GTO extrapolations (PDF) corrections for the ab initio calculation of hydrogen bonding
interactions. J. Chem. Phys. 1986, 84, 2720−2725.
■ AUTHOR INFORMATION
Corresponding Author
(10) Gutowski, M.; Van Lenthe, J. H.; Verbeek, J.; Van Duijneveldt,
F. B.; Chałasinski, G. The basis set superposition error in correlated
electronic structure calculations. Chem. Phys. Lett. 1986, 124, 370−
Ursula Rothlisberger − Laboratory of Computational 375.
Chemistry and Biochemistry, Institute of Chemical Sciences (11) Feller, D. Application of systematic sequences of wave functions
and Engineering, É cole Polytechnique Fédérale de Lausanne to the water dimer. J. Chem. Phys. 1992, 96, 6104−6114.
(EPFL), CH-1015 Lausanne, Switzerland; orcid.org/ (12) Gutowski, M.; Van Duijneveldt-Van De Rijdt, J. G.; Van
0000-0002-1704-8591; Email: ursula.roethlisberger@ Lenthe, J. H.; Van Duijneveldt, F. B. Accuracy of the Boys and
Bernardi function counterpoise method. J. Chem. Phys. 1993, 98,
epfl.ch
4728−4737.
Authors (13) Van Mourik, T.; Wilson, A. K.; Peterson, K. A.; Woon, D. E.;
Dunning, T. H. The Effect of Basis Set Superposition Error (BSSE)
Justin Villard − Laboratory of Computational Chemistry and on the Convergence of Molecular Properties Calculated with the
Biochemistry, Institute of Chemical Sciences and Engineering, Correlation Consistent Basis Sets. Adv. Quantum Chem. 1998, 31,
É cole Polytechnique Fédérale de Lausanne (EPFL), CH-1015 105−135.
Lausanne, Switzerland; orcid.org/0000-0003-4606-319X (14) Mentel, M.; Baerends, E. J. Can the counterpoise correction for
Martin P. Bircher − Computational and Soft Matter Physics, basis set superposition effect be justified? J. Chem. Theory Comput.
Universität Wien, A-1090 Wien, Austria 2014, 10, 252−267.
9225 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
(15) Halkier, A.; Klopper, W.; Helgaker, T.; Jorgensen, P.; Taylor, P. (37) Marx, D.; Hutter, J. Ab Initio Molecular Dynamics: Basic Theory
R. Basis set convergence of the interaction energy of hydrogen- and Advanced Methods, 1st ed.; Cambridge University Press:
bonded complexes. J. Chem. Phys. 1999, 111, 9157−9167. Cambridge, 2009.
(16) Møller, C.; Plesset, M. S. Note on an Approximation Treatment (38) Harl, J.; Kresse, G. Cohesive energy curves for noble gas solids
for Many-Electron Systems. Phys. Rev. 1934, 46, 618−622. calculated by adiabatic connection fluctuation-dissipation theory.
(17) Č ížek, J. On the Correlation Problem in Atomic and Molecular Phys. Rev. B 2008, 77, 045136.
Systems. Calculation of Wavefunction Components in Ursell-Type (39) Marsman, M.; Grüneis, A.; Paier, J.; Kresse, G. Second-order
Expansion Using Quantum-Field Theoretical Methods. J. Chem. Phys. Møller−Plesset perturbation theory applied to extended systems. I.
1966, 45, 4256−4266. Within the projector-augmented-wave formalism using a plane wave
(18) Dunning, T. H. Gaussian basis sets for use in correlated basis set. J. Chem. Phys. 2009, 130, 184103.
molecular calculations. I. The atoms boron through neon and (40) Grüneis, A.; Marsman, M.; Kresse, G. Second-order Møller-
hydrogen. J. Chem. Phys. 1989, 90, 1007−1023. Plesset perturbation theory applied to extended systems. II. Structural
(19) Kendall, R. A.; Dunning, T. H.; Harrison, R. J. Electron and energetic properties. J. Chem. Phys. 2010, 133, 074107.
affinities of the first-row atoms revisited. Systematic basis sets and (41) Schäfer, T.; Ramberger, B.; Kresse, G. Quartic scaling MP2 for
wave functions. J. Chem. Phys. 1992, 96, 6796−6806. solids: A highly parallelized algorithm in the plane wave basis. J. Chem.
(20) Woon, D. E.; Dunning, T. H. Gaussian basis sets for use in Phys. 2017, 146, 104101.
correlated molecular calculations. III. The atoms aluminum through (42) Schäfer, T.; Ramberger, B.; Kresse, G. Laplace transformed
argon. J. Chem. Phys. 1993, 98, 1358−1371. MP2 for three dimensional periodic materials using stochastic orbitals
(21) Feller, D.; Peterson, K. A. An examination of intrinsic errors in in the plane wave basis and correlated sampling. J. Chem. Phys. 2018,
electronic structure methods using the Environmental Molecular 148, 064103.
Sciences Laboratory computational results database and the Gaussian- (43) Shepherd, J. J.; Grüneis, A.; Booth, G. H.; Kresse, G.; Alavi, A.
2 set. J. Chem. Phys. 1998, 108, 154−176. Convergence of many-body wave-function expansions using a plane-
(22) Klopper, W.; Bak, K. L.; Jørgensen, P.; Olsen, J.; Helgaker, T. wave basis: From homogeneous electron gas to solid state systems.
Highly accurate calculations of molecular electronic structure. J. Phys. Phys. Rev. B 2012, 86, 035111.
B: At., Mol. Opt. Phys. 1999, 32, R103−R130. (44) Lanczos, C. An Iteration Method for the Solution of the
(23) De Lara-Castells, M. P.; Krems, R. V.; Buchachenko, A. A.; Eigenvalue Problem of Linear Differential and Integral Operators. J.
Delgado-Barrio, G.; Villarreal, P. Complete basis set extrapolation Res. Natl. Bur. Stand. 1950, 45, 255−282.
limit for electronic structure calculations: Energetic and nonenergetic (45) Paige, C. C. Computational Variants of the Lanczos Method for
properties of HeBr and HeBr2 Van Der Waals dimers. J. Chem. Phys. the Eigenproblem. IMA J. Appl. Math. 1972, 10, 373−381.
(46) Davidson, E. R. The iterative calculation of a few of the lowest
2001, 115, 10438−10449.
eigenvalues and corresponding eigenvectors of large real-symmetric
(24) Dunning, T. H. A road map for the calculation of molecular
matrices. J. Comput. Phys. 1975, 17, 87−94.
binding energies. J. Phys. Chem. A 2000, 104, 9062−9080.
(47) Bircher, M. P.; Villard, J.; Rothlisberger, U. Efficient Treatment
(25) Martin, M. Ab initio total atomization energies of small
of Correlation Energies at the Basis-Set Limit by Monte Carlo
molecules � towards the basis set limit. Chem. Phys. Lett. 1996, 259,
Summation of Continuum States. J. Chem. Theory Comput. 2020, 16,
669−678.
6550−6559.
(26) Helgaker, T.; Klopper, W.; Koch, H.; Noga, J. Basis-set
(48) Gygi, F.; Baldereschi, A. Self-consistent Hartree-Fock and
convergence of correlated calculations on water. J. Chem. Phys. 1997,
screened-exchange calculations in solids: Application to silicon. Phys.
106, 9639−9646. Rev. B 1986, 34, 4405−4408.
(27) Kutzelnigg, W. r12-Dependent terms in the wave function as (49) Massidda, S.; Posternak, M.; Baldereschi, A. Hartree-Fock
closed sums of partial wave amplitudes for large l. Theor. Chim. Acta LAPW approach to the electronic properties of periodic systems. Phys.
1985, 68, 445−469. Rev. B 1993, 48, 5058−5068.
(28) Marchetti, O.; Werner, H.-J. Accurate calculations of (50) Chawla, S.; Voth, G. A. Exact exchange in ab initio molecular
intermolecular interaction energies using explicitly correlated wave dynamics: An efficient plane-wave based algorithm. J. Chem. Phys.
functions. Phys. Chem. Chem. Phys. 2008, 10, 3400−3409. 1998, 108, 4697−4700.
(29) Halkier, A.; Helgaker, T.; Jørgensen, P.; Klopper, W.; Koch, H.; (51) Sorouri, A.; Foulkes, W. M.; Hine, N. D. Accurate and efficient
Olsen, J.; Wilson, A. K. Basis-set convergence in correlated method for the treatment of exchange in a plane-wave basis. J. Chem.
calculations on Ne, N2, and H2O. Chem. Phys. Lett. 1998, 286, Phys. 2006, 124, 064105.
243−252. (52) Carrier, P.; Rohra, S.; Görling, A. General treatment of the
(30) Wilson, A. K.; Dunning Jr, T. H. Benchmark calculations with singularities in Hartree-Fock and exact-exchange Kohn-Sham
correlated molecular wave functions. X. Comparison with ”exact” methods for solids. Phys. Rev. B 2007, 75, 205126.
MP2 calculations on Ne, HF, H2O, and N2. J. Chem. Phys. 1997, 106, (53) Broqvist, P.; Alkauskas, A.; Pasquarello, A. Hybrid-functional
8718−8726. calculations with plane-wave basis sets: Effect of singularity correction
(31) Hutter, J.; Curioni, A. Dual-level parallelism for ab initio on total energies, energy eigenvalues, and defect energy levels. Phys.
molecular dynamics: Reaching teraflop performance with the CPMD Rev. B 2009, 80, 085114.
code. Parallel Comput. 2005, 31, 1−17. (54) Hockney, R. W. The potential calculation and some
(32) Hutter, J.; Curioni, A. Car-Parrinello molecular dynamics on applications. Methods Comput. Phys. 1970, 9, 135−211.
massively parallel computers. ChemPhysChem 2005, 6, 1788−1793. (55) Blöchl, P. E. Electrostatic decoupling of periodic images of
(33) Jurečka, P.; Š poner, J.; Č erný, J.; Hobza, P. Benchmark plane-wave-expanded densities and derived atomic point charges. J.
database of accurate (MP2 and CCSD(T) complete basis set limit) Chem. Phys. 1995, 103, 7422−7428.
interaction energies of small model complexes, DNA base pairs, and (56) Martyna, G. J.; Tuckerman, M. E. A reciprocal space based
amino acid pairs. Phys. Chem. Chem. Phys. 2006, 8, 1985−1993. method for treating long range interactions in ab initio and force-field-
(34) Takatani, T.; Hohenstein, E. G.; Malagoli, M.; Marshall, M. S.; based calculations in clusters. J. Chem. Phys. 1999, 110, 2810−2821.
Sherrill, C. D. Basis set consistent revision of the S22 test set of (57) Genovese, L.; Deutsch, T.; Neelov, A.; Goedecker, S.; Beylkin,
noncovalent interaction energies. J. Chem. Phys. 2010, 132, 144104. G. Efficient solution of Poisson’s equation with free boundary
(35) IBM-MPI-CPMD. Car-Parrinello Molecular Dynamics Code, conditions. J. Chem. Phys. 2006, 125, 074105.
2019, http://www.cpmd.org. (58) Paier, J.; Hirschl, R.; Marsman, M.; Kresse, G. The Perdew-
(36) Szabo, A.; Ostlund, N. S. Modern Quantum Chemistry, 1st ed.; Burke-Ernzerhof exchange-correlation functional applied to the G2−1
Dover Publications: New York, 1996. test set using a plane-wave basis set. J. Chem. Phys. 2005, 122, 234102.
9226 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227
Journal of Chemical Theory and Computation pubs.acs.org/JCTC Article
(59) Ewald, P. P. Die Berechnung optischer und elektrostatischer (82) Martin, J. M. The total atomization energy and heat of
Gitterpotentiale. Ann. Phys. 1921, 369, 253−287. formation of HCN(g). Chem. Phys. Lett. 1996, 259, 679−682.
(60) Allen, M. P.; Tildesley, D. J. Computer Simulation of Liquids, (83) Martin, J. M. L.; Taylor, P. R. Benchmark quality total
2nd ed.; Oxford University Press: Oxford, 2017. atomization energies of small polyatomic molecules. J. Chem. Phys.
(61) Del Bene, J. E. Proton affinities of ammonia, water, and 1997, 106, 8620−8623.
hydrogen fluoride and their anions: a quest for the basis-set limit (84) Halkier, A.; Helgaker, T.; Jørgensen, P.; Klopper, W.; Olsen, J.
using the Dunning augmented correlation-consistent basis sets. J. Basis-set convergence of the energy in molecular Hartree−Fock
Phys. Chem. 1993, 97, 107−110. calculations. Chem. Phys. Lett. 1999, 302, 437−446.
(62) Woon, D. E.; Dunning, T. H. Gaussian basis sets for use in (85) Goedecker, S.; Teter, M.; Hutter, J. Separable dual-space
correlated molecular calculations. IV. Calculation of static electrical Gaussian pseudopotentials. Phys. Rev. B 1996, 54, 1703−1710.
response properties. J. Chem. Phys. 1994, 100, 2975−2988. (86) Del Ben, M.; Hutter, J.; Vandevondele, J. Second-Order
(63) Peterson, K. A.; Dunning, T. H. Benchmark calculations with Møller−Plesset Perturbation Theory in the Condensed Phase: An
correlated molecular wave functions. VII. Binding energy and Efficient and Massively Parallel Gaussian and Plane Waves Approach.
structure of the HF dimer. J. Chem. Phys. 1995, 102, 2032−2041. J. Chem. Theory Comput. 2012, 8, 4177−4188.
(64) Peterson, K. A.; Dunning, T. H. Benchmark calculations with (87) Hutter, J.; Lüthi, H. P.; Parrinello, M. Electronic structure
correlated molecular wave functions. 11. Energetics of the elementary optimization in plane-wave-based density functional calculations by
reactions F + H2, O + H2, and H’ + HCl. J. Phys. Chem. A 1997, 101, direct inversion in the iterative subspace. Comput. Mater. Sci. 1994, 2,
6280−6292. 244−248.
(65) Woon, D. E. Accurate modeling of intermolecular forces: a (88) Neese, F. The ORCA program system. Wiley Interdiscip. Rev.:
systematic Møller-Plesset study of the argon dimer using correlation Comput. Mol. Sci. 2012, 2, 73−78.
consistent basis sets. Chem. Phys. Lett. 1993, 204, 29−35. (89) Neese, F. Software update: The ORCA program system�
(66) Woon, D. E. Benchmark calculations with correlated molecular Version 5.0. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2022, 12,
wave functions. V. The determination of accurate ab initio No. e1606.
intermolecular potentials for He2, Ne2, and Ar2. J. Chem. Phys. (90) TURBOMOLE V7.1, a Development of University of Karlsruhe
1994, 100, 2838−2850. and Forschungszentrum Karlsruhe GmbH, 1989−2007, TURBOMOLE
(67) Woon, D. E.; Dunning, T. H.; Peterson, K. A. Ab initio GmbH, 2016. http://www.turbomole.com.
investigation of the N2-HF complex: Accurate structure and (91) Burns, L. A.; Marshall, M. S.; Sherrill, C. D. Comparing
energetics. J. Chem. Phys. 1996, 104, 5883−5891. counterpoise-corrected, uncorrected, and averaged binding energies
(68) Raghavachari, K.; Anderson, J. B. Electron correlation effects in for benchmarking noncovalent interactions. J. Chem. Theory Comput.
molecules. J. Phys. Chem. 1996, 100, 12960−12973. 2014, 10, 49−57.
(69) Truhlar, D. G. Basis-set extrapolation. Chem. Phys. Lett. 1998, (92) Miliordos, E.; Aprà, E.; Xantheas, S. S. Benchmark theoretical
294, 45−48. study of the π-π binding energy in the benzene dimer. J. Phys. Chem. A
(70) Varandas, A. J. Basis-set extrapolation of the correlation energy. 2014, 118, 7568−7578.
J. Chem. Phys. 2000, 113, 8880−8887. (93) Al-Hamdani, Y. S.; Nagy, P. R.; Zen, A.; Barton, D.; Kállay, M.;
(71) Eshuis, H.; Furche, F. Basis set convergence of molecular Brandenburg, J. G.; Tkatchenko, A. Interactions between large
correlation energy differences within the random phase approxima- molecules pose a puzzle for reference quantum mechanical methods.
tion. J. Chem. Phys. 2012, 136, 084105. Nat. Commun. 2021, 12, 3927.
(72) Woon, D. E.; Dunning, T. H. Benchmark calculations with (94) Del Ben, M.; Hutter, J.; Vandevondele, J. Electron correlation
correlated molecular wave functions. I. Multireference configuration in the condensed phase from a resolution of identity approach based
interaction calculations for the second row diatomic hydrides. J. Chem. on the gaussian and plane waves scheme. J. Chem. Theory Comput.
Phys. 1993, 99, 1914−1929. 2013, 9, 2654−2671.
(73) Peterson, K. A.; Kendall, R. A.; Dunning, T. H. Benchmark
calculations with correlated molecular wave functions. II. Config-
uration interaction calculations on first row diatomic hydrides. J.
Chem. Phys. 1993, 99, 1930−1944.
(74) Feyereisen, M. W.; Feller, D.; Dixon, D. A. Hydrogen bond
energy of the water dimer. J. Phys. Chem. 1996, 100, 2993−2997.
(75) Peterson, K. A.; Woon, D. E.; Dunning, T. H. Benchmark
calculations with correlated molecular wave functions. IV. The
classical barrier height of the H+H2 → H2+H reaction. J. Chem.
Phys. 1994, 100, 7410−7415.
(76) Woon, D. E.; Dunning, T. H. Benchmark calculations with
correlated molecular wave functions. VI. Second row A2 and first
row/second row AB diatomic molecules. J. Chem. Phys. 1994, 101,
8877−8893.
(77) Feller, D.; Sordo, J. A. A CCSDT study of the effects of higher
order correlation on spectroscopic constants. I. First row diatomic
hydrides. J. Chem. Phys. 2000, 112, 5604−5610.
(78) Schwartz, C. Importance of Angular Correlations between
Atomic Electrons. Phys. Rev. 1962, 126, 1015−1019.
(79) Kutzelnigg, W.; Morgan, J. D. Rates of convergence of the
partial-wave expansions of atomic correlation energies. J. Chem. Phys.
1992, 96, 4484−4508.
(80) Carroll, D. P.; Silverstone, H. J.; Metzger, R. M. Piecewise
polynomial configuration interaction natural orbital study of 1s2
helium. J. Chem. Phys. 1979, 71, 4142−4163.
(81) Hill, R. N. Rates of convergence and error estimation formulas
for the Rayleigh-Ritz variational method. J. Chem. Phys. 1985, 83,
1173−1196.
9227 https://doi.org/10.1021/acs.jctc.3c00952
J. Chem. Theory Comput. 2023, 19, 9211−9227