PHYSICAL REVIEW E 89, 033304 (2014) Calculation of space localized properties in correlated quantum Monte Carlo methods with reweighting: The nonlocality of statistical uncertainties Roland Assaraf and Dominik Domin Laboratoire de Chimie Th´ eorique, CNRS-UMR 7616, Universit´ e Pierre et Marie Curie Paris VI, Case 137, 4 place Jussieu, 75252 Paris Cedex 05, France (Received 3 December 2013; published 10 March 2014) We study the efﬁciency of quantum Monte Carlo (QMC) methods in computing space localized ground state properties (properties which do not depend on distant degrees of freedom) as a function of the system size N . We prove that for the commonly used correlated sampling with reweighting method, the statistical ﬂuctuations σ 2 (N ) do not obey the locality property. σ 2 (N ) grow at least linearly with N and with a slope that is related to the ﬂuctuations of the reweighting factors. We provide numerical illustrations of these tendencies in the form of QMC calculations on linear chains of hydrogen atoms. DOI: 10.1103/PhysRevE.89.033304 PACS number(s): 02.70.Ss, 31.15.−p I. INTRODUCTION Many important quantities of chemical or physical interest are localized in space; that is, they do not depend on spatially distant degrees of freedom. For example, the forces exerted on any particular nucleus in a molecule are barely inﬂuenced by the presence of neutral molecules that are very far away from the molecule of interest. One would expect that the distant neutral molecules would represent irrelevant degrees of freedom in computation of the force experienced by the nucleus and could be eliminated in the computation of such property. Quantum chemistry exploits locality in many different ways, for example, the Lewis description of covalent bonding [1] and the closely related valence bond theory [2,3]. Deterministic computational methods often exploit locality to tackle large systems. The most obvious strategy would be to study a fragment (for example, in a protein) and rely on the transferability of the results to a larger system. This would also be the idea behind coarse graining and hybrid methods. [4,5] However, calculations on the full system are more robust since they do not depend on this transferability hypothesis. Exploiting the locality property to lower the cost of a calculation on the entire system is the strategy involved, for example, in local electronic structure methods [6–9]. The latter scale down the computational cost as a function of the system size N (number of particles for a rather homogeneous system at a given scale, or a collection of identical systems). In such methods, molecular orbitals can be localized on single or spatially adjacent atoms according to a variety of criteria [10–12]. For large molecular systems, locality enables local correlation methods to reduce the computational scaling up to a linear dependence with N [13,14]. Quantum Monte Carlo (QMC) methods [15–20], a powerful set of stochastic techniques for solving the Schr¨ odinger equation, are increasingly used for electronic structure cal- culations on molecular systems. This is primarily because of the moderate computational scaling [O(N 3−4 )] and the perceived high accuracy to compute total energies. In the past decade there has been a similar drive to formulate and program linear-scaling QMC algorithms [21–27]. The focus of such works was to generate Monte Carlo sample conﬁgurations and evaluate energies with a reduced computational cost. However, another important factor in the numerical efﬁciency of a Monte Carlo method is the size of the statistical ﬂuctuations in a calculated property. The purpose of this paper is to understand the behavior of QMC statistical ﬂuctuations of spatially localized properties as a function of N . We show that conventional methods (correlated sampling methods) do not have the locality property regarding the statistical ﬂuctuations. Many properties, such as the force on a nucleus, dipole moment, or substitution energy, can be written as a difference of two ground state energies E λ − E 0 . E 0 and E λ are, respectively, the ground state energies of the Hamiltonians H 0 and H λ . H λ is a small perturbation of H 0 and λ is a small perturbation parameter, H λ = H + λO. (1) When considering a localized property, O depends mainly on the positions of particles lying in a small region of the space. We then have to compute a small difference in energies E λ − E 0 or the energy derivative, 〈O〉= dE λ dλ     λ=0 ≃ E λ − E 0 λ . (2) Computing these differences (2) from independent energy calculations is particularly inefﬁcient in QMC. Since energy is size extensive, the statistical uncertainty on the energy usually behaves as √ N . Consequently, the statistical uncertainty on such a calculation of (2) is σ i (λ,N,M) ∝ 1 λ  N M . (3) This formula is valid asymptotically, for small λ, large system size N , and large sample size M. It is obvious from (3) that the smaller λ is the less efﬁcient independent energy calculations are. The so-called correlated sampling with reweighting methods [28–31], which are popular strategies to compute small differences of energies or properties, seem to be much more suitable methods. As we show later these methods encompass improved estimators which are built using the Hellmann-Feynman theorem [28,32–37]. We prove in this paper that the statistical uncertainty in correlated sampling with reweighting methods behaves as σ c (λ,N,M) ∝  N M . (4) 1539-3755/2014/89(3)/033304(10) 033304-1 ©2014 American Physical Society