A general parallelization strategy for random path based geostatistical simulation methods Gre ´ goire Mariethoz n Centre for Hydrogeology, University of Neucha ˆtel, 11 Rue Emile Argand, CP 158, CH-2000 Neucha ˆtel, Switzerland article info Article history: Received 16 July 2009 Received in revised form 17 November 2009 Accepted 21 November 2009 Keywords: Geostatistics Simulation Parallelization Sequential simulation Multiple-point Multiple-points MPI Speed-up Parallel computing Random path abstract The size of simulation grids used for numerical models has increased by many orders of magnitude in the past years, and this trend is likely to continue. Efﬁcient pixel-based geostatistical simulation algorithms have been developed, but for very large grids and complex spatial models, the computational burden remains heavy. As cluster computers become widely available, using parallel strategies is a natural step for increasing the usable grid size and the complexity of the models. These strategies must proﬁt from of the possibilities offered by machines with a large number of processors. On such machines, the bottleneck is often the communication time between processors. We present a strategy distributing grid nodes among all available processors while minimizing communication and latency times. It consists in centralizing the simulation on a master processor that calls other slave processors as if they were functions simulating one node every time. The key is to decouple the sending and the receiving operations to avoid synchronization. Centralization allows having a conﬂict management system ensuring that nodes being simulated simultaneously do not interfere in terms of neighborhood. The strategy is computationally efﬁcient and is versatile enough to be applicable to all random path based simulation methods. & 2010 Elsevier Ltd. All rights reserved. 1. Introduction The size of the simulation grids used for geological models (and more generally for spatial statistics) has increased by many orders of magnitude in the last years. This trend is likely to continue because the only way of modeling different scales together is to use high-resolution models. This is of utmost importance in applications such as hydrogeology, petroleum and mining, due to the critical inﬂuence of small scale heterogeneity on large scale processes (e.g. Mariethoz et al., 2009a). Efﬁcient pixel-based geostatistical simulation algorithms have been developed, but for very large grids and complex spatial models, the computational burden remains heavy. Furthermore, with increasingly sophisticated simulation techniques including complex spatial constraints, the computational cost for simulating one grid node has also raised. As multicore processors and clusters of computers become more and more available, using parallel strategies is necessary for increasing the usable grid size and hence allowing for models of higher complexity. Parallel computers can be divided into two main categories: shared memory machines and distributed memory architectures. Shared memory machines have the advantage of ease and rapidity of the communications between the different computing units. Nevertheless, their price is extremely high and the total amount of memory as well as the total number of processors are limited. Therefore, most of the time it is distributed memory machines (or clusters computers) that are used in the industry or in the academic world. As such machines do not have a common shared memory space, the processors have to communicate by sending and receiving messages. The communication time between processors can be important and is often the bottleneck in a program execution. In this paper, we propose a parallelization strategy applicable in the context of sequential simulation methods and based on the distribution of the grid nodes among all available processors. The method minimizes communication and latency times and can be applied using shared or distributed memory architectures, or a combination of both. It consists in centralizing the simulation on a master processor that calls other slave processors as if they were functions simulating one node each time. The key is to decouple the sending and the receiving operations to avoid waiting for synchronization. Centralization allows having a conﬂict manage- ment system making sure that nodes being simulated simulta- neously do not interfere in terms of neighborhood. The strategy is computationally efﬁcient and is versatile enough to be applicable to all random path based simulation methods. It is illustrated with an example using the Direct Sampling approach (Mariethoz, 2009; Mariethoz and Renard, ARTICLE IN PRESS Contents lists available at ScienceDirect journal homepage: www.elsevier.com/locate/cageo Computers & Geosciences 0098-3004/$ - see front matter & 2010 Elsevier Ltd. All rights reserved. doi:10.1016/j.cageo.2009.11.001 n Tel.: + 41 32 718 26 10. E-mail addresses: gregoire.mariethoz@minds.ch, gregoire.mariethoz@unine.ch. Computers & Geosciences 36 (2010) 953–958