International Journal of Applied Earth Observation and Geoinformation 13 (2011) 396–408 Contents lists available at ScienceDirect International Journal of Applied Earth Observation and Geoinformation journal homepage: www.elsevier.com/locate/jag Comparative analysis of different techniques for spatial interpolation of rainfall data to create a serially complete monthly time series of precipitation for Sicily, Italy A. Di Piazza, F. Lo Conti, L.V. Noto , F. Viola, G. La Loggia Dipartimento di Ingegneria Civile, Ambientale e Aerospaziale, Università di Palermo, viale delle Scienze, Palermo 90128, Italy article info Article history: Received 5 November 2009 Accepted 27 January 2011 Keywords: Precipitation DEM Geostatistics Interpolation methods abstract The availability of good and reliable rainfall data is fundamental for most hydrological analyses and for the design and management of water resources systems. However, in practice, precipitation records often suffer from missing data values mainly due to malfunctioning of raingauge for specific time periods. This is an important issue in practical hydrology because it affects the continuity of rainfall data and ultimately influences the results of hydrologic studies which use rainfall as input. Many methods to estimate missing rainfall data have been proposed in literature and, among these, most are based on spatial interpolation algorithms. In this paper different spatial interpolation algorithms have been evaluated to produce a reasonably good continuous dataset bridging the gaps in the historical series. The algorithms used are deterministic methods such as inverse distance weighting, simple linear regression, multiple regression, geographically weighted regression and artificial neural networks, and geostatistical models such as ordinary kriging and residual ordinary kriging. In some of these methods, the elevation information, provided by a Digital Elevation Model, has been added to improve estimation of missing data. These algorithms have been applied to the mean annual and monthly rainfall data of Sicily (Italy), measured at 247 raingauges. Optimization of different settings of the various interpolation methods has been carried out using a subset of the available rainfall dataset (modeling set) while the remaining subset (validation set) has been used to compare the results obtained by the different algorithms. Validation results indicate that the univariate methods, neglecting the information of elevation, are characterized by the largest errors, which decrease when the elevation is taken into account. The ordinary kriging of residuals from linear regression between precipitation and elevation, which has provided the best performance at annual and monthly scale, has been used to complete the precipitation monthly time series in Sicily. Crown Copyright © 2011 Published by Elsevier B.V. All rights reserved. 1. Introduction The availability of a reliable source of rainfall and climate data is a fundamental prerequisite for the modeling of a wide variety of hydrological and environmental processes. While the nature and the structure of hydrological and environmental models may vary, most of them need a precipitation dataset that is complete and reli- able on a temporal and spatial basis. Unfortunately, measurement of hydrological variables (e.g. rainfall, temperature, streamflows, etc.) can suffer from systematic, random errors and gaps (missing data) (Larson and Peck, 1974; Vieux, 2001) and, among these, the missing data problem is probably the most important one. Corresponding author. E-mail address: valerio@idra.unipa.it (L.V. Noto). Generally there are two different approaches to treat the miss- ing data or data gaps: one possible approach consists of using only continuous records, ignoring the prior (or subsequent) events, while another approach suggests ignoring the gaps, assuming that the data are one continuous series of records. With the former approach many data are wasted and correct statistical inference cannot be made whereas the latter approach reduces the period of recorded events and overestimates the likelihood of occurrence of extreme events. On the other hand, the use of the dataset prone to missing data can result in errors that exhibit temporal and spatial patterns (Stooksbury et al., 1999). A valid alternative to the above mentioned approaches consists of filling the gaps in the rainfall time series by estimating the missing values. The reconstruction of serially incomplete data records has been the subject of a large number of scientific works where numerous techniques for esti- mating missing data values have been implemented and compared. 0303-2434/$ – see front matter. Crown Copyright © 2011 Published by Elsevier B.V. All rights reserved. doi:10.1016/j.jag.2011.01.005