3792 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 46, NO. 11, NOVEMBER 2008 Improving Piecewise Linear Registration of High-Resolution Satellite Images Through Mesh Optimization Vicente Arévalo and Javier González, Associate Member, IEEE Abstract—Piecewise linear transformation is a powerful tech- nique for coping with the registration of images affected by local geometric distortions, as it is usually the case of high-resolution satellite images. A key point when applying this technique is to divide the images to register according to a suitable common triangular mesh. This comprises two different aspects: where to place the mesh vertices (i.e., the mesh geometrical realization) and to set an appropriate topology upon these vertices (i.e., the mesh topological realization). This paper focuses on the latter and presents a novel method that improves the registration of two images by an iterative optimization process that modiﬁes the mesh connectivity by swapping edges. For detecting if an edge needs to be swapped or not, we evaluate the registration improvement of that action on the two triangles connected by the edge. Another contribution of our proposal is the use of the mutual information for measuring the registration consistency within the optimization process, which provides more robustness to image changes than other well-known metrics such as normalized cross-correlation or sum of square differences. The proposed method has been successfully tested with different pairs of panchromatic QuickBird images (0.6 m/pixel of spatial resolution) of a variety of land covers (urban, residential, and rural) acquired under different lighting conditions and viewpoints. Index Terms—High-resolution satellite images, mesh optimiza- tion, piecewise linear (PWL) registration. I. I NTRODUCTION I MAGE registration is an essential step in many remote sens- ing applications like image fusion, change detection, 3-D scene reconstruction, etc. In this process, one image remains ﬁxed (the reference image), whereas the other (the input or moving image), which is acquired on a different date, from a different viewpoint and/or using a different sensor, is spatially transformed until ﬁtting with the ﬁrst one. Traditionally, the registration process is dealt with in two stages. In the ﬁrst one, the positions of a set of pairs of corresponding points are identiﬁed in the images, and in the second stage, this set of correspondence pairs is exploited to robustly estimate a map- ping function which is applied to transform all the pixels of the Manuscript received March 29, 2007; revised October 25, 2007. Current version published October 30, 2008. This work was supported in part by the Spanish Government under Research Contracts CICYT DPI-2005-01391 and AECI PCI-A-7286-06. The authors are with the Department of System Engineering and Automa- tion, University of Málaga, 29071 Málaga, Spain (e-mail: varevalo@ctima. uma.es; jgonzalez@ctima.uma.es). Color versions of one or more of the ﬁgures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identiﬁer 10.1109/TGRS.2008.924003 input image onto the reference one (some kind of interpolation is required in this step) [1]. A variety of mapping functions have been reported in the lit- erature for image registration, including polynomial [2], radial basis functions [3], piecewise linear (PWL) [4] or piecewise- cubic [5] functions, multiquadric functions [6], B-spline func- tions [7], etc. In remote sensing, global polynomial (POL) functions usually perform well with low and medium reso- lution images (National Oceanic and Atmospheric Adminis- tration Advanced Very High Resolution Radiometer, EarthSat, Landsat, Indian Remote Sensing, etc.) but may not be powerful enough to register high-resolution ones (QuickBird, Ikonos, or OrbView), where geometric distortions may become important to attain short revisit time, and high-resolution satellites pitch along their orbit to observe the scene from off nadir; thus, two (temporal) images of a certain scene may have been acquired from quite different angles, which entails large local geometric differences, particularly in urban scenarios and high- relief terrain. For registering such images, PWL functions are particularly suitable (as revealed in [8]), because they divide the images into a mesh of triangular patches, which are individually registered through linear transformations. In this registration method, it is of particular relevance—the case where the camera projection can be approximated by a paraperspective one. Under this assumption, provided that two corresponding image triangles come from the projection of a planar patch of the scene, they must perfectly overlap for a certain afﬁne transformation [9] (see Fig. 1). It is clear that, in order to meet such desirable condition for all mesh triangles, the mesh cannot be an arbitrary one, but it must fulﬁl some geometrical and topological requirements. Current approaches for PWL registration (including those com- mercially available in packages such as ERDAS, ENVI, PCI, etc.) create the triangular meshes for the two images from a set of correspondence (conjugate) pairs, which are localized either automatically or by hand. Typically, they are generated by using the Delaunay’s triangulation method [10] (or other similar one), which produces triangles of balanced size and shape but are not optimal for covering as many planar patches as possible (see Fig. 2). The aim of this paper is to modify the topology (not the vertices) of a given initial mesh by iteratively swapping its edges in order to improve the global registration of a pair of images. This process can be seen as an optimization procedure that, at each step, focuses on a particular edge and improves the registration consistency of the quadrilateral formed by the 0196-2892/$25.00 © 2008 IEEE Authorized licensed use limited to: Universidad de Malaga. Downloaded on November 25, 2008 at 07:09 from IEEE Xplore. Restrictions apply.