FACE RECOGNITION USING ORTHO-DIFFUSION BASES Sravan Gudivada and Adrian G. Bors Dept. of Computer Science, University of York, York YO10 5GH, UK E-mail: adrian.bors@york.ac.uk ABSTRACT This paper proposes a new approach for face recognition by representing inter-face variation using orthogonal decomposi- tions with embedded diffusion. The modiﬁed Gram-Schmidt with pivoting the columns orthogonal decomposition, called also QR algorithm, is applied recursively to the covariance matrix of a set of images forming the training set. At each re- cursion a set of orthonormal bases functions are extracted for a speciﬁc scale. A diffusion step is embedded at each scale in the QR decomposition. The algorithm models the main vari- ations of face features from the training set by preserving only the most signiﬁcant bases while eliminating noise and non- essential features. Each face is represented by a weighted sum of such representative bases functions, called ortho-diffusion faces. Index Terms— Diffusion wavelets, Face recognition, Gram-Schmidt orthogonal decomposition, Eigenfaces. 1. INTRODUCTION A challenge in complex data analysis is when we have to re- cover a low dimensional intrinsic manifold which manifests itself through a very complex representation in the observable space. The use of kernels on undirected graphs was shown to lead to good results in machine learning tasks. Various approaches have been adopted for modeling data represen- tations using diffusion processes on graphs [1, 2, 3]. Diffu- sion maps [2, 3, 4] achieve dimensionality reduction by re- organizing data according to the parametrization of its under- lying geometry on orthogonal sub-spaces. When the diffusion is propagated, it integrates the local data structure to reveal relational properties of the data set at different scales [3, 4]. Maggioni and Mahadevan proposed in [5] a new multi-scale orthogonal decomposition on graphs into sets of bases func- tions and diffusion wavelets. One of the classical face recognition methods was the eigenfaces method proposed by Turk and Pentland [6]. Each face is represented as a linear combination of the eigenvec- tors resulting from eigendecomposing the covariance matrix of a training set of faces. Extensions of this method include the Fisherfaces [7], the kernel PCA method [8] and the Lapla- cianfaces [9]. Orthogonal Laplacianfaces approach was pro- posed in [10] while a multilinear discriminant analysis was employed in [11] for face recognition. Nevertheless, human identiﬁcation depends on incorporating multi-modal human features such as the 3-D appearance and speech characteris- tics as well [12]. In this paper we propose to use orthonormal diffusion wavelet methodology for face recognition. The modiﬁed Gram-Schmidt algorithm with pivoting the columns QR, is used recursively for extracting a set of orthogonal basis functions, each representing an ortho-diffusion face. At each recursion, we consider data diffusion on the graph leading to the application of the QR algorithm in the following step on a dilated scale of the data. Ortho-diffusion bases which do not represent signiﬁcant information are removed from further processing at each level. Each human face is deﬁned by its projections onto the ortho-diffusion face space. In the testing stage, faces are classiﬁed according to the nearest neigh- borhood to the training data deﬁned in the ortho-diffusion face space. The proposed ortho-diffusion faces method is described in Section 2. The face recognition method using ortho-diffusion faces is detailed in Section 3. In Section 4 we provide the experimental results while the conclusions are given in Section 5. 2. ORTHONORMAL DECOMPOSITIONS USING QR ALGORITHM Let us consider a training set of M face images {I i |,i = 1,...,M }, of size m × n, and consider each of them as a vector of mn pixel entries. In the following we model the face variation within the given training set by calculating the deviation of each face from the mean face as in [6]: ¯ I = 1 M M  i=1 I i , (1) where I i represents the mean face S i = I i − ¯ I. (2) Each S i forms a column in a matrix A of size M × mn. The spread of the face variation within the training set is calcu- lated by means of the covariance matrix C: C = A τ A (3) where matrix C is the diagonal covariance matrix of the train- ing set, of size M × mn, with each column corresponding to 20th European Signal Processing Conference (EUSIPCO 2012) Bucharest, Romania, August 27 - 31, 2012 © EURASIP, 2012 - ISSN 2076-1465 1578