458 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 2, APRIL 2006 Illumination Compensation and Normalization for Robust Face Recognition Using Discrete Cosine Transform in Logarithm Domain Weilong Chen, Meng Joo Er, Member, IEEE, and Shiqian Wu, Member, IEEE Abstract—This paper presents a novel illumination normalization ap- proach for face recognition under varying lighting conditions. In the pro- posed approach, a discrete cosine transform (DCT) is employed to compen- sate for illumination variations in the logarithm domain. Since illumination variations mainly lie in the low-frequency band, an appropriate number of DCT coefficients are truncated to minimize variations under different lighting conditions. Experimental results on the Yale B database and CMU PIE database show that the proposed approach improves the performance significantly for the face images with large illumination variations. More- over, the advantage of our approach is that it does not require any modeling steps and can be easily implemented in a real-time face recognition system. Index Terms—Discrete cosine transform, face recognition, illumination normalization, logarithm transform. I. INTRODUCTION Face recognition has attracted significant attention because of its wide range of applications [1]. Recently, more researchers focus on robust face recognition such as face recognition systems invariant to pose, expression and illumination variations. Illumination variation is still a challenging problem in face recognition research area, especially for appearance-based approaches. The same person can appear greatly different under varying lighting conditions. A variety of approaches have been proposed to solve the problem [3]–[16]. These approaches can be generally classified into three main categories. Preprocessing and Normalization: In this approach, face images are preprocessed using some image processing tech- niques to normalize the images to appear stable under different lighting conditions. For instance, histogram equalization (HE), Gamma correction, logarithm transform, etc. are widely used for illumination normalization [3], [4]. However, nonuniform illumination variation is still difficult to deal with using these global processing techniques. Recently, adaptive histogram equalization (AHE) [2], region-based histogram equalization (RHE) [3], and block-based histogram equalization (BHE) [5] have also been proposed to cope with nonuniform illumination variations. Although recognition rates on face databases with nonuniform illumination variations can be improved compared with the HE, their performances are still not satisfactory. In [13], by combining symmetric shape-from-shading (SSFS) and a generic three-dimensional (3-D) model, the performance of face recognition under varying illuminations is enhanced. However, this method is only efficient for exact frontal face images and it is assumed that all faces share a similar common Manuscript received October 16, 2004; revised March 9, 2005. This paper was recommended by Associate Editor Maja Pantic. W. Chen is with the Computer Control Lab, Nanyang Technological Univer- sity, SIngapore 639798. M. J. Er is with the Intelligent Systems Centre, Nanyang Technological Uni- versity, Singapore 637533 (e-mail: EMJER@ntu.edu.sg). S. Wu is with the Institute for Inforcomm Research, Nanyang Technological University, Singapore 119613. Digital Object Identifier 10.1109/TSMCB.2005.857353 shape. In [3], the authors proposed a normalization method called quotient illumination relighting (QIR). This method is based on the assumption that the lighting modes of the images are known or can be estimated. Invariant Feature Extraction: This approach attempts to ex- tract facial features which are invariant to illumination varia- tions. Edge maps, derivatives of the gray-level and Gabor-like filters are investigated in [9]. However, empirical studies show that none of these representations are sufficient to overcome image variations due to changes in the direction of illumination. Another well-known feature extraction method is called Fisher- face [also known as linear discriminant analysis (LDA)] which linearly projects the image space to a low-dimensional subspace to discount variations in lighting and facial expressions [11]. But, this method is a statistical linear projection method which largely relies on representativeness of the training samples. In [12], the quotient image is regarded as the illumination invariant signature image which can be used for face recognition under varying lighting conditions. Bootstrap database is required for this method and the performance degrades when dominant fea- tures between the bootstrap set and the test set are misaligned. Face Modeling: Illumination variations are mainly due to the 3-D shape of human faces under lighting in different directions. Recently, some researchers attempt to construct a generative 3-D face model that can be used to render face images with different poses and under varying lighting conditions [6], [7], [10] and [14]. A generative model called illumination cone was presented in [6], [7]. The main idea of this method is that the set of face images in fixed pose but under different illumination conditions can be represented using an illumination convex cone which can be constructed from a number of images acquired under variable lighting conditions and the illumination cone can be approxi- mated in a low-dimensional linear subspace. In [10], the authors showed that the set of images of a convex Lambertian object obtained under a variety of lighting conditions can be well ap- proximated by a 9D linear subspace. One of the drawbacks of the model-based approaches is that a number of images of the subject under varying lighting conditions or 3-D shape informa- tion are needed during the training phase. This drawback limits its applications in practical face recognition systems. In addi- tion, existing model-based approaches assume that the human face is a convex object, i.e., the casting shadows are not consid- ered. The specularity problem is also ignored even though the human face is not a perfect Lambertian surface. To the best of our knowledge, one ideal way of solving the illumina- tion variation problem is to normalize a face image to a standard form under uniform lighting conditions. In fact, the human visual system usually cares about the main features of a face, such as the shapes and relative positions of the main facial features, and ignores illumination changes on the face while recognizing a person. Accordingly, in this paper, we propose an illumination normalization approach to remove illumination variations while keeping the main facial features unim- paired. The key idea of the proposed approach is that illumination vari- ations can be significantly reduced by truncating low-frequency dis- crete cosine transform (DCT) coefficients in the logarithm DCT do- main. Our approach can be categorized into the first approach group although feature extraction can be carried out directly in the logarithm DCT domain. 1083-4419/$20.00 © 2006 IEEE