Polar representation of covariance descriptors for circular features G. Gualdi, A. Prati and R. Cucchiara The use of polar representation of covariance descriptors, suitable for the classification of circular feature sets, is proposed. It overcomes the implicit limits of state-of-the-art methods based on axis-oriented rectangular patches. The suitability of the proposed solution is verified on two case studies, namely head detection and polymer classification in photomicrograph contexts. Introduction: Object classification exploiting covariance descriptors on Riemannian manifolds [1] has attracted much attention thanks to its broad applicability and high accuracy, e.g. for people detection and texture classification [1, 2]. This approach exploits integral images, used for efficient covariance descriptor computation [2], and axis-oriented rectangular patches (that are sub-windows of the image to classify). When this method is applied to classify objects with non-rectangular shape (e.g. holes, heads, wheels, etc.), the performance in terms of classi- fication accuracy degrades owing to the inclusion of non-discriminative pixels within the rectangular patches. Locating circles in images has been deeply explored in the literature, for both robotic or industrial applications [3], and 3D object reconstruc- tion or traffic sign recognition [4]. All the proposed methods rely on fitting the pixel values or edge points with a certain parametric function, which is difficult to generalise and heavily affected by the unfavourable correlation between strong false positives and weak true positives [4]. This is a typical limit of parametric approaches, such as Hough trans- forms. For this reason, [4] proposed to measure a curve’s distinctiveness through a one-parameter family of curves, in order to gain in accuracy. Moreover, most of these methods are highly time-consuming, tackling the problem as an optimised search in highly dimensional spaces. In [3] the problem is formulated as a maximum likelihood estimator and the method is proved to be fast and accurate also in the case of occlu- sions, but it relies on the good extraction of the points describing the curves. To overcome these problems, this Letter proposes the extension of classification based on covariance descriptors to the case of circular and concave features, by using a polar representation which unrolls the slice of an annulus in a rectangular patch. This approach is suitable also in all cases where the objects to classify are not easily modelled by parametric curves or precise edges cannot be extracted owing to the complexity of the scenes. Finally, this Letter also shows that the compu- tation of covariance descriptors using multi-spectral (colour) image derivatives yields more accurate results than using plain grey-level derivatives, as proposed in [1]. a b c d Fig. 1 Bridging from traditional axis-orientated patches to circular ones, through polar transformation a Some rectangular patches used by classifier proposed in [1] b Polar transformation of a w.r.t. image centre c Rectangular patch on polar image d Transformation of patch in c onto original image Cascade of boosting classifiers based on covariance descriptors: Without giving details, the classifier proposed in [1] is based on a rejection cascade of LogitBoost classifiers (the strong classifiers), each composed of a sequence of logistic regressors (the weak classifiers). The domain of the regressors is the space of symmetric matrices, obtained through an expo- nential mapping from the Riemannian manifold of covariance matrices. Given an input image I and the following eight-dimensional set F of fea- tures (defined over each pixel of I ): F =[x, y, |I x |,|I y |, I 2 x + I 2 y , |I xx |,|I yy |, arctan|I y |/|I x |] (1) where x and y are the pixel coordinates, I x , I y and I xx , I yy are, respectively, the first- and second-order derivatives of the image intensity, the covari- ance matrices are computed from the set of features F over axis-oriented rectangular patches of I. Each weak classifier is associated one-to-one with a rectangular patch (see Fig. 1a). Polar representation for covariance descriptors: Aiming to classify circular features, the use of patches with generic circular shapes would catch variations more accurately than just using axis-oriented rec- tangular shapes. Indeed, using circles or annulus would exclude from the covariance matrix computation all the pixels that do not strictly belong to the circular shape to recognise. To save accurate classification and still exploit integral images (which need axis-oriented rectangular patches), we propose the use of polar images; having defined a reference point C (x C , y C ), r 2 =(x − x C ) 2 +(y − y C ) 2 and q = arctan((y − y C )/ (x − x C )), the polar image of I(x, y) w.r.t. to point C is I p ( p, q) (see Fig. 1b); given an image with centre C and its polar transformation (w.r.t. C ), any slice of annulus on the original image centred in C can be represented as an axis-oriented rectangular patch on the polar image. Therefore, the polar transformation creates a bridge between the circular patches (useful for classification purposes) and the rectangu- lar patches (needed by the intrinsic classifier architecture); given an image to classify, as a first step the polar image transformation is com- puted and then the classifiers are applied: the rectangular patches over the polar image, used by the weak classifiers, represent a slice of annulus over the original image, and this provides a classifier more suited to circular shape classification (see Figs. 1c and d). Multi-spectral image derivatives for covariance descriptors: In appear- ance-based object classification, it is common to avoid the use of chro- minance since in most cases colour does not convey any discriminative information. Instead, since colour can be used successfully to compute image derivatives which are more accurate w.r.t. luminance only images [5], we claim that the use of chrominance can improve the classi- fication results. Considering the RGB and Lab colour spaces, in order to compute covariance descriptors sensitive to the chrominance, we define I RGB x = ∂R ∂x 2 + ∂G ∂x 2 + ∂B ∂x 2 ; I RGB xx = ∂ 2 R ∂x 2 2 + ∂ 2 G ∂x 2 2 + ∂ 2 B ∂x 2 2 (2) (and by analogy we define I RGB y , I RGB yy , I Lab x , I Lab xx , I Lab y , I Lab yy ). Exploiting (2), we then extend (1) to RGB and Lab colour spaces, defining F RGB and F Lab . 0.005 0.05 0.0003 0.003 0.03 miss rate FPPW RGB-polar RGB-straight Lab-polar Lab-straight grey-polar grey-straight Fig. 2 Miss rate (MR) against false positive per window (FPPW) on head images dataset Each marker represents performance up to cascade level. Curves are plotted from cascade 5 (starting at lower right corner) up to 18 (moving towards upper left corner) Experimental results: We tested the proposed approach in head and polymer detection. In the first case, the classifier is applied to find the exact position of the head of pedestrians, after having located them within an image through a pedestrian detector [1]. Candidate head locations are searched around the upper body of the detected pedestrians. In the second case, the classifier is applied to examine photomicro- graphy image datasets and automatically extract the images that ELECTRONICS LETTERS 22nd July 2010 Vol. 46 No. 15 Authorized licensed use limited to: UNIVERSITA MODENA. Downloaded on August 05,2010 at 09:12:23 UTC from IEEE Xplore. Restrictions apply.