Polar representation of covariance descriptors for circular features G. Gualdi, A. Prati and R. Cucchiara The use of polar representation of covariance descriptors, suitable for the classiﬁcation of circular feature sets, is proposed. It overcomes the implicit limits of state-of-the-art methods based on axis-oriented rectangular patches. The suitability of the proposed solution is veriﬁed on two case studies, namely head detection and polymer classiﬁcation in photomicrograph contexts. Introduction: Object classiﬁcation exploiting covariance descriptors on Riemannian manifolds [1] has attracted much attention thanks to its broad applicability and high accuracy, e.g. for people detection and texture classiﬁcation [1, 2]. This approach exploits integral images, used for efﬁcient covariance descriptor computation [2], and axis-oriented rectangular patches (that are sub-windows of the image to classify). When this method is applied to classify objects with non-rectangular shape (e.g. holes, heads, wheels, etc.), the performance in terms of classi- ﬁcation accuracy degrades owing to the inclusion of non-discriminative pixels within the rectangular patches. Locating circles in images has been deeply explored in the literature, for both robotic or industrial applications [3], and 3D object reconstruc- tion or trafﬁc sign recognition [4]. All the proposed methods rely on ﬁtting the pixel values or edge points with a certain parametric function, which is difﬁcult to generalise and heavily affected by the unfavourable correlation between strong false positives and weak true positives [4]. This is a typical limit of parametric approaches, such as Hough trans- forms. For this reason, [4] proposed to measure a curve’s distinctiveness through a one-parameter family of curves, in order to gain in accuracy. Moreover, most of these methods are highly time-consuming, tackling the problem as an optimised search in highly dimensional spaces. In [3] the problem is formulated as a maximum likelihood estimator and the method is proved to be fast and accurate also in the case of occlu- sions, but it relies on the good extraction of the points describing the curves. To overcome these problems, this Letter proposes the extension of classiﬁcation based on covariance descriptors to the case of circular and concave features, by using a polar representation which unrolls the slice of an annulus in a rectangular patch. This approach is suitable also in all cases where the objects to classify are not easily modelled by parametric curves or precise edges cannot be extracted owing to the complexity of the scenes. Finally, this Letter also shows that the compu- tation of covariance descriptors using multi-spectral (colour) image derivatives yields more accurate results than using plain grey-level derivatives, as proposed in [1]. a b c d Fig. 1 Bridging from traditional axis-orientated patches to circular ones, through polar transformation a Some rectangular patches used by classiﬁer proposed in [1] b Polar transformation of a w.r.t. image centre c Rectangular patch on polar image d Transformation of patch in c onto original image Cascade of boosting classiﬁers based on covariance descriptors: Without giving details, the classiﬁer proposed in [1] is based on a rejection cascade of LogitBoost classiﬁers (the strong classiﬁers), each composed of a sequence of logistic regressors (the weak classiﬁers). The domain of the regressors is the space of symmetric matrices, obtained through an expo- nential mapping from the Riemannian manifold of covariance matrices. Given an input image I and the following eight-dimensional set F of fea- tures (deﬁned over each pixel of I ): F =[x, y, |I x |,|I y |,  I 2 x + I 2 y  , |I xx |,|I yy |, arctan|I y |/|I x |] (1) where x and y are the pixel coordinates, I x , I y and I xx , I yy are, respectively, the ﬁrst- and second-order derivatives of the image intensity, the covari- ance matrices are computed from the set of features F over axis-oriented rectangular patches of I. Each weak classiﬁer is associated one-to-one with a rectangular patch (see Fig. 1a). Polar representation for covariance descriptors: Aiming to classify circular features, the use of patches with generic circular shapes would catch variations more accurately than just using axis-oriented rec- tangular shapes. Indeed, using circles or annulus would exclude from the covariance matrix computation all the pixels that do not strictly belong to the circular shape to recognise. To save accurate classiﬁcation and still exploit integral images (which need axis-oriented rectangular patches), we propose the use of polar images; having deﬁned a reference point C (x C , y C ), r 2 =(x − x C ) 2 +(y − y C ) 2 and q = arctan((y − y C )/ (x − x C )), the polar image of I(x, y) w.r.t. to point C is I p ( p, q) (see Fig. 1b); given an image with centre C and its polar transformation (w.r.t. C ), any slice of annulus on the original image centred in C can be represented as an axis-oriented rectangular patch on the polar image. Therefore, the polar transformation creates a bridge between the circular patches (useful for classiﬁcation purposes) and the rectangu- lar patches (needed by the intrinsic classiﬁer architecture); given an image to classify, as a ﬁrst step the polar image transformation is com- puted and then the classiﬁers are applied: the rectangular patches over the polar image, used by the weak classiﬁers, represent a slice of annulus over the original image, and this provides a classiﬁer more suited to circular shape classiﬁcation (see Figs. 1c and d). Multi-spectral image derivatives for covariance descriptors: In appear- ance-based object classiﬁcation, it is common to avoid the use of chro- minance since in most cases colour does not convey any discriminative information. Instead, since colour can be used successfully to compute image derivatives which are more accurate w.r.t. luminance only images [5], we claim that the use of chrominance can improve the classi- ﬁcation results. Considering the RGB and Lab colour spaces, in order to compute covariance descriptors sensitive to the chrominance, we deﬁne I RGB x =  ∂R ∂x         2 + ∂G ∂x         2 + ∂B ∂x         2  ; I RGB xx =  ∂ 2 R ∂x 2         2 + ∂ 2 G ∂x 2         2 + ∂ 2 B ∂x 2         2  (2) (and by analogy we deﬁne I RGB y , I RGB yy , I Lab x , I Lab xx , I Lab y , I Lab yy ). Exploiting (2), we then extend (1) to RGB and Lab colour spaces, deﬁning F RGB and F Lab . 0.005 0.05 0.0003 0.003 0.03 miss rate FPPW RGB-polar RGB-straight Lab-polar Lab-straight grey-polar grey-straight Fig. 2 Miss rate (MR) against false positive per window (FPPW) on head images dataset Each marker represents performance up to cascade level. Curves are plotted from cascade 5 (starting at lower right corner) up to 18 (moving towards upper left corner) Experimental results: We tested the proposed approach in head and polymer detection. In the ﬁrst case, the classiﬁer is applied to ﬁnd the exact position of the head of pedestrians, after having located them within an image through a pedestrian detector [1]. Candidate head locations are searched around the upper body of the detected pedestrians. In the second case, the classiﬁer is applied to examine photomicro- graphy image datasets and automatically extract the images that ELECTRONICS LETTERS 22nd July 2010 Vol. 46 No. 15 Authorized licensed use limited to: UNIVERSITA MODENA. Downloaded on August 05,2010 at 09:12:23 UTC from IEEE Xplore. Restrictions apply.