FINGERPRINT MATCHING BASED ON DISTANCE METRIC LEARNING Dalwon Jang and Chang D. Yoo Div. of EE, School of EECS, KAIST, Yuseong Gu, Daejeon 305-701, Korea dal1@kaist.ac.kr and cdyoo@ee.kaist.ac.kr ABSTRACT This paper considers a method for learning a distance metric in a fingerprinting system which identifies a query content by measuring the distance between its fingerprint and a finger- print stored in a database. A metric having a general form of the Mahalanobis distance is learned with the goal that the dis- tance between fingerprints extracted from perceptually simi- lar contents should be smaller than the distance between fin- gerprints extracted from perceptually dissimilar contents. The metric is learned by minimizing a cost function designed to achieve the goal. The cost function is convex, and the global minimum can be obtained using convex optimization. In our experiment, the distance metric learning is applied in an au- dio fingerprinting system, and it is experimentally shown that the learned distance metric improves the identification perfor- mance. Index Terms— Fingerprinting, Identification, Distance measurement 1. INTRODUCTION There is a growing demand for protecting, managing, and in- dexing digital content, and as a viable solution, fingerprinting is receiving increased attention. Fingerprinting is a technique that identifies an unknown content using a short feature vector called fingerprint. In recent years, various audio/video/image fingerprinting systems have been proposed [1]-[7]. A fingerprinting system for content identification gener- ally consists of three essential components: fingerprint ex- traction, database (DB) search, and fingerprint matching [4]. In the fingerprint extraction process, a query fingerprint is ex- tracted from a query content. In the DB search process, a set of candidate fingerprints from a DB close to the query fingerprint are obtained. In the fingerprint matching process, the distances between the candidate fingerprints and the query fingerprint are computed based on a distance metric. The fin- gerprinting system provides the meta-data associated with the closest candidate fingerprint. This work was supported by the Korea Research Foundation Grant funded by the Korean Government(MOEHRD, Basic Research Promotion Fund)(KRF-2008-314- D00309) The fingerprint extraction and matching processes influ- ence the identification performance more than the DB search process which determines the computational efficiency of the system. The identification performance depends highly on the distance metric used in fingerprint matching process. In this paper, a method for learning a distance metric in fingerprint matching is considered [8, 9, 10, 11]. In recent years, various literatures have shown that distance metric learn- ing can improve classification and clustering performances [11]. The distance metric used in previous fingerprinting sys- tems, which is not determined by learning, may not be suit- able to the fingerprint used in the fingerprinting system and the distortions, thus the identification performance can be im- proved by metric learning. By learning a distance metric from training data consist- ing of original and distorted contents, the identification per- formance can be improved. Fingerprints of original contents are assumed to be fingerprints stored in a DB, and finger- prints of distorted contents are assumed to be the query fin- gerprints. For correct identification, the distance of the fin- gerprint of a distorted content to the fingerprint of the orig- inal content from which the distorted content was obtained - called hereafter corresponding content - should be smaller than the distance to fingerprints of other original contents - called hereafter non-corresponding contents. A large distance margin should be established between fingerprints of the dis- torted and non-corresponding contents [10]. This is the goal of the distance metric learning considered in this paper, and specifically a distance metric having a general form of the Mahalanobis distance is considered. A cost function to be minimized is designed so that the cost increases when the fin- gerprint of the distorted content is further away from the fin- gerprint of the corresponding content than from fingerprints of non-corresponding contents. The parameter of the distance metric is determined by minimizing the cost function by con- vex optimization. We assume that the fingerprint is real val- ued, thus the distance metric learning considered in this paper is effective only for the real-valued fingerprint. The remainder of this paper is organized as follows. Sec- tion 2 explains the distance metric, and Section 3 explains the cost function used to learn the distance metric. Section 4 presents the experimental results, and Section 5 concludes the paper.