http://www.iaeme.com/IJARET/index.asp 350 editor@iaeme.com
International Journal of Advanced Research in Engineering and Technology (IJARET)
Volume 11, Issue 7, July 2020, pp. 350-362, Article ID: IJARET_11_07_035
Available online athttp://www.iaeme.com/IJARET/issues.asp?JType=IJARET&VType=11&IType=7
ISSN Print: 0976-6480 and ISSN Online: 0976-6499
DOI: 10.34218/IJARET.11.7.2020.035
© IAEME Publication Scopus Indexed
PRIVACY PRESERVING RECORD LINKAGE
USING PHONETIC AND BLOOM FILTER
ENCODING
Vijay Maruti Shelake, Dr. Narendra M. Shekokar
Department of Computer Engineering,
D. J. Sanghvi College of Engineering,
Mumbai, Maharashtra, India
ABSTRACT
Now-a-days, there is an increasing demand for data integration and analytics due
to the availability of huge amount of records in multiple data sets. In data integration,
record linkage gains prominent importance to identify and match records across data
sets that belong to the same person. Record linkage becomes complicated with
presence of erroneous identifiers and hence needs approximate matching to find
similarities between the same person records. Also, the data sharing for record
linkage can lead to disclosure of confidential information about the personal records.
Thus, Privacy preserving record linkage (PPRL) involves detecting and matching of
records among two or more data sets in a secure manner. It is useful for the purpose
of research activities and analysis across wide application areas. The utilization of
Bloom filter encoding with its hardened versions are suitable for approximate
matching in PPRL, but some of them are vulnerable to re-identification attacks while
others reduce linkage accuracy. Moreover, phonetic encoding can provide robust
matching with its inherent security characteristic for PPRL. However, most of existing
PPRL techniques had attempted to provide privacy while compromising the linkage
accuracy. This research focuses on designing a new approach named as two factor
encoding for PPRL (2FE-PPRL) using phonetic and Bloom filter encoding to achieve
increased linkage accuracy while maintaining privacy. Our 2FE-PPRL approach
depicts better results than existing PPRL techniques Phonetic and Bloom Filter
encoding as analyzed through precision, recall and f-measure.
Key words: Record linkage, data integration, privacy preserving, phonetic encoding,
Bloom filter
Cite this Article: Vijay Maruti Shelake and Dr. Narendra M. Shekokar, Privacy
Preserving Record Linkage Using Phonetic and Bloom Filter Encoding, International
Journal of Advanced Research in Engineering and Technology, 11(7), 2020,
pp. 350-362.
http://www.iaeme.com/IJARET/issues.asp?JType=IJARET&VType=11&IType=7