Usha Lawrance, Josephine and Nayahi Jesudhasan, Jesu Vedha (2021) Privacy Preserving Parallel Clustering Based Anonymization for Big Data Using MapReduce Framework. Applied Artificial Intelligence, 35 (15). pp. 1587-1620. ISSN 0883-9514
Privacy Preserving Parallel Clustering Based Anonymization for Big Data Using MapReduce Framework.pdf - Published Version
Download (8MB)
Abstract
Big data refers to a massive volume of data collected from heterogeneous data sources including data collected from Internet of Things (IoT) devices. Big data analytics is playing a crucial role in extracting patterns that would benefit efficient and effective decision making. Processing this massive volume of data poses several critical issues such as scalability, security and privacy. To preserve data privacy, numerous privacy-preserving data mining and publishing techniques exist. Data anonymization utilizing data mining techniques for preserving an individual’s privacy is a promising approach to prevent the data against identity disclosure. In this paper, a Parallel Clustering based Anonymization Algorithm (PCAA) is proposed, and the results prove that the algorithm is scalable and also achieves a better tradeoff between privacy and utility. The MapReduce framework is used to parallelize the anonymization process for handling a huge volume of data. The algorithm performs well in terms of classification accuracy, F-measure, and Kullback–Leibler divergence metrics. Moreover, the big data generated from heterogeneous data sources are efficiently protected to meet the ever-growing requirements of the application.
Item Type: | Article |
---|---|
Subjects: | Opene Prints > Computer Science |
Depositing User: | Managing Editor |
Date Deposited: | 17 Jun 2023 05:08 |
Last Modified: | 02 Nov 2023 06:07 |
URI: | http://geographical.go2journals.com/id/eprint/2183 |