Privacy preserving based logistic regression on big data

Title

Privacy preserving based logistic regression on big data

Subject

Machine learning
Big data
Regression analysis
Digital storage
Computation theory
Computational efficiency
Learning algorithms
Cloud computing
Computing power
Privacy-preserving techniques

Description

Cloud computing has strong computing power and huge storage space. Machine learning algorithm, combining with cloud computing, makes the processing of large-scale data practical. Logistic regression algorithm is a widely popular machine learning-based classification algorithm that can be implemented in cloud. However, data privacy cannot be guaranteed in big data processing as privacy leakage of the training data may occur. In order to prevent the privacy leakage of logistic regression algorithm in the cloud and promote the processing efficiency of training data, this paper offers a Privacy Preserving Logistic Regression Algorithm (PPLRA). The homomorphic encryption is used to encrypt the private data when they are uploaded for training. Moreover, the approximation of the Sigmoid function in logistic regression using Taylor's theorem can support the safe calculation using homomorphic encryption. The Experimental results show that PPLRA has significant effects in data privacy preserving, and is more effective in data processing. Comparison with Non-Privacy Preserving Logistic Regression Algorithm (NPPLRA) shows that the computational efficiency is improved by about 1.2 times. 2020 Elsevier Ltd
171

Creator

Fan, Yongkai
Bai, Jianrong
Lei, Xia
Zhang, Yuqing
Zhang, Bin
Li, Kuan-Ching
Tan, Gang

Publisher

Journal of Network and Computer Applications

Date

2020

Type

journalArticle

Identifier

10848045
10.1016/j.jnca.2020.102769

Citation

Fan, Yongkai et al., “Privacy preserving based logistic regression on big data,” Lamar University Midstream Center Research, accessed May 18, 2024, https://lumc.omeka.net/items/show/29150.

Output Formats