Skip to main content
placeholder image

A multistage protocol for aggregated queries in distributed cloud databases with privacy protection

Journal Article


Abstract


  • © 2018 This article is devoted to the novel situation, where a large distributed cloud database is a union of several separate databases belonging to individual database owners who are not allowed to transfer their data for storage in locations different from their already chosen separate cloud service providers. For example, a very large number of medical records may be stored in a distributed cloud database, which is a union of several separate databases from different hospitals, or even from different countries. The owners of the databases may need to provide answers to certain common aggregated queries using all information available without sharing or transferring all data. It is necessary to minimize the communication costs, improve efficiency, and comply with the legal requirements protecting the privacy of confidential data. In this situation, it is impossible to aggregate the whole database in one location, but effective methods for answers to the aggregated queries with privacy protection are required. To solve this important problem, the present article proposes a Multistage Separate Query Processing (MSQP) protocol employing homomorphic encryption with split keys. We show that our protocol can answer a large class of natural queries of practical significance. The running time of the MSQP protocol is O(d+[Formula presented]), where d is the number of database owners and m is the total number of records in the whole database. In practice, d is small, m can be very large, and so the running time is O(m). This means that the protocol is very efficient for large databases. It dramatically reduces the communication costs of computation and completely eliminates the need for exchange of confidential data. We define a new generalized additive homomorphic property and introduce a Multipart ElGamal Cryptosystem (MEC) with split keys, which enjoys this property. MEC is a novel modification of the ElGamal cryptosystem with split keys. This paper presents the results of extensive experiments evaluating the effectiveness of the MSQP protocol employing MEC and comparing it with MSQP employing the ElGamal cryptosystem, for a collection of publicly available medical datasets. The experiments evaluating our protocol on 11 real-life databases and a synthetic database demonstrate that the MSQP protocol employing MEC is more efficient than other options and can be recommended for practical implementations.

UOW Authors


  •   Kelarev, Andrei (external author)
  •   Yi, Xun (external author)
  •   Badsha, Shahriar (external author)
  •   Yang, Xuechao (external author)
  •   Rylands, Leanne (external author)
  •   Seberry, Jennifer

Publication Date


  • 2019

Citation


  • Kelarev, A., Yi, X., Badsha, S., Yang, X., Rylands, L. & Seberry, J. (2019). A multistage protocol for aggregated queries in distributed cloud databases with privacy protection. Future Generation Computer Systems, 90 368-380.

Scopus Eid


  • 2-s2.0-85051959100

Number Of Pages


  • 12

Start Page


  • 368

End Page


  • 380

Volume


  • 90

Place Of Publication


  • Netherlands

Abstract


  • © 2018 This article is devoted to the novel situation, where a large distributed cloud database is a union of several separate databases belonging to individual database owners who are not allowed to transfer their data for storage in locations different from their already chosen separate cloud service providers. For example, a very large number of medical records may be stored in a distributed cloud database, which is a union of several separate databases from different hospitals, or even from different countries. The owners of the databases may need to provide answers to certain common aggregated queries using all information available without sharing or transferring all data. It is necessary to minimize the communication costs, improve efficiency, and comply with the legal requirements protecting the privacy of confidential data. In this situation, it is impossible to aggregate the whole database in one location, but effective methods for answers to the aggregated queries with privacy protection are required. To solve this important problem, the present article proposes a Multistage Separate Query Processing (MSQP) protocol employing homomorphic encryption with split keys. We show that our protocol can answer a large class of natural queries of practical significance. The running time of the MSQP protocol is O(d+[Formula presented]), where d is the number of database owners and m is the total number of records in the whole database. In practice, d is small, m can be very large, and so the running time is O(m). This means that the protocol is very efficient for large databases. It dramatically reduces the communication costs of computation and completely eliminates the need for exchange of confidential data. We define a new generalized additive homomorphic property and introduce a Multipart ElGamal Cryptosystem (MEC) with split keys, which enjoys this property. MEC is a novel modification of the ElGamal cryptosystem with split keys. This paper presents the results of extensive experiments evaluating the effectiveness of the MSQP protocol employing MEC and comparing it with MSQP employing the ElGamal cryptosystem, for a collection of publicly available medical datasets. The experiments evaluating our protocol on 11 real-life databases and a synthetic database demonstrate that the MSQP protocol employing MEC is more efficient than other options and can be recommended for practical implementations.

UOW Authors


  •   Kelarev, Andrei (external author)
  •   Yi, Xun (external author)
  •   Badsha, Shahriar (external author)
  •   Yang, Xuechao (external author)
  •   Rylands, Leanne (external author)
  •   Seberry, Jennifer

Publication Date


  • 2019

Citation


  • Kelarev, A., Yi, X., Badsha, S., Yang, X., Rylands, L. & Seberry, J. (2019). A multistage protocol for aggregated queries in distributed cloud databases with privacy protection. Future Generation Computer Systems, 90 368-380.

Scopus Eid


  • 2-s2.0-85051959100

Number Of Pages


  • 12

Start Page


  • 368

End Page


  • 380

Volume


  • 90

Place Of Publication


  • Netherlands