Skip to main content
placeholder image

A new algorithm for protecting aggregate business microdata via a remote system

Journal Article


Download full-text (Open Access)

Abstract


  • Releasing business microdata is a challenging problem for many statistical agencies. Businesses with distinct continuous characteristics such as extremely high income could easily be identified while these businesses are normally included in surveys representing the population. In order to provide data users with useful statistics while maintaining confidentiality, some statistical agencies have developed online based tools to allow users to specify and request tables created from microdata. These tools only release perturbed cell values generated from automatic output perturbation algorithms in order to protect each underlying observation against various attacks, such as differencing attacks. An example of the perturbation algorithms has been proposed by Thompson et al. (2013). The algorithm focuses largely on reducing disclosure risks without addressing much on data utility. As a result, the algorithm has limitations, including a limited scope of applicable cells and uncontrolled utility loss. In this paper we introduce a new algorithm for generating perturbed cell values. As a comparison, The new algorithm allows more control over utility loss, while it could also achieve better utility-disclosure tradeoffs in many cases, and is conjectured to be applicable to a wider scope of cells.

Publication Date


  • 2016

Citation


  • Ma, Y., Lin, Y., Chipperfield, J., Newman, J. & Leaver, V. (2016). A new algorithm for protecting aggregate business microdata via a remote system. Lecture Notes in Computer Science, 9867 210-221. Dubrovnik, Croatia Proceedings of the International Conference on Privacy in Statistical Databases, PSD 2016

Scopus Eid


  • 2-s2.0-84988037045

Ro Full-text Url


  • http://ro.uow.edu.au/cgi/viewcontent.cgi?article=7124&context=eispapers

Ro Metadata Url


  • http://ro.uow.edu.au/eispapers/6094

Number Of Pages


  • 11

Start Page


  • 210

End Page


  • 221

Volume


  • 9867

Abstract


  • Releasing business microdata is a challenging problem for many statistical agencies. Businesses with distinct continuous characteristics such as extremely high income could easily be identified while these businesses are normally included in surveys representing the population. In order to provide data users with useful statistics while maintaining confidentiality, some statistical agencies have developed online based tools to allow users to specify and request tables created from microdata. These tools only release perturbed cell values generated from automatic output perturbation algorithms in order to protect each underlying observation against various attacks, such as differencing attacks. An example of the perturbation algorithms has been proposed by Thompson et al. (2013). The algorithm focuses largely on reducing disclosure risks without addressing much on data utility. As a result, the algorithm has limitations, including a limited scope of applicable cells and uncontrolled utility loss. In this paper we introduce a new algorithm for generating perturbed cell values. As a comparison, The new algorithm allows more control over utility loss, while it could also achieve better utility-disclosure tradeoffs in many cases, and is conjectured to be applicable to a wider scope of cells.

Publication Date


  • 2016

Citation


  • Ma, Y., Lin, Y., Chipperfield, J., Newman, J. & Leaver, V. (2016). A new algorithm for protecting aggregate business microdata via a remote system. Lecture Notes in Computer Science, 9867 210-221. Dubrovnik, Croatia Proceedings of the International Conference on Privacy in Statistical Databases, PSD 2016

Scopus Eid


  • 2-s2.0-84988037045

Ro Full-text Url


  • http://ro.uow.edu.au/cgi/viewcontent.cgi?article=7124&context=eispapers

Ro Metadata Url


  • http://ro.uow.edu.au/eispapers/6094

Number Of Pages


  • 11

Start Page


  • 210

End Page


  • 221

Volume


  • 9867