Skip to main content
placeholder image

Bayesian weighted inference from surveys

Journal Article


Abstract


  • Data from large surveys are often supplemented with sampling weights that are designed to reflect unequal probabilities of response and selection inherent in complex survey sampling methods. We propose two methods for Bayesian estimation of parametric models in a setting where the survey data and the weights are available, but where information on how the weights were constructed is unavailable. The first approach is to simply replace the likelihood with the pseudo likelihood in the formulation of Bayes theorem. This is proven to lead to a consistent estimator but also leads to credible intervals that suffer from systematic undercoverage. Our second approach involves using the weights to generate a representative sample which is integrated into a Markov chain Monte Carlo (MCMC) or other simulation algorithms designed to estimate the parameters of the model. In the extensive simulation studies, the latter methodology is shown to achieve performance comparable to the standard frequentist solution of pseudo maximum likelihood, with the added advantage of being applicable to models that require inference via MCMC. The methodology is demonstrated further by fitting a mixture of gamma densities to a sample of Australian household income.

Publication Date


  • 2020

Citation


  • Gunawan, D., Panagiotelis, A., Griffiths, W., & Chotikapanich, D. (2020). Bayesian weighted inference from surveys. Australian and New Zealand Journal of Statistics, 62(1), 71-94. doi:10.1111/anzs.12284

Scopus Eid


  • 2-s2.0-85083062951

Web Of Science Accession Number


Start Page


  • 71

End Page


  • 94

Volume


  • 62

Issue


  • 1

Abstract


  • Data from large surveys are often supplemented with sampling weights that are designed to reflect unequal probabilities of response and selection inherent in complex survey sampling methods. We propose two methods for Bayesian estimation of parametric models in a setting where the survey data and the weights are available, but where information on how the weights were constructed is unavailable. The first approach is to simply replace the likelihood with the pseudo likelihood in the formulation of Bayes theorem. This is proven to lead to a consistent estimator but also leads to credible intervals that suffer from systematic undercoverage. Our second approach involves using the weights to generate a representative sample which is integrated into a Markov chain Monte Carlo (MCMC) or other simulation algorithms designed to estimate the parameters of the model. In the extensive simulation studies, the latter methodology is shown to achieve performance comparable to the standard frequentist solution of pseudo maximum likelihood, with the added advantage of being applicable to models that require inference via MCMC. The methodology is demonstrated further by fitting a mixture of gamma densities to a sample of Australian household income.

Publication Date


  • 2020

Citation


  • Gunawan, D., Panagiotelis, A., Griffiths, W., & Chotikapanich, D. (2020). Bayesian weighted inference from surveys. Australian and New Zealand Journal of Statistics, 62(1), 71-94. doi:10.1111/anzs.12284

Scopus Eid


  • 2-s2.0-85083062951

Web Of Science Accession Number


Start Page


  • 71

End Page


  • 94

Volume


  • 62

Issue


  • 1