Bootstrapping Data – How to Bootstrap Data Using Sampling Weights

bootstrapresamplingweighted-sampling

Suppose you only have these information from a sample data: $X_i$ and $w_i$, $i=1,…,N$, where $w_i$'s are the respective sampling weights(not integers).

Is it possible to obtain a valid bootstrap estimate of say, variance of $X_i$?

I understand that I might have to readjust the sampling weights(for weighted data) for each bootstrap replicate to obtain a valid bootstrap estimate. But considering if I only have the above information, are there other ways to do this?

Best Answer

Short answer: No.

Marginally longer answer: A single vector of weights tells you nothing about:

  1. Stratification
  2. Clustering
  3. Calibration variables

A proper bootstrap scheme would involve the following (I can refer to my paper on this).

  1. Take a bootstrap sample (with replacement) of clusters within each stratum, independently across strata.
  2. Re-calibrate the sample to the same population totals that were used for the main weights.

Since you don't have strata and clusters, you cannot do 1. Since you don't have the calibration variables, you cannot do 2.