

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Unequal probability sampling with replacement, specifically the hansen-hurwitz estimator and probability proportional to size (pps) sampling in cluster sampling. How to calculate the estimators and their respective variances. References to original research are provided.
Typology: Study notes
1 / 2
This page cannot be seen from the preview
Don't miss anything!


Introduction to unequal-probability sampling; PPS sampling with replacement
Unequal probability sampling with replacement: As we saw in the jobs example, there are situations in which it is desirable to have un- equal probabilities of selecting elements into the sample. In section 3.3 of the text, it is stated that for a population with elements {u 1 , u 2 , ..., uN }, we might choose to sample elements with replacement with respective selec- tion probabilities of {δ 1 , δ 2 , ..., δN }. In that situation the estimator
̂ τ =
n
∑^ n
i=
yi δi
is an unbiased estimator for τ, and an unbiased estimator of V (τ̂ ) is given by:
V̂ (̂τ ) =^1 n
[ (^) n ∑
i=
yi δi
− τ̂
/ (n − 1)
n
[ (^) n ∑
i=
(̂τi − ̂τ )^2 / (n − 1)
Note that this variance estimator is of the form s^2 /n, and is unbiased for V (̂τ ) because the with replacement sampling scheme yields a sample of n separate independent estimates of τ , namely ̂τi = yi/δi. This estima- tor and its’ variance estimator are due to Hansen and Hurwitz (1943), and thus the estimator is called the Hansen-Hurwitz estimator. Sampling with replacement yields estimators whose theoretical properties are easy to understand, but sampling with replacement is often inefficient and can be impractical in some situations. Later we will discuss general approaches to unequal-probability sampling that have sampling without replacement, using estimators developed by Horvitz and Thompson (1952). PPS sampling with replacement: In cluster sampling, it is often use- ful to use unequal-probability sampling with replacement with probabilities proportional to size (PPS). In this case, δi = mi/M , and our estimator of the total is:
̂ τpps =
n
∑^ n
i=
yi δi
n
∑^ n
i=
yi (mi/M )
n
∑^ n
i=
yi mi
n
∑^ n
i=
yi,
where yi = yi/mi is the average of the observations in cluster i. To obtain an estimator of the mean we can just divide by M , giving
̂μpps =
n
∑^ n
i=
yi.
The estimated variances of these two estimators are then:
V̂ (̂τpps) =^1 n
[ (^) n ∑
i=
(̂τi − ̂τpps)^2 / (n − 1)
n
[ (^) n ∑
i=
(yi − μ̂ pps)^2 / (n − 1)
and
V̂ (μ̂ pps) =^1 n
[ (^) n ∑
i=
(̂μi − ̂μpps)^2 / (n − 1)
n
[ (^) n ∑
i=
(yi − μ̂ pps)^2 / (n − 1)
See the examples from lecture and in the SAS code on the web. References: Hansen, M.H. and Hurwitz, W.N. (1943) On the theory of sampling from a finite population. Annals of Mathematical Statistics 14: 333-362. Horvitz, D.G. and Thompson, D.J. (1952) A generalization of sampling without replacement from a finite universe. Journal of the American Statis- tical Association 47: 663-685.