Unequal Probability Sampling: Hansen-Hurwitz Estimator and PPS Sampling - Prof. Christophe, Study notes of Survey Sampling Techniques

Unequal probability sampling with replacement, specifically the hansen-hurwitz estimator and probability proportional to size (pps) sampling in cluster sampling. How to calculate the estimators and their respective variances. References to original research are provided.

Typology: Study notes

Pre 2010

Uploaded on 08/19/2009

koofers-user-xy6-1
koofers-user-xy6-1 🇺🇸

10 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Introduction to unequal-probability sampling; PPS sampling with
replacement
Unequal probability sampling with replacement: As we saw in
the jobs example, there are situations in which it is desirable to have un-
equal probabilities of selecting elements into the sample. In section 3.3 of
the text, it is stated that for a population with elements {u1, u2, ..., uN}, we
might choose to sample elements with replacement with respective selec-
tion probabilities of {δ1, δ2, ..., δN}.In that situation the estimator
bτ=1
n
n
X
i=1
yi
δi
is an unbiased estimator for τ, and an unbiased estimator of V(bτ) is given
by:
b
V(bτ) = 1
n"n
X
i=1 µyi
δi
bτ2
/(n1)#=1
n"n
X
i=1
(bτibτ)2/(n1)#.
Note that this variance estimator is of the form s2/n, and is unbiased
for V(bτ) because the with replacement sampling scheme yields a sample of
nseparate independent estimates of τ, namely bτi=yii. This estima-
tor and its’ variance estimator are due to Hansen and Hurwitz (1943), and
thus the estimator is called the Hansen-Hurwitz estimator. Sampling
with replacement yields estimators whose theoretical properties are easy to
understand, but sampling with replacement is often inefficient and can be
impractical in some situations. Later we will discuss general approaches to
unequal-probability sampling that have sampling without replacement, using
estimators developed by Horvitz and Thompson (1952).
PPS sampling with replacement: In cluster sampling, it is often use-
ful to use unequal-probability sampling with replacement with probabilities
proportional to size (PPS). In this case, δi=mi/M, and our estimator of
the total is:
bτpps =1
n
n
X
i=1
yi
δi
=1
n
n
X
i=1
yi
(mi/M)=M
n
n
X
i=1
yi
mi
=M
n
n
X
i=1
yi,
where yi=yi/miis the average of the observations in cluster i. To
obtain an estimator of the mean we can just divide by M, giving
1
pf2

Partial preview of the text

Download Unequal Probability Sampling: Hansen-Hurwitz Estimator and PPS Sampling - Prof. Christophe and more Study notes Survey Sampling Techniques in PDF only on Docsity!

Introduction to unequal-probability sampling; PPS sampling with replacement

Unequal probability sampling with replacement: As we saw in the jobs example, there are situations in which it is desirable to have un- equal probabilities of selecting elements into the sample. In section 3.3 of the text, it is stated that for a population with elements {u 1 , u 2 , ..., uN }, we might choose to sample elements with replacement with respective selec- tion probabilities of {δ 1 , δ 2 , ..., δN }. In that situation the estimator

̂ τ =

n

∑^ n

i=

yi δi

is an unbiased estimator for τ, and an unbiased estimator of V (τ̂ ) is given by:

V̂ (̂τ ) =^1 n

[ (^) n ∑

i=

yi δi

− τ̂

/ (n − 1)

]

n

[ (^) n ∑

i=

(̂τi − ̂τ )^2 / (n − 1)

]

Note that this variance estimator is of the form s^2 /n, and is unbiased for V (̂τ ) because the with replacement sampling scheme yields a sample of n separate independent estimates of τ , namely ̂τi = yi/δi. This estima- tor and its’ variance estimator are due to Hansen and Hurwitz (1943), and thus the estimator is called the Hansen-Hurwitz estimator. Sampling with replacement yields estimators whose theoretical properties are easy to understand, but sampling with replacement is often inefficient and can be impractical in some situations. Later we will discuss general approaches to unequal-probability sampling that have sampling without replacement, using estimators developed by Horvitz and Thompson (1952). PPS sampling with replacement: In cluster sampling, it is often use- ful to use unequal-probability sampling with replacement with probabilities proportional to size (PPS). In this case, δi = mi/M , and our estimator of the total is:

̂ τpps =

n

∑^ n

i=

yi δi

n

∑^ n

i=

yi (mi/M )

M

n

∑^ n

i=

yi mi

M

n

∑^ n

i=

yi,

where yi = yi/mi is the average of the observations in cluster i. To obtain an estimator of the mean we can just divide by M , giving

̂μpps =

n

∑^ n

i=

yi.

The estimated variances of these two estimators are then:

V̂ (̂τpps) =^1 n

[ (^) n ∑

i=

(̂τi − ̂τpps)^2 / (n − 1)

]

M 2

n

[ (^) n ∑

i=

(yi − μ̂ pps)^2 / (n − 1)

]

and

V̂ (μ̂ pps) =^1 n

[ (^) n ∑

i=

(̂μi − ̂μpps)^2 / (n − 1)

]

n

[ (^) n ∑

i=

(yi − μ̂ pps)^2 / (n − 1)

]

See the examples from lecture and in the SAS code on the web. References: Hansen, M.H. and Hurwitz, W.N. (1943) On the theory of sampling from a finite population. Annals of Mathematical Statistics 14: 333-362. Horvitz, D.G. and Thompson, D.J. (1952) A generalization of sampling without replacement from a finite universe. Journal of the American Statis- tical Association 47: 663-685.