Two Stage Cluster Sampling, Study notes of Survey Sampling Techniques

Material Type: Notes; Class: Sample Survey Methods; Subject: Statistics; University: University of Idaho; Term: Unknown 1989;

Typology: Study notes

Pre 2010

Uploaded on 08/19/2009

koofers-user-m7j
koofers-user-m7j 🇺🇸

10 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Two-stage cluster sampling; unbiased estimation of a population
mean and total
Two-stage cluster sampling: In this chapter we examine two-stage
cluster sampling, a very simple kind of multi-stage sampling. In real-life
surveys, it is common to have several levels of sampling, including use of
stratification and ratio estimators. Here we will consider two-stage cluster
sampling where a simple random sample of clusters is taken, then a simple
random sample of elements in the selected clusters is taken. One example
would be to estimate the proportion of U.S. college students who like Korean
food by taking a SRS of U.S. colleges, then taking a SRS of students in each
selected college.
Notation: The extra stage of sampling here gives us notation that is
slightly changed from that used for single-stage cluster sampling:
N= the number of clusters in the population
n= the number of clusters selected in a SRS
Mi= the number of elements in cluster i
mi= the number of elements selected in a SRS from cluster i
M=
N
P
i=1
Mi= the number of elements in the population
M/N = the average cluster size for the population
yij = the jth observation in the sample from the ith cluster
yi=1
mi
mi
P
j=1
yij = the sample mean for the ith cluster
Unbiased estimation of the population mean µ:In single-stage
cluster sampling the unbiased estimator of τwas N
n
n
P
i=1
yi, where the yiterm
was the total of observations in the ith cluster. We are now sampling from
each cluster, so we do not know these totals. However, we can estimate
the cluster total by multiplying the cluster average (yi) by the number of
elements in the cluster (Mi). We can divide by Mto estimate µ. Then we
have:
N
Mn
n
X
i=1
yiis estimated by N
Mn
n
X
i=1
Miyi, so bµ=N
Mn
n
X
i=1
Miyi=1
M
n
P
i=1
Miyi
n,
and the estimated variance is:
1
pf2

Partial preview of the text

Download Two Stage Cluster Sampling and more Study notes Survey Sampling Techniques in PDF only on Docsity!

Two-stage cluster sampling; unbiased estimation of a population mean and total

Two-stage cluster sampling: In this chapter we examine two-stage cluster sampling, a very simple kind of multi-stage sampling. In real-life surveys, it is common to have several levels of sampling, including use of stratification and ratio estimators. Here we will consider two-stage cluster sampling where a simple random sample of clusters is taken, then a simple random sample of elements in the selected clusters is taken. One example would be to estimate the proportion of U.S. college students who like Korean food by taking a SRS of U.S. colleges, then taking a SRS of students in each selected college. Notation: The extra stage of sampling here gives us notation that is slightly changed from that used for single-stage cluster sampling: N = the number of clusters in the population n = the number of clusters selected in a SRS Mi = the number of elements in cluster i mi = the number of elements selected in a SRS from cluster i M =

∑N

i=

Mi = the number of elements in the population

M /N = the average cluster size for the population yij = the jth observation in the sample from the ith cluster yi = (^) m^1 i

∑^ mi j=

yij = the sample mean for the ith cluster

Unbiased estimation of the population mean μ: In single-stage

cluster sampling the unbiased estimator of τ was Nn

∑n i=

yi, where the yi term

was the total of observations in the ith cluster. We are now sampling from each cluster, so we do not know these totals. However, we can estimate the cluster total by multiplying the cluster average (yi) by the number of elements in the cluster (Mi). We can divide by M to estimate μ. Then we have:

N

M n

∑^ n

i=

yi is estimated by

N

M n

∑^ n

i=

Miyi, so μ̂ =

N

M n

∑^ n

i=

Miyi =

M

∑^ n i=

Miyi

n

and the estimated variance is:

V̂ (μ̂ ) =

N − n N

M

2

s^2 b n

nN M

2

∑^ n

i=

M (^) i^2

Mi − mi Mi

s^2 i mi

where

s^2 b =

∑^ n i=

(Miyi − M ̂μ)^2

n − 1

and s^2 i =

∑^ mi j=

(yij − yi)^2

mi − 1

In the variance estimator, s^2 b measures variation between clusters, and s^2 i measures variation within cluster i. We can obtain an unbiased estimator of τ and a variance estimator by multiplying the above expressions by M :

̂ τ =

N

n

∑^ n

i=

Miyi = N

∑^ n i=

Miyi

n

with variance estimator:

V̂ (̂τ ) =

N − n N

N 2

s^2 b n

N

n

∑^ n

i=

M (^) i^2

Mi − mi Mi

s^2 i mi

where s^2 b and s^2 i are defined as above. See the text and SAS code on the web for example calculations.