













































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Survey Sampling Techniques course is one of important courses in Statisitics. Major poiuts of this course are: probability sampling, confidence intervals, Two-stage cluster sampling, Two-stage cluster sampling, estimation for mean, choosing strata, allocation across strata, ratio estimation, domain estimation, Two-stage cluster sampling. Keywords in these slides are: Survery Error, Target Population, Sampling Frame, Probability Sample, Population Parameters, Histogram, Population Distribution o
Typology: Slides
1 / 53
This page cannot be seen from the preview
Don't miss anything!














































Target population
Sampling frame - Selecting a probability sample
Characteristics of the target population
The average school loan debt owed by currently enrolled
students
The total surface area of county parks in the
The fraction of Des Moines households that fall below the poverty line
Data value for a target population element
data value is y i^
the value of characteristic for element i
Examples - Element = county in US y i^ = - Element = student enrolled at ISU y i^ = - Element = Des Moines household y i^ =
Variable of interest
There is one value of y for each of the
elements in the target population There may be fewer than
unique values
So y has a
distribution
Histogram for y i^
number of courses a 2003 Stat
student i was registered for
Histogram 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 Number of courses y c(y) Number of students Number of courses y Number of students c(y) 1 0 2 2 3 7 4 6 5 3 6 6
Discrete probability distribution of y
Horizontal axis = all UNIQUE values of y - Vertical axis = RELATIVE frequency of elements with a specific value of y , called P( y ) - Like a histogram with the frequencies (vertical axis) divided by
Usually start by making a table of
Unique values of the variable , y - Relative frequency of the unique values of y , P( y )
Total for y
courses for all students)
Mean of y
courses per student)
The population distribution of y is what we are trying to describe when we draw a sample, collect data, and calculate an estimate for a summary parameter
The population distribution of y is
No matter what sample design we choose - Regardless of the sample we draw from a given design - We do not assume a parametric distribution - Forget normal distributions assumed in other classes (for now)
Assume we have selected a probability sample from a frame
The next step is to collect data from each sampled element - This will lead to values for y i^ for each element in the sample - We will act like we only collect one characteristic from each sampled element, but usually, we are collecting dozens or even 100s of different characteristics from each element
Outcomes for a sample selected from a frame
Can not locate/contact a sampled unit (e.g., household)
Recall that the frame (and thus sample) may contain units that do not belong in the target population
In this case, we need to “screen” the unit to determine if the unit is eligible to be included in the survey - Eligible means that the unit belongs to the target population - In practice, there are cases where we can not determine if a unit is eligible or not to participate in study
Could have been chosen in a sample, and
Would have provided data during the response process if sampled
Were available during interview period,
Were willing to be interviewed, and - Were physically/mentally capable of providing responses