










































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An overview of sampling methods, chance error, bias, and the estimation of population percentages using simple random sampling. It covers the concepts of expected value, standard error, and the normal distribution for percentages, as well as the differences between drawing with and without replacement.
Typology: Study notes
1 / 50
This page cannot be seen from the preview
Don't miss anything!











































Lecture 25:
Recall: sampling โ It's usually impractical to survey a whole population, so we take a sample instead โ A well-chosen sample of a few thousand can give a accurate picture of the whole US population
Recall: bias โ Selection bias means the method for selecting the sample is more likely to select some members of the population than others in a way that's not anticipated โ Non-response bias occurs because those who don't respond differ from those who do
Recall: sampling methods โ Quota sampling lets interviewers select subjects according to some quotas; strong risk of bias โ Simple random sampling selects members of a population with equal probability, and is unbiased โ In practice, multistage cluster sampling is often employed
A chance model for simple random sampling
The problem โ We wish to estimate the percentage of a certain population with a certain characteristic e.g. the percentage of registered American voters who would vote for Obama
Estimating parameters from statistics โ For a simple random sample, the sample percentage is our estimate of the population percentage โ For a more complex sampling scheme, we may need to adjust our sample percentage for possible biases
The box model for a simple random sample โ Need one ticket for each member of the population โ Write a โ1โ on the ticket if they have the characteristic; write a โ0โ if they don't โ Put the tickets in a box, shuffle, and draw n tickets without replacement
Box SD โ The SD of the box of zeroes and ones, according to the two-valued box SD formula, is Box SD = sqrt(prop. of zeroes times prop. of ones)
Standard error of the sum of draws โ If the population size is huge, there will be no real difference between drawing with and without replacement โ SE of sum of draws (with replacement) = sqrt( n )*(box SD) = sqrt( n * prop. ones * prop. zeros)
Example โ Suppose the true percentage of voters who support Obama is 50%. โ The expected percentage of a sample who support Obama is 50% (no matter what the sample size is).
Example โ If n =100, the standard error of the sample percentage is sqrt(0.50.5/100)100% = 5% โ If n =1000, SE = 1.6% โ If n =10000, SE = 0.5% The SE of a percentage goes down like the square root of n. (The SE of a sum goes up like sqrt( n ).)
Sample percentages and the normal distribution
The distribution of sample percentages โ The percentage of 1's among draws from a box of 0's and 1's is likely to be around the expected value (of the percentage), give or take its SE โ SE of percentage depends on percentage of 1's, number of draws, and whether they're with or without replacement