Sampling and Estimating Population Percentages: Chance Error, Bias, and Standard Errors, Study notes of Statistics

An overview of sampling methods, chance error, bias, and the estimation of population percentages using simple random sampling. It covers the concepts of expected value, standard error, and the normal distribution for percentages, as well as the differences between drawing with and without replacement.

Typology: Study notes

Pre 2010

Uploaded on 10/01/2009

koofers-user-uqa
koofers-user-uqa ๐Ÿ‡บ๐Ÿ‡ธ

9 documents

1 / 50

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
STAT 2
Lecture 25:
Chance errors in sampling
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e
pf1f
pf20
pf21
pf22
pf23
pf24
pf25
pf26
pf27
pf28
pf29
pf2a
pf2b
pf2c
pf2d
pf2e
pf2f
pf30
pf31
pf32

Partial preview of the text

Download Sampling and Estimating Population Percentages: Chance Error, Bias, and Standard Errors and more Study notes Statistics in PDF only on Docsity!

STAT 2

Lecture 25:

Chance errors in sampling

Recall: sampling โ— It's usually impractical to survey a whole population, so we take a sample instead โ— A well-chosen sample of a few thousand can give a accurate picture of the whole US population

Recall: bias โ— Selection bias means the method for selecting the sample is more likely to select some members of the population than others in a way that's not anticipated โ— Non-response bias occurs because those who don't respond differ from those who do

Recall: sampling methods โ— Quota sampling lets interviewers select subjects according to some quotas; strong risk of bias โ— Simple random sampling selects members of a population with equal probability, and is unbiased โ— In practice, multistage cluster sampling is often employed

I

A chance model for simple random sampling

The problem โ— We wish to estimate the percentage of a certain population with a certain characteristic e.g. the percentage of registered American voters who would vote for Obama

Estimating parameters from statistics โ— For a simple random sample, the sample percentage is our estimate of the population percentage โ— For a more complex sampling scheme, we may need to adjust our sample percentage for possible biases

The box model for a simple random sample โ— Need one ticket for each member of the population โ— Write a โ€œ1โ€ on the ticket if they have the characteristic; write a โ€œ0โ€ if they don't โ— Put the tickets in a box, shuffle, and draw n tickets without replacement

Box SD โ— The SD of the box of zeroes and ones, according to the two-valued box SD formula, is Box SD = sqrt(prop. of zeroes times prop. of ones)

Standard error of the sum of draws โ— If the population size is huge, there will be no real difference between drawing with and without replacement โ— SE of sum of draws (with replacement) = sqrt( n )*(box SD) = sqrt( n * prop. ones * prop. zeros)

Example โ— Suppose the true percentage of voters who support Obama is 50%. โ— The expected percentage of a sample who support Obama is 50% (no matter what the sample size is).

Example โ— If n =100, the standard error of the sample percentage is sqrt(0.50.5/100)100% = 5% โ— If n =1000, SE = 1.6% โ— If n =10000, SE = 0.5% The SE of a percentage goes down like the square root of n. (The SE of a sum goes up like sqrt( n ).)

II

Sample percentages and the normal distribution

The distribution of sample percentages โ— The percentage of 1's among draws from a box of 0's and 1's is likely to be around the expected value (of the percentage), give or take its SE โ— SE of percentage depends on percentage of 1's, number of draws, and whether they're with or without replacement