Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Approximating Sampling Distributions & Confidence Intervals: Understanding p & Properties , Study notes of Data Analysis & Statistical Methods

How to approximate a sampling distribution for p, the proportion of successes in a random sample from a population, using the central limit theorem. It covers the mean and standard deviation of p's distribution, the normal distribution assumption, and the calculation of probabilities and confidence intervals using standard errors and z-scores.

Typology: Study notes

2010/2011

Uploaded on 11/15/2011

mardiguian26149
mardiguian26149 🇺🇸

4.4

(18)

687 documents

1 / 3

Toggle sidebar

Related documents


Partial preview of the text

Download Approximating Sampling Distributions & Confidence Intervals: Understanding p & Properties and more Study notes Data Analysis & Statistical Methods in PDF only on Docsity! 16 September How to approximate a sampling distribution for p Decide on a sample size n Randomly select a sample of size n from the population Compare the proportion of successes, p Repeat steps 2 and 3 ≥ 1000 times. Plot a histogram of the p values to see what the sampling distribution looks like. Properties for the distribution of p Let p be the proportion of successes in a random sample of size n from a population whose proportion of successes is π. Denote the mean value of p’s distribution (thus, of p) by μp, and the standard deviation of p by σp. We know μp = π σp = √ π (1−π )n If it is large and π is not extreme (i.e., π is not close to 0 or 1), then the distribution of p is approximately normal. Rule: Normal if both πn ≥ 10 and (1-π)n ≥ 10 Probabilities for p Say we want to find P(p < .34) We can find this since P~N(μp, σ2p) when rule holds. So we standardize p using a Z-score where Z = p−μpp σpp = p−π √ π (1−π )n And use standard normal tables to solve. Standard error The standard error is an estimate of the standard deviation. The standard error for p, denoted SE(p) is SE(p) = √ p(1−p)n Why do we use the standard curve? The estimates we get from the samples are incorrect. We can only hope that the estimates are close to the true values. Standard errors help us determine a range of possible values that may contain the true value. These ranges are called confidence intervals. Estimation using a single sample Point estimates A point estimate of a population characteristic is a single number that is based on sample data and represents a plausible value of the population characteristic. i.e. p is a point estimate for π Standard error is a point estimate for standard deviation x is a point estimate for μ There can be different point estimates for one population characteristic x and median can both estimate μ We want point estimates to be unbiased. Its mean value is equal to the population parameter. p is unbiased, since μp = π If we have multiple (or no) unbiased point estimates, we choose the point estimate with the smallest variance Interval Estimation Let Δ be a fixed, but unknown parameter value Let δ be the point estimate for Δ