



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The concept of sampling distributions, focusing on the mean, proportion, and total in statistics. It covers the terms 'sampling distribution', 'law of large numbers', and 'central limit theorem'. The document also includes examples of calculating the mean and standard deviation of the sampling distribution for the sample mean and sample proportion, as well as finding probabilities using the central limit theorem.
Typology: Study Guides, Projects, Research
1 / 5
This page cannot be seen from the preview
Don't miss anything!




data distribution the distribution of a variable in a sample population distribution the probability distribution of a single observation of a variable sampling distribution the probability distribution of a statistic
sampling distribution : a probability distribution of a statistic; it is a distribution of all possible samples (random samples) from a population and how often each outcome occurs in repeated sampling (of the same size n). Given simple random samples of size n from a given population with a measured characteristic such as mean X, proportion (πˆ)^1 , or standard deviation (s) for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. It is the distribution of all possible samples (outcomes) of that statistic. Use of a statistic to estimate the parameter is the main function of inferential statistics as it provides the properties of the statistic.
law of large numbers states that as the number of repetitions of an experiment is increased, the relative frequency obtained in the experiment tends to become ever closer to the theoretical probability. Even though the outcomes do not happen according to any set pattern or order (overall), the long-term observed relative frequency will approach the theoretical probability (^1) π and ˆ π are NOT 3.14159... , they are being used like μ and the other Greek letters we are using for notation.
Definition The sampling distribution of the sample mean is approximately normal with mean μ X and standard deviation (of the sampling distribution of the sample mean) se = σ √ Xn , provided n is sufficiently large.
If we take n observations of a quantitative variable and then compute the mean (x¯) of those observations in the sample, then x¯ is the sample mean statistic. Assumptions: Each observation x has the same probability distribution with mean μ and standard deviation σ, and the observations are independent.
(1) The mean of the sampling distribution is μ
(2) The standard deviation of the sampling distribution is se = √ σn
(3) The shape of the sampling distribution becomes more like a normal distribution as n increases
X ∼ N (μ, se mean )
Standard error of the mean: σ X = se mean =
σ √ n
z = X − μ se mean
Sample sizes should be n ≥ 30 for the sample mean If a distribution is already inherently normal, the sample size stipulation can be ignored.
The linked file shows how taking multiple random samples of the same size from the same population will produce a normal distribution of the sample means. The examples show a normal distribution, exponential distribution, and a binomial distribution. CLT simulation
for sample mean (X) and total (τˆ ) The level of a particular pollutant, nitrogen dioxide (N O 2 ), in the exhaust of a hypothetical model of car, that when driven in city traffic, has a mean level of 2.1 grams per mile (g/m) and a standard deviation of 0. g/m. Suppose a company has a fleet of 35 of these cars.
(a) What is the mean and standard deviation of the sampling distribution of the sample mean?
mean: μ X = μ = 2. 1 and se mean = √ σn = √^0_._ 353 = 0. 0507
X ∼ N (μ, se mean ) = X ∼ N (2. 1 , 0 .0507)
(b) find the probability that the mean N O 2 level is less than 2.03 g/m
(c) Mandates by the EPA state that the average of the fleet of these cars cannot exceed 2.2 g/m, find the probability that the fleet N O 2 levels from their fleet exceed the EPA mandate
(d) At most, 25% of these cars exceed what mean N O 2 value?
Find the z score that represents the top 25%, which is the same as the bottom 75% (is also Q 3 ) and what is needed to find z 0_._ 75 = 0. 67449. Next use z = (^) seXmean − μ and solve for X: X = z(se mean ) + μ
(e) what is the mean and standard deviation of the total amount (sum), in g/m, of N O 2 in the exhaust for the fleet?
τ = nμ = 35(2.1) = 73. 5
se sum =
nσ =
τˆ ∼ N (τ, se sum ) = ˆτ ∼ N (73. 5 , 1 .7748)
(f) find the probability that the total amount of N O 2 for the fleet is between 70 and 75 g/m
P (70 < τ <ˆ 75) = P
Mars company claims that 10% of the M&M’s it produces are green. Suppose that candies are packaged at random in bags that contain 60 candies. (a) Describe the sampling distribution of the sample proportion (what should the distribution look like?); calculate the mean proportion and standard deviation of the sampling distribution of the sample proportion of green M&M’s in bags that contain 60 candies (calculate π and se). (b) What is the probability that a bag of 60 candies will have more than 13% green M&M’s?
(a) Describe the sampling distribution of the sample proportion; calculate the mean proportion and standard deviation of the sampling distribution of the sample proportion of green M&M’s in bags that contain 60 candies. The distribution of the sample proportion will be approximately normal since n ≥ 60. The mean proportion π = 0. 1 and the standard error is
π (1− π ) n =
(0_._ 1)(1− 0_._ 1) 60 = 0.^0387 (the standard deviation of the sampling distribution of the sample proportion). Thus
ˆπ ∼ N (0. 1 , 0 .0387)
(b) What is the probability that a bag of 60 candies will have more than 13% green M&M’s?
P (ˆπ > 0 .13) = P