








































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An overview of estimation problems, focusing on classical methods of statistical inference. It covers point estimates, confidence intervals, single sample estimations, and the estimation of differences between two means. Examples for calculating confidence intervals for population means, prediction intervals, and tolerance limits. It is designed to help students understand how to draw conclusions about population parameters from experimental data, using concepts such as the central limit theorem and sampling distributions. The material is presented with practical examples, such as calculating zinc concentration in a river and reaction times in psychological experiments, to illustrate the application of statistical methods in real-world scenarios. The document also addresses the determination of sample sizes required for specific levels of confidence and precision in estimations.
Typology: Slides
1 / 48
This page cannot be seen from the preview
Don't miss anything!









































EM 7: Engineering Data Analysis
Second Semester, 2019-
Pamantasan ng Lungsod ng Valenzuela
the sample mean and variances. This is to
build a foundation that allows us to draw
conclusions about the population parameters
from experimental data.
about the distribution of the sample mean ๐
and the population mean ๐. Similar comments
apply to ๐
เฌถ
and ๐
เฌถ
(chi-squared distributions).
2
major areas: estimation and tests of
hypotheses.
sampling distribution of a proportion.
need to estimate a parameter just need to try to
arrive at a correct decision about a pre-stated
hypothesis.
4
5
population parameter without error, but we
certainly hope that the estimate is not far off.
to estimate ๐
exactly, it is possible to obtain a closer
estimate of ๐ by using the sample median ๐
as
an estimator. However, not knowing the true
value of ๐, we must decide in advance whether
to use ๐
or ๐
as the estimator.
7
Let ฮ
be an estimator whose value ๐
is a point
estimate of some unknown population
parameter ๐. Certainly, we would like the
sampling distribution of ฮ
to have a mean equal
to the parameter estimated. An estimator
possessing this property is said to be unbiased.
8
DEFINITION: A statistic ฮ เทก is said to be an unbiased estimator of the
parameter ๐ if
๐ เฎ เทก = ๐ธ ฮ เทก = ๐
10
Sampling distributions of different estimators of ๐.
Even the most efficient unbiased estimator is
unlikely to estimate the population parameter
exactly. In situations in which it is preferable to
determine an interval within which we would
expect to find the value of the parameter, the
interval estimate is used.
11
DEFINITION: An interval estimate of the population parameter ๐ is an
interval of the form,
๐
แ เฏ < ๐ < ๐
แ เฏ
where ๐ แ เฏ
and ๐ แ เฏ
are bounds of the interval estimate that depends on the
value of the statistic ฮ เทก for a particular sample and also on the sampling
distribution of ฮ
เทก .
The interval ฮ
เฏ
เฏ
is called a 100
๐ผ)% confidence interval, the fraction 1 โ ๐ผ is
called the confidence coefficient or the degree
of confidence. The endpoints ฮ
เฏ
and ฮ
เฏ
are
called the lower and upper confidence limits,
respectively.
13
When ๐ผ = 0.05, we have a 95% confidence
interval. When ๐ผ = 0.01, we have a wider 99%
confidence interval. The wider the confidence
interval is, the more confident we are that the
interval contains the unknown parameter.
Ideally, we prefer a short interval with a high
degree of confidence. Sometimes, restrictions
on the size of our sample prevent us from
achieving short intervals without sacrificing some
degree of confidence.
14
The sampling distribution of ๐
is centered at ๐,
and in most applications the variance is smaller
than that of any other estimators of ๐. Thus, the
sample mean ๐ฅฬ will be used as a point estimate
for the population mean ๐.
Considering the interval estimate of ๐. If our
sample is selected from a normal population or,
failing this, if ๐ is sufficiently large, we can
establish a confidence interval for ๐ by
considering the sampling distribution of ๐
16
According to CLT, we can expect the sampling
distribution of ๐
to be approximately normally
distributed with mean ๐ เฏ
เดค = ๐ and standard
deviation ๐ เฏ
เดค
17
๐ โ๐ง เฐ/เฌถ <
๐ เดค โ ๐
๐/โ๐
< ๐ง เฐ/เฌถ = 1 โ ๐ผ
The average zinc concentration recovered from
a sample of measurements taken in 36 different
locations in a river is found to be 2.6 grams per
milliliter. Find the 95% and 99% confidence
intervals for the mean zinc concentration in the
river. Assume that the population standard
deviation is 0.3 gram per milliliter.
19
Point estimate of mu is x-bar = 2.60.
z-value with an area of 0.025 to the right (0.975 to the left) is z0.025 = 1.96 (z-table)
Hence, 95% CI is 2.6 โ 1.96(0.3)/sqrt(36) < mu < 2.6 + 1.96(0.3)/sqrt(36)
which reduces to 2.50 < mu < 2.
For 99% CI, z0.005 = 2.575, hence 2.47 < mu < 2.
When solving for the sample size n, we round all
fractional values up to the next whole number.
This is to be sure that the degree of confidence
never falls below 100 1 โ ๐ผ %.
20
THEOREM: If ๐ฅฬ is used as an estimate of ๐, we can be 100 1 โ ๐ผ %
confident that the error will not exceed ๐ง เฐ/เฌถ
เฐ
เฏก
.
THEOREM: If ๐ฅฬ is used as an estimate of ๐, we can be 100 1 โ ๐ผ %
confident that the error will not exceed a specified amount ๐ when the
sample size is,
๐ =
๐ง เฐ/เฌถ
๐
๐
เฌถ