Estimation Problems: Point Estimates and Confidence Intervals, Slides of Data Analysis & Statistical Methods

An overview of estimation problems, focusing on classical methods of statistical inference. It covers point estimates, confidence intervals, single sample estimations, and the estimation of differences between two means. Examples for calculating confidence intervals for population means, prediction intervals, and tolerance limits. It is designed to help students understand how to draw conclusions about population parameters from experimental data, using concepts such as the central limit theorem and sampling distributions. The material is presented with practical examples, such as calculating zinc concentration in a river and reaction times in psychological experiments, to illustrate the application of statistical methods in real-world scenarios. The document also addresses the determination of sample sizes required for specific levels of confidence and precision in estimations.

Typology: Slides

2019/2020

Available from 06/04/2025

imwinter
imwinter ๐Ÿ‡ต๐Ÿ‡ญ

5

(1)

148 documents

1 / 48

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
8: Estimation
Problems
EM 7: Engineering Data Analysis
Second Semester, 2019-20
Pamantasan ng Lungsod ng Valenzuela
1
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e
pf1f
pf20
pf21
pf22
pf23
pf24
pf25
pf26
pf27
pf28
pf29
pf2a
pf2b
pf2c
pf2d
pf2e
pf2f
pf30

Partial preview of the text

Download Estimation Problems: Point Estimates and Confidence Intervals and more Slides Data Analysis & Statistical Methods in PDF only on Docsity!

8: Estimation

Problems

EM 7: Engineering Data Analysis

Second Semester, 2019-

Pamantasan ng Lungsod ng Valenzuela

Introduction

  • Previously, sampling properties are based on

the sample mean and variances. This is to

build a foundation that allows us to draw

conclusions about the population parameters

from experimental data.

  • For example, the CLT provides information

about the distribution of the sample mean ๐‘‹

and the population mean ๐œ‡. Similar comments

apply to ๐‘†

เฌถ

and ๐œŽ

เฌถ

(chi-squared distributions).

2

Introduction

  • Statistical inference may be divided into two

major areas: estimation and tests of

hypotheses.

  • Estimation requires to establish a knowledge of the

sampling distribution of a proportion.

  • Test of hypotheses do not need such, there is no

need to estimate a parameter just need to try to

arrive at a correct decision about a pre-stated

hypothesis.

4

Classical Methods of

Estimation

5

Classical Methods of

Estimation

  • An estimator is not expected to estimate the

population parameter without error, but we

certainly hope that the estimate is not far off.

  • For example, we do not expect ๐‘‹

to estimate ๐œ‡

exactly, it is possible to obtain a closer

estimate of ๐œ‡ by using the sample median ๐‘‹

as

an estimator. However, not knowing the true

value of ๐œ‡, we must decide in advance whether

to use ๐‘‹

or ๐‘‹

as the estimator.

7

Unbiased Estimator

Let ฮ˜

be an estimator whose value ๐œƒ

is a point

estimate of some unknown population

parameter ๐œƒ. Certainly, we would like the

sampling distribution of ฮ˜

to have a mean equal

to the parameter estimated. An estimator

possessing this property is said to be unbiased.

8

DEFINITION: A statistic ฮ˜ เทก is said to be an unbiased estimator of the

parameter ๐œƒ if

๐œ‡ เฎ€ เทก = ๐ธ ฮ˜ เทก = ๐œƒ

10

Sampling distributions of different estimators of ๐œƒ.

Interval Estimation

Even the most efficient unbiased estimator is

unlikely to estimate the population parameter

exactly. In situations in which it is preferable to

determine an interval within which we would

expect to find the value of the parameter, the

interval estimate is used.

11

DEFINITION: An interval estimate of the population parameter ๐œƒ is an

interval of the form,

๐œƒ

แˆ˜ เฏ… < ๐œƒ < ๐œƒ

แˆ˜ เฏŽ

where ๐œƒ แˆ˜ เฏ…

and ๐œƒ แˆ˜ เฏŽ

are bounds of the interval estimate that depends on the

value of the statistic ฮ˜ เทก for a particular sample and also on the sampling

distribution of ฮ˜

เทก .

Interval Estimation

The interval ฮ˜

เฏ…

เฏŽ

is called a 100

๐›ผ)% confidence interval, the fraction 1 โˆ’ ๐›ผ is

called the confidence coefficient or the degree

of confidence. The endpoints ฮ˜

เฏ…

and ฮ˜

เฏŽ

are

called the lower and upper confidence limits,

respectively.

13

Interval Estimation

When ๐›ผ = 0.05, we have a 95% confidence

interval. When ๐›ผ = 0.01, we have a wider 99%

confidence interval. The wider the confidence

interval is, the more confident we are that the

interval contains the unknown parameter.

Ideally, we prefer a short interval with a high

degree of confidence. Sometimes, restrictions

on the size of our sample prevent us from

achieving short intervals without sacrificing some

degree of confidence.

14

Single Sample: Estimating the

Mean

The sampling distribution of ๐‘‹

is centered at ๐œ‡,

and in most applications the variance is smaller

than that of any other estimators of ๐œ‡. Thus, the

sample mean ๐‘ฅฬ… will be used as a point estimate

for the population mean ๐œ‡.

Considering the interval estimate of ๐œ‡. If our

sample is selected from a normal population or,

failing this, if ๐‘› is sufficiently large, we can

establish a confidence interval for ๐œ‡ by

considering the sampling distribution of ๐‘‹

16

Single Sample: Estimating the

Mean

According to CLT, we can expect the sampling

distribution of ๐‘‹

to be approximately normally

distributed with mean ๐œ‡ เฏ‘

เดค = ๐œ‡ and standard

deviation ๐œŽ เฏ‘

เดค

17

๐‘ƒ โˆ’๐‘ง เฐˆ/เฌถ <

๐‘‹ เดค โˆ’ ๐œ‡

๐œŽ/โˆš๐‘›

< ๐‘ง เฐˆ/เฌถ = 1 โˆ’ ๐›ผ

Example

The average zinc concentration recovered from

a sample of measurements taken in 36 different

locations in a river is found to be 2.6 grams per

milliliter. Find the 95% and 99% confidence

intervals for the mean zinc concentration in the

river. Assume that the population standard

deviation is 0.3 gram per milliliter.

19

Point estimate of mu is x-bar = 2.60.

z-value with an area of 0.025 to the right (0.975 to the left) is z0.025 = 1.96 (z-table)

Hence, 95% CI is 2.6 โ€“ 1.96(0.3)/sqrt(36) < mu < 2.6 + 1.96(0.3)/sqrt(36)

which reduces to 2.50 < mu < 2.

For 99% CI, z0.005 = 2.575, hence 2.47 < mu < 2.

Single Sample: Estimating the

Mean

When solving for the sample size n, we round all

fractional values up to the next whole number.

This is to be sure that the degree of confidence

never falls below 100 1 โˆ’ ๐›ผ %.

20

THEOREM: If ๐‘ฅฬ… is used as an estimate of ๐œ‡, we can be 100 1 โˆ’ ๐›ผ %

confident that the error will not exceed ๐‘ง เฐˆ/เฌถ

เฐ™

เฏก

.

THEOREM: If ๐‘ฅฬ… is used as an estimate of ๐œ‡, we can be 100 1 โˆ’ ๐›ผ %

confident that the error will not exceed a specified amount ๐‘’ when the

sample size is,

๐‘› =

๐‘ง เฐˆ/เฌถ

๐œŽ

๐‘’

เฌถ